Nougat Loader
NougatPDFLoader Class Documentation
Overview
The NougatPDFLoader class loads academic documents from PDF files. It uses the Nougat model, developed by Meta, to convert academic papers from PDF format into text.
Usage
Run the Nougat API server
This loader requires a running Nougat API server. To run the Nougat model properly, the server needs CUDA installed. For detailed installation instructions, see the official Nougat GitHub repository.
Use Docker (Recommended)
First, clone the facebookresearch/nougat repository to your machine and change to its docker folder.
Then, build and run your Docker container by following the repository's instructions.
Use pip
First, install the API version of the nougat package using pip.
Then, start the API server from the command line.
Initialization
Once your Nougat API server is running, create an instance of the loader by providing two parameters: file_path and nougat_host.
file_path: This is a string representing the path to your PDF file.
nougat_host: This is a string representing the host address where your Nougat API server is running.
During initialization, the loader checks whether it can connect to the provided Nougat server host. If it cannot, it raises a ValueError.
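The constructor behaviour described above can be sketched with the standard library alone. This is a minimal illustration, not the library's implementation: the class name SketchNougatLoader, the timeout parameter, and the error message are assumptions made for the example.

```python
import urllib.error
import urllib.request


class SketchNougatLoader:
    """Hypothetical stand-in that mimics the documented constructor contract."""

    def __init__(self, file_path: str, nougat_host: str, timeout: float = 3.0):
        self.file_path = file_path
        self.nougat_host = nougat_host
        try:
            # Send one request to the host to see whether anything answers.
            urllib.request.urlopen(nougat_host, timeout=timeout).close()
        except urllib.error.HTTPError:
            # An HTTP error status still proves that a server is reachable.
            pass
        except (OSError, ValueError) as exc:
            # Refused connections, timeouts, and malformed URLs all end up here.
            raise ValueError(
                f"Could not connect to Nougat server: {nougat_host}"
            ) from exc
```

With a server listening at an address such as http://localhost:8503 (an assumed address for illustration), SketchNougatLoader("paper.pdf", "http://localhost:8503") would construct normally. With an unreachable or malformed host, it raises ValueError.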
Loading Documents
The class provides two methods for loading documents: load() and lazy_load().
Both methods accept the following optional parameters:
split_section (default True): If set to True, it splits the document by section.
split_table (default True): If set to True, it splits the document by table.
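To make "split by section" concrete, here is a rough sketch of what such a split could look like. It assumes the converted text is Markdown-like, with section headings on lines that start with one or more # characters. The helper split_by_section is hypothetical and is not taken from the loader's source.

```python
import re
from typing import List


def split_by_section(text: str) -> List[str]:
    """Split Markdown-like text into chunks that each begin at a heading line."""
    # Zero-width split just before every line that starts with 1-6 '#' and a space.
    parts = re.split(r"(?m)^(?=#{1,6} )", text)
    # Drop empty pieces and trim surrounding whitespace.
    return [part.strip() for part in parts if part.strip()]


sections = split_by_section("# Intro\nHello\n## Method\nDetails\n")
# sections == ["# Intro\nHello", "## Method\nDetails"]
```

Any text that precedes the first heading is kept as its own chunk rather than discarded.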
You can also pass other arguments, such as the start page number (start) or the stop page number (stop), as keyword arguments (kwargs). These optional parameters specify which pages of your PDF you want to load.
These methods return Document objects that contain the processed content from your PDF file.
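The call shape for both methods can be sketched as follows. The parameter names split_section, split_table, start, and stop come from this section. The helper functions load_page_range and iter_page_range are hypothetical wrappers written for illustration, and loader stands for a NougatPDFLoader instance created as described under Initialization.

```python
from typing import Any, Iterable, List


def load_page_range(loader: Any, start: int, stop: int) -> List[Any]:
    """Eagerly load the given page range with both splitting options enabled."""
    return loader.load(
        split_section=True, split_table=True, start=start, stop=stop
    )


def iter_page_range(loader: Any, start: int, stop: int) -> Iterable[Any]:
    """Lazily yield documents for the same page range, one at a time."""
    yield from loader.lazy_load(
        split_section=True, split_table=True, start=start, stop=stop
    )
```

The lazy variant is the natural choice when a paper is long and you want to process documents as they are produced instead of holding the full list in memory.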