parsing_docling

Parse the data provided by the loader using Docling. Docling is a more robust and thorough document parsing library that:

Note that docling uses ML models for improved parsing, which makes it slower than simpler parsers like pymupdf.

Samples

SELECT ai.create_vectorizer(
    'my_table'::regclass,
    parsing => ai.parsing_docling(),
    -- other parameters...
);

This function takes no arguments.

A JSON configuration object that you can use in ai.create_vectorizer.

⌘I