Data extraction

Data extraction turns unstructured sources into structured records, and it is an integral step in building accurate overviews of data.

Tooling

A number of libraries and functions help enforce structured output from unstructured data.
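As a rough illustration of what "enforcing structure" means, the sketch below validates a model's JSON reply against a fixed schema using only the Python standard library. The `Measurement` record, its three fields, and the `parse_measurement` helper are hypothetical names chosen for this example; dedicated libraries typically offer richer validation than this.

```python
import json
from dataclasses import dataclass


@dataclass
class Measurement:
    """Hypothetical target schema for one extracted data point."""
    material: str
    value: float
    unit: str


def parse_measurement(raw: str) -> Measurement:
    """Parse a model reply and reject anything that does not fit the schema."""
    # json.JSONDecodeError is a subclass of ValueError, so malformed JSON
    # surfaces as the same exception type as a schema mismatch.
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")

    expected = {"material", "value", "unit"}
    if set(data) != expected:
        raise ValueError(f"expected keys {sorted(expected)}, got {sorted(data)}")

    if not isinstance(data["material"], str) or not isinstance(data["unit"], str):
        raise ValueError("'material' and 'unit' must be strings")

    # bool is a subclass of int in Python, so exclude it explicitly.
    value = data["value"]
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        raise ValueError("'value' must be a number")

    return Measurement(data["material"], float(value), data["unit"])
```

A caller would wrap `parse_measurement` in a `try`/`except ValueError` and, on failure, either discard the reply or ask the model again.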

Research

??? important "Extracting accurate materials data from research papers with conversational language models and prompt engineering"
    ChatExtract

    Developments: The authors present an effective way to extract structured data using a language model and a series of engineered prompts that identify the data and then validate its correctness with follow-up questions.

![image](https://github.com/ianderrington/genai/assets/76016868/cab6fd6a-5eed-4bbc-9ea7-4a4f19a68696)

![image](https://github.com/ianderrington/genai/assets/76016868/87fb9420-f9ae-45d6-b1d9-327d357cfbab)
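The identify-then-verify prompting described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the prompt wording, the `extract_with_check` function, and the `ask` callable (a stand-in for whatever function sends a prompt to a language model and returns its reply) are all assumptions made for this example.

```python
from typing import Callable, Optional

# Hypothetical interface: takes a prompt, returns the model's text reply.
Ask = Callable[[str], str]


def _is_yes(reply: str) -> bool:
    """Treat any reply that starts with 'yes' (case-insensitive) as agreement."""
    return reply.strip().lower().startswith("yes")


def extract_with_check(sentence: str, ask: Ask) -> Optional[str]:
    """Extract a value from a sentence, keeping it only if a follow-up confirms it."""
    # Step 1: identify whether the sentence contains data at all.
    if not _is_yes(ask(
        "Does this sentence report a material property value? "
        f"Answer Yes or No.\n\n{sentence}"
    )):
        return None

    # Step 2: ask for the data itself.
    candidate = ask(
        f"Give the value and unit reported in the sentence, and nothing else.\n\n{sentence}"
    ).strip()

    # Step 3: validate the extraction with a follow-up question.
    if not _is_yes(ask(
        f"Is '{candidate}' stated in the sentence below? Answer Yes or No.\n\n{sentence}"
    )):
        return None

    return candidate


def scripted_model(prompt: str) -> str:
    """Scripted stand-in for a real model, used only to exercise the control flow."""
    return "1.12 eV" if prompt.startswith("Give") else "Yes"


result = extract_with_check("The band gap of silicon is 1.12 eV.", scripted_model)
```

Returning `None` when either check fails keeps unconfirmed values out of the resulting dataset, at the cost of extra model calls per sentence.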