The Datograde Blog

Expand your AI and data extraction knowledge with detailed tutorials and case studies.

Entity Recognition and LLMs

How to recognize entities in unstructured text, from rules-based techniques to LLM prompt engineering

5/21/2025

NLP vs LLMs

Combining the precision and efficiency of NLP with the generative power of LLMs, you get more robust and accurate performance.

5/21/2025

What is Retrieval Augmented Generation (RAG)?

Retrieval Augmented Generation (RAG) combines vector search and LLM narration to provide a brand new search experience for users - learn how it works.

5/15/2025

Search, Information Retrieval and LLMs

Building search engines and information retrieval apps - how is it done, how does LLMs change the game, how did we get to RAG?

5/13/2025

Sentiment Analysis with LLMs

Different types of sentiment analysis and opinion mining tasks, why they are hard to solve, and how LLMs can help.

5/11/2025

Types of Unstructured Data

Different types of unstructured data - text, voice/audio, images and video, and machine data - and why it is hard to make sense of them.

5/1/2025

What is Unstructured Data Extraction?

Unstructured data extraction is a key use case of generative AI. Dive into examples, types, methods, and benefits in this guide.

3/31/2025

Data Extraction Through the Ages - From Manual to AI

Data extraction is the practice of selecting data from different sources, so that you can analyzing it for business intelligence or advanced analytics.

3/7/2025

Fine-tuning generative models 101

Fine-tuning helps you customize models for specific tasks and data. Get everything you need to know about fine-tuning in plain English and with practical examples.

3/5/2025

How to build a company research agent

Let's explore the different ways we can build a company research agent. I managed to make one in just 115 lines of code, no frameworks needed.

2/21/2025

Generating Better Blog Posts with LLMs

I tried to generate a blog post about the five mother sauces of French cooking with an LLM. Here's what happened.

2/17/2025

Data extraction using generative AI [2025 guide]

What is data extraction, and why is it suddenly back in the spotlight with the rise of generative AI and LLMs?

2/16/2025

AI and LLM Observability

Learn how to implement observability for AI systems in prod, capture model behavior, data quality, and system performance.

2/15/2025

What is AI and LLM Evaluation?

AI Evaluation is a systematic process that measures a system’s performance, reliability, and fairness. Learn about the different types of AI evaluation.

2/14/2025

Web Scraping with AWS Lambda

Comprehensive guide to web scraping with AWS Lambda and S3, including how to set up an S3 bucket, create a Lambda function, and schedule it to run at specific intervals.

1/31/2025

How to extract data from PDFs with Python

A step-by-step guide to extract data from PDFs with Python

1/16/2025

The Ultimate Guide to Fine-Tuning AI Models: Comparing Offerings from OpenAI, Google, Meta, and More

A deep dive to fine-tuning AI models, comparing offerings from OpenAI, Google, Meta, and more.

11/22/2024

New: Advanced Merge Tools and Dynamic Pipelines

Product Updates: Introducing powerful CSV merge operations, backup functionality, and dynamic pipeline steps for enhanced data processing workflows.

9/30/2024

New: Files, Merge, and Save

Product Updates for Sep 20th, 2024: you can now merge and save your processed data directly in your library, and export it in a variety of formats.

9/20/2024

New: Markdown, Text, and TSV

Product Updates for Sep 16th, 2024: you can now export your data in Markdown, Text, and TSV formats.

9/16/2024

Building an Automatic Schema Generator for Arbitrary Datasets

A behind the scenes look at how I built an automatic schema generator for arbitrary datasets.

9/1/2024