The Datograde Blog

Expand your AI and data extraction knowledge with detailed tutorials and case studies.

AI for Marketplaces

Use AI to unlock the value of unstructured data in your marketplace, increasing match rates, reducing operational costs, and improving customer satisfaction.

5/25/2025

Entity Recognition and LLMs

How to recognize entities in unstructured text, from rule-based techniques to LLM prompt engineering.

5/21/2025

NLP vs LLMs

Combine the precision and efficiency of NLP with the generative power of LLMs for more robust and accurate performance.

5/21/2025

What is Retrieval Augmented Generation (RAG)?

Retrieval Augmented Generation (RAG) combines vector search and LLM narration to provide a brand new search experience for users - learn how it works.

5/15/2025

Search, Information Retrieval and LLMs

Building search engines and information retrieval apps - how is it done, how do LLMs change the game, and how did we get to RAG?

5/13/2025

Sentiment Analysis with LLMs

Different types of sentiment analysis and opinion mining tasks, why they are hard to solve, and how LLMs can help.

5/11/2025

Types of Unstructured Data

Different types of unstructured data - text, voice/audio, images and video, and machine data - and why it is hard to make sense of them.

5/1/2025

What is Unstructured Data Extraction?

Unstructured data extraction is a key use case of generative AI. Dive into examples, types, methods, and benefits in this guide.

3/31/2025

Data Extraction Through the Ages - From Manual to AI

Data extraction is the practice of selecting data from different sources so that you can analyze it for business intelligence or advanced analytics.

3/7/2025

Fine-tuning generative models 101

Fine-tuning helps you customize models for specific tasks and data. Get everything you need to know about fine-tuning in plain English and with practical examples.

3/5/2025

How to build a company research agent

Let's explore the different ways we can build a company research agent. I managed to make one in just 115 lines of code, no frameworks needed.

2/21/2025

Generating Better Blog Posts with LLMs

I tried to generate a blog post about the five mother sauces of French cooking with an LLM. Here's what happened.

2/17/2025

Data extraction using generative AI [2025 guide]

What is data extraction, and why is it suddenly back in the spotlight with the rise of generative AI and LLMs?

2/16/2025

AI and LLM Observability

Learn how to implement observability for AI systems in production and capture model behavior, data quality, and system performance.

2/15/2025

What is AI and LLM Evaluation?

AI Evaluation is a systematic process that measures a system’s performance, reliability, and fairness. Learn about the different types of AI evaluation.

2/14/2025

Web Scraping with AWS Lambda

Comprehensive guide to web scraping with AWS Lambda and S3, including how to set up an S3 bucket, create a Lambda function, and schedule it to run at specific intervals.

1/31/2025

How to extract data from PDFs with Python

A step-by-step guide to extracting data from PDFs with Python.

1/16/2025

The Ultimate Guide to Fine-Tuning AI Models: Comparing Offerings from OpenAI, Google, Meta, and More

A deep dive into fine-tuning AI models, comparing offerings from OpenAI, Google, Meta, and more.

11/22/2024

New: Advanced Merge Tools and Dynamic Pipelines

Product Updates: Introducing powerful CSV merge operations, backup functionality, and dynamic pipeline steps for enhanced data processing workflows.

9/30/2024

New: Files, Merge, and Save

Product Updates for Sep 20th, 2024: you can now merge and save your processed data directly in your library, and export it in a variety of formats.

9/20/2024

New: Markdown, Text, and TSV

Product Updates for Sep 16th, 2024: you can now export your data in Markdown, Text, and TSV formats.

9/16/2024

Building an Automatic Schema Generator for Arbitrary Datasets

A behind-the-scenes look at how I built an automatic schema generator for arbitrary datasets.

9/1/2024