The Datograde Blog

Expand your AI and data extraction knowledge with detailed tutorials and case studies.

What is Unstructured Data Extraction?

Unstructured data extraction is the process of extracting data from unstructured data sources.

3/31/2025

Data Extraction Through the Ages - From Manual to AI

Data extraction is the practice of selecting and retrieving data from one or more sources for the purpose of storing, transforming, integrating, and analyzing it for business intelligence or advanced analytics.

3/7/2025

Fine-tuning generative models 101

Fine-tuning helps you customize models for specific tasks and data. Get everything you need to know about fine-tuning in plain English and with practical examples.

3/5/2025

How to build a company research agent

Let's explore the different ways we can build a company research agent. I managed to make one in just 115 lines of code, no frameworks needed.

2/21/2025

Generating Better Blog Posts with LLMs

I tried to generate a blog post about the five mother sauces of French cooking with an LLM. Here's what happened.

2/17/2025

Data extraction using generative AI [2025 guide]

What is data extraction, and why is it suddenly back in the spotlight with the rise of generative AI and LLMs?

2/16/2025

AI and LLM Observability

Learn how to implement observability for AI systems in prod, capture model behavior, data quality, and system performance.

2/15/2025

What is AI and LLM Evaluation?

AI Evaluation is a systematic process that measures a system’s performance, reliability, and fairness. Learn about the different types of AI evaluation.

2/14/2025

Web Scraping with AWS Lambda

Comprehensive guide to web scraping with AWS Lambda and S3, including how to set up an S3 bucket, create a Lambda function, and schedule it to run at specific intervals.

1/31/2025

How to extract data from PDFs with Python

A step-by-step guide to extract data from PDFs with Python

1/16/2025

The Ultimate Guide to Fine-Tuning AI Models: Comparing Offerings from OpenAI, Google, Meta, and More

A deep dive to fine-tuning AI models, comparing offerings from OpenAI, Google, Meta, and more.

11/22/2024

New: Advanced Merge Tools and Dynamic Pipelines

Product Updates: Introducing powerful CSV merge operations, backup functionality, and dynamic pipeline steps for enhanced data processing workflows.

9/30/2024

New: Files, Merge, and Save

Product Updates for Sep 20th, 2024: you can now merge and save your processed data directly in your library, and export it in a variety of formats.

9/20/2024

New: Markdown, Text, and TSV

Product Updates for Sep 16th, 2024: you can now export your data in Markdown, Text, and TSV formats.

9/16/2024

Building an Automatic Schema Generator for Arbitrary Datasets

A behind the scenes look at how I built an automatic schema generator for arbitrary datasets.

9/1/2024