Blog posts

2026

Introducing synthetictext

less than 1 minute read

Published:

synthetictext is an LLM-powered Python package for generating synthetic text data for text classification tasks.

2024

Getting started with Selenium Webdriver and Requests in Python

9 minute read

Published:

In this short tutorial, we search the US Patent and Trademark Office (USPTO) patent database by keyword and scrape the results with Selenium WebDriver. We then use the Requests library to download the individual patent PDF documents.

2021