Building a Scalable and Customizable Data Scraping Pipeline. Part 1: Overview

In the world of big data, having access to timely and accurate information is crucial. However, with the vast amount of data scattered across the internet, gathering that data is far from simple. This is where data scraping comes into play. In this post, we will explore the challenges and solutions around building a scalable […]

Discovering Topics in News Articles

Digital transformation (DX) is growing rapidly, and with it the need to classify massive collections of text. Latent Dirichlet Allocation (LDA), a popular approach for discovering hidden topics in text data, is one effective way to handle this problem. This article will show you how to use LDA with the AG News dataset, which is […]