top of page

Let's Learn and Share AI
Challenges in Relational Multi-Table Synthetic Data Generation
1. Introduction Synthetic data generation is increasingly important when working with sensitive or regulated datasets. While generating synthetic data for single tables is straightforward using GANs or statistical models, generating relational multi-table synthetic data is significantly more complex. Relational databases do not exist in isolation. They contain relationships that define how information flows across the system: Foreign keys (parent → child) Many-to-one (St
Nov 195 min read


Semantic Data Matching for Large Datasets: A Scalable Pipeline
In the realm of data management, integrating information from diverse sources poses significant challenges due to variations in terminology, structure, and content. Traditional matching methods, which depend on exact or approximate string comparisons, often fail to capture underlying meanings, leading to incomplete or inaccurate alignments. To overcome this, fuzzy logic and phonetic matching became prominent approaches. Fuzzy matching uses algorithms like Levenshtein distanc
Oct 228 min read


Entity resolution using Artificial intelligence
In the age of big data, organizations are swimming in vast oceans of information. While this data holds immense potential, its true value...
Sep 238 min read
Personnel Selection and Management in the Age of AI: A HR Perspective
Hey there! Artificial intelligence is shaking up HR like a new intern with big ideas—full of promise, but not without quirks. From...
Jun 177 min read


Step-by-Step Guide to Configuring GPU in Azure N-Series Virtual Machines
Overview The Azure GPU VM “NC4as_T4_v3” is equipped with a single NVIDIA T4 GPU, offering 16 GB of GPU memory optimized for AI...
May 282 min read
Creating and Using MCP inside Langflow ( No Code) -Part II
What is a MCP ? MCP is an open protocol that standardizes how applications provide context to LLMs. We can consider MCP like a USB-C...
Apr 283 min read
Extending LangFlow Components - Part I
Introduction Welcome to the series of blogs—Extending Langflow Components, where we will share our journey of exploring and traversing...
Apr 193 min read
Harnessing the Power of TimescaleDB
TimescaleDB is an open-source database designed to make SQL scalable for time-series data. It is engineered up from PostgreSQL and packaged
Nov 25, 20244 min read
bottom of page
