Jan 24, 2026
How a design principle from 1970s operating systems is quietly shaping the future of autonomous AI Introduction In 1971, Ken Thompson and Dennis Ritchie faced a problem that sounds oddly familiar tod…
Read
Jan 20, 2026
Introduction I’ve been working in and around AI research and applied machine learning for close to two decades now. I’ve lived through multiple hype cycles, winter cycles, algorithmic plateaus, infra…
Read
Dec 15, 2025
Insights, pitfalls, and architectural truths from real-world deployments The first time you demo an agent internally, it feels magical. It answers questions, writes decent text, sometimes even calls…
Read
Nov 19, 2025
Imagine you deploy a chatbot assistant at your company. It can answer FAQs, lookup policies, book meetings — all sorts of tasks. Early on, it seems to handle single questions well. But when employees…
Read
Jul 17, 2025
“In science, if you know what you’re doing, you shouldn’t be doing it. In engineering, if you don’t know what you’re doing, you shouldn’t be doing it.” — Richard Hamming This quote resonates deeply w…
Read
Mar 16, 2025
cross-posting from linkedin Deepak Babu P R on LinkedIn: #coding #ai #hci #reasoning #windsurf #cursor A͟I͟ ͟f͟o͟r͟ ͟c͟o͟d͟i͟n͟g͟ ͟t͟a͟s͟k͟s͟ One of the recurring topics in my coffee chats with scien…
Read
Feb 1, 2025
Introduction: A ChatGPT Moment, Reimagined The release of DeepSeek this week has sparked a familiar buzz — a ChatGPT moment for 2025. Users outside the AI/tech bubble are once again witnessing the ma…
Read
Jan 6, 2025
Information systems have evolved from simple keyword-based search engines to sophisticated natural language-based question-answering (QA) systems. I distinctly remember my early days in 2012, buildin…
Read
Oct 28, 2024
As an AI scientist who has been immersed in the field for many years, I’ve witnessed firsthand the grassroots innovations that have propelled artificial intelligence forward — be it large language mo…
Read
Aug 27, 2024
M achine learning (ML) and artificial intelligence (AI) are fundamentally about learning patterns and models from data or a collection of experiences. These experiences can be thought of as individua…
Read
Jun 25, 2024
I had a fantastic time at the Databricks Data + AI Summit (DAIS) 2024, connecting with over 2K+ participants from diverse backgrounds, including CXOs and ICs (engineers, scientists, and product manag…
Read
Jun 9, 2024
I might be biased, but it seems that Siri’s ability to understand and execute commands has recently taken a significant leap forward. This step-function improvement in Siri’s responses might be due t…
Read
Jun 7, 2024
Welcome to “Thinking-Out-Loud,” a series where I delve into intriguing research papers and share my spontaneous reflections, insights, and personal experiences. Each post will explore a different pap…
Read
May 20, 2024
In recent years, the field of artificial intelligence has experienced unprecedented growth. Two driving forces underpin this advancement: the availability of colossal datasets, exemplified by the Lla…
Read
Apr 9, 2024
Sample Efficiency (Data) In the realm of machine learning, “sample efficiency” is a term that defines a model’s ability to learn effectively from a limited number of examples. High sample efficiency…
Read
Mar 31, 2024
Multimodal models are creating a synergy between previously separate research areas such as language, vision, and speech. These models use universal architectures that treat each modality as a distin…
Read
Jan 22, 2024
We have been discussing audio and language models in last few posts ( Multimodal audio-language LMM and Qwen-Audio FM ). In continuation to these, here we discuss another important foundation model f…
Read
Jan 10, 2024
As a follow-up to the last blog on large multimodal audio models (LMM), we’re here to explore an open-source large LMM audio model. While dozens of image and text MM models exist, those involving aud…
Read
Dec 29, 2023
Recent advancements in Large Language Models (LLMs) have demonstrated a significant capability for reasoning and planning, a trait predominantly seen in models with over 100 billion parameters (consi…
Read
Dec 4, 2023
In the age of foundational models that are based on deep learning architectures like transformer models, we can process large amounts of data to learn complex representations of data, forming world k…
Read
Nov 4, 2023
Mechanistic Interpretability is a field of study that concerns study of neural networks (more generally ML models) with an intent to understand and explain inner workings of a machine learned model.…
Read
Oct 9, 2023
We often hear reward hacking in the context of LLMs generating content that mimics style over substance. so what exactly does this mean ? why does it occur ? Before we dive into concept of reward hac…
Read
Sep 30, 2023
As we transitioned from the traditional AI methods of the 2010s to deep learning-based approaches in the 2020s, there’s been a marked shift towards leveraging large-scale datasets. These datasets are…
Read
Sep 28, 2023
A Comprehensive Look at ‘The Cold Start Problem’ Having been a part of several unicorn startups in India, including Mu Sigma, InMobi, and Snapdeal, I’ve had the opportunity to contribute to and obser…
Read
Sep 28, 2023
Having been a part of several unicorn startups in India, including Mu Sigma, InMobi, and Snapdeal, I’ve had the opportunity to contribute to and observe the unique challenges and dynamics of their gr…
Read
Aug 16, 2023
I so wish research community stressed more on this. (negative results) I think some conferences have started to have a forum dedicated to these, but these are not yet mainstream.
Read
Aug 16, 2023
I would emphasize, taking an independent shot at the problem first and then refer literature and third, see if existing methods can be improved fusing ideas from of your own from (i) or those resulti…
Read
Jun 28, 2023
LLMs are fast-evolving and we have a new model every week, showing up on leaderboards[1] beating previous SoTA model on multiple NLP benchmarks. There are multiple architectures with nuances to train…
Read
Jun 5, 2023
Emergent abilities refer to the capabilities that arise spontaneously from the complex interactions of simpler components. They are properties that can’t be predicted solely based on the individual p…
Read
Apr 28, 2023
The growing capabilities of Large Language Models (LLMs) have led to an increasing demand for their integration into production systems. However, implementing LLMs with low-latency requirements like…
Read
Apr 20, 2023
Large Language Models (LLMs) have caused a paradigm shift in the way NLP is traditionally done i.e anything to do with text. I have been working in this domain for 10+ years having seen n-gram ways o…
Read
Nov 18, 2022
We have all been in situations where we have felt a lack of clarity and understanding on a complex project, metrics defined for measurement disagree with customer anecdotes, deploying a solution that…
Read
Apr 24, 2020
Reinforcement Learning (RL) is a learning paradigm different from traditional machine learning (supervised and unsupervised). The learning problem considered here mimics humans learning from interact…
Read
Apr 23, 2017
prdeepakbabu/Python Python - All *.py scripts github.com Deep learning neural networks have shown promising results in problems related to vision, speech and text with varying degrees of success. I h…
Read
Jun 11, 2016
Machine Learning – Linear Regression using gradient descent vs. neural network Machine learning or Supervised Learning broadly encompasses two classes of problems – regression and classification. Reg…
Read
Feb 28, 2016
Information Retrieval : Spell Correction in Search Engines – Algorithms & Strategies Spell Checking & Correction is an important component of any information retrieval(aka document retrieval) system…
Read
Apr 30, 2015
In this blog post, I am going to discuss about mobile advertising networks, ecosystem & industry trends. This understanding would serve as base for the future posts I plan to write around data scienc…
Read
Nov 7, 2014
We talk about big data quite often these days, wanted to put some fundas about basics around data. Do you know the singular form of data ? How data differs from information vs. knowledge? How insight…
Read
Jan 24, 2014
I have spent most of my life into data and its applications to problems. Now, when i look at some patterns in algorithms we use in analyzing data, one thing that emerges is increased use of meta-algo…
Read
Oct 6, 2012
We are in the era of big data, with newer sources of data emerging at an exponential rate involving sensor data, EHR, social network/media data & machine generated data. In this blog post, I will be…
Read
May 25, 2012
Clustering is an unsupervised classification (learning) technique, where the objective is to maximize inter-cluster distance while minimizing the intra-cluster distance. By unsupervised, we mean clus…
Read
Mar 5, 2012
Hadoop is an open source framework for writing and running distributed application that process huge amounts of data ( more famously called Big Data). The name Hadoop is not an acronym; it’s a made-u…
Read
Oct 25, 2011
In the recent past, a variety of new social media sites have emerged – location Based Services( like foursquare, gowala), Group Deals( like groupon), microblogging( like twitter, fb). These social me…
Read
Oct 12, 2011
This blog post is about the analysis of implementation of helmet rule in various Indian states and the effect it had on bringing down accidental deaths due to 2 wheelers. Here we are specifically foc…
Read
Jul 31, 2011
This blog post is about comparison of amazon.com and linkedin.com in terms of similarities across dimensions of analytic maturity & use of data shared by their customers. As Thomas Davenport mentions…
Read
Jul 2, 2011
Turning raw data into insights often involves integrating data from multiple disparate sources (not just limited structured one), analyzing the data, visualizing it and socializing the results/insigh…
Read
Feb 19, 2011
In this blog post, i talk about 3 scenarios where there had been highly valuable insights derived, yet remaining simple. 1. Customers shopped online returned via stores Randy Lea, VP product & servic…
Read
Nov 13, 2010
In my previous post , i had discussed about Association rule mining in some detail. Here i have shown the implementation of the concept using open source tool R using the package arules. Market Baske…
Read
Sep 11, 2010
For the last 6 months, i have been closely following trends in information management. Below are few of my observations. Data source explosion: Business Problems are gaining complexity day by day, he…
Read
Mar 3, 2010
Do you find analytics/data mining a difficult topic to understand and learn? To a certain extent true if you were to use books as the source. Friends, i found these two very valuable and high quality…
Read
Feb 24, 2010
Association Rule Mining [ Implementation using R here] Association Rule mining is one of the classical DM technique. Association Rule mining is a very powerful technique of analysing / finding patter…
Read
Jan 17, 2010
Here is the continuation to the article i had posted few days back < here >. I am back with some interesting info on recent advancements in the area of analytics. Before going on to the details, wann…
Read
Dec 28, 2009
Informatica recently unvieled its latest release informatica 9 with some major enhancements as compared to its earlier versions, believed to revolutionize data integration & ETL market. I know lot of…
Read
Dec 10, 2009
Visualization is considered to be one of the valuable tools in data mining. Visual analysis helps to understand data with minimum effort. Using graphs/ 3D plots to visualize data is more effective wa…
Read
Nov 23, 2009
In my today’s post i will be talking about amazon.com, a leading e-commerce company. Amazon.com has huge amounts of data about its customer base, products and customer purchase behaviour. To boost up…
Read
Nov 14, 2009
Recently i had attended a conference on “Future of Predictive Analytics” here at Bangalore. Here is the summarized version of the topics covered; Data mining is an automated process of discovering hi…
Read