For all the right reasons, large language models have taken the world by storm! It’s pretty impressive what ChatGPT can do - unprecedented coherence, relevance and tone of delivery for a synthetic text generation. Well done OpenAI!
Read MoreSuneeta Mall
Rambling of a curious engineer & data scientist
What are manifold learning algorithms? What is t-SNE and UMAP? What are the differences between t-SNE and UMAP? How can I explore the features of a dataset using t-SNE and UMAP? These are my notes from my recent exploration into t-SNE and UMAP and trying to apply them to a multi-label dataset to understand the abilities and limits of these algorithms.
Read MoreWhat makes deep-learning so great, despite what you may have heard, is data! There is an old saying, that sums it up pretty well:
Read MoreHow do you manage the configurations of your model training experiments? Having good configuration management in place improves the user experience and simplifies experiment management. Some of the indirect advantages of having a good config solution are clean, reliable, and simple application code. This is true for any software application however some applications demand higher investment on the config piece than others. One key differentiator perhaps here is how many config fields one has and how each of these relates to the other. This can get real messy real quick with deep learning experiments where the list of config and hyperparameters can rapidly grow out of hand as soon as one breaks out of adhoc experimentations.
Read MoreThe energy and excitement in machine learning and deep learning communities are infectious these days. So many groundbreaking advances are happening in this area but I have often found myself wondering why the only thing that makes it all shine - yes I am talking about the dark horse of deep learning the data is so underappreciated. The last few years of DL research have given me much joy and excitement and I carry hope now that going forward we can see some exciting progress in this space that explore advances in deep learning in conjunction with data! In this article, I summarise some of the recent developments in the deep learning space that I have been blown away by.
Read MoreA few days ago, I deployed a brand new application onto a self-managed Kubernetes cluster (hereafter referred to as Kube).
Suffice to say, all hell broke loose. The pods were getting OOMKilled
with error code 137 left and right!
This technical blog series titled “Reproducibility in Machine Learning” is going to be divided into three parts:
- Reproducibility in Machine Learning - Research and Industry
- Realizing reproducible Machine Learning - with Tensorflow
- End-to-end reproducible Machine Learning pipelines on Kubernetes