How to retrieve credentials stored in AWS Secrets Manager from AWS Lambda running in VPC

AWS Secrets Manager is a secrets management service that enables you to store credentials and retrieve it dynamically when you need them. It helps protect access to your applications and services. With AWS Secret Manager you can - Programmatically retrieve encrypted secret values at runtimeStore different types of secretsEncrypt secret dataAutomate secret's rotation When you … Continue reading How to retrieve credentials stored in AWS Secrets Manager from AWS Lambda running in VPC

FastAI: Multi-Label Classification [Chapter-6]

PyTorch and fastai have two main classes for representing and accessing a training set or validation set: Dataset:: A collection that returns a tuple of your independent and dependent variable for a single itemDataLoader:: An iterator that provides a stream of mini-batches, where each mini-batch is a tuple of a batch of independent variables and … Continue reading FastAI: Multi-Label Classification [Chapter-6]

How to load table with JSONB data type into Aurora PostgreSQL using AWS Glue

In this blog post I will cover what is JSON data type, what options does PostgreSQL offers to store JSON data, how you can create AWS Glue connection to Aurora PostgreSQL database running in private subnet and how can you then use AWS Glue to write data into table with JSONB datatype into Aurora/RDS PostgreSQL … Continue reading How to load table with JSONB data type into Aurora PostgreSQL using AWS Glue

FastAI – Image Classification – [Chapter 5]

DataLoaders DataLoaders - Is a class that just stores whatever DataLoader object you pass to it and makes it available as train and valid. To turn downloaded data into a DataLoaders object we need at least 4 things - What kind of data we are working withHow to get the list of itemsHow to label … Continue reading FastAI – Image Classification – [Chapter 5]

FastAI : Training a Digit Classifier - [Chapter-4] - Part II

Metric is to drive human understanding and the loss is to drive automated learning. Stochastic Gradient Descent - As Arthur Samuel had mentioned the description of machine learning Suppose we arrange for some automatic means of testing the effectiveness of any current weight assignment in terms of actual performance and provide a mechanism for altering … Continue reading FastAI : Training a Digit Classifier - [Chapter-4] - Part II

fastai : Training a Digit Classifier - Baseline Model [Chapter-4] - Part I

Chapter 1 Notes - https://aprakash.wordpress.com/2021/07/05/fastai-just-go-out-and-play-chapter-1/ I want to start this blog with some of my personal notes before jumping on the book. Below is the notebook with examples of 1-dimensional, 2-dimensional and 3-dimensional array using numpy. 2. What is a norm? A norm is a measure of distance and have three properties All distances are … Continue reading fastai : Training a Digit Classifier - Baseline Model [Chapter-4] - Part I

fastai : Just Go out and Play [Chapter -1]

For quite some time I have been thinking to start writing about data science and machine learning, but somehow it wasn't happening. Recently, I joined 25 weeks long fastbook reading session organized by Weights & Biases and lead by Aman Arora ( awesome instructor 👏). This is my first time participating in a reading session … Continue reading fastai : Just Go out and Play [Chapter -1]

Export table from Aurora PostgreSQL to Amazon S3

In this blog post I discuss how to export 100GB non-partitioned table from Aurora PostgreSQL to Amazon S3. I will walk you through two approaches that you can use to export the data. Firstly I will demonstrate using aws_s3, a PostgreSQL extension which Aurora PostgreSQL provides and then using AWS Glue service. The post also … Continue reading Export table from Aurora PostgreSQL to Amazon S3

Build a basic UI for uploading data to s3 using AWS Amplify

Disclaimer: I am not a web developer and I do not have knowledge of JavaScript. The code snippet is mostly taken from different sources and tweaked for my specific use case. 🙂 I was looking for a way to enable a user to upload bunch of small size (< 10 MB) csv files to a … Continue reading Build a basic UI for uploading data to s3 using AWS Amplify

Handling missing values in Pandas to Spark DataFrame conversion

Data transformation is an important aspect of Data Engineering and can be a challenging task depending on the dataset and the transformation requirements. A bug in data transformation can have a severe impact on the final data set generated leading to data issues. In this blog I am going to share my experience of having … Continue reading Handling missing values in Pandas to Spark DataFrame conversion