About me

I am a Data Architect with ~15 years of experience in various Data roles.
I have been an Analyst, Project Manager and most recently a Data Engineer...
across hypergrowth teams ranging from Consulting, BigTech, pre-IPO, bootstrappy lean teams..
My strengths are in planning, designing and driving DataOps - pipelines, data models and data infra & systems
spanning data engineering, visualization, data classification, governance, stewardship and metadata management
Education: Bachelors Engineering, MBA, MS-Data Science
Programing Skills: Python, R, SQL, PL/SQL

Work Experiences

I have built data products for marquee Big Tech firms like AMZN, ORCL, NIKE in e-commerce, retail, cloud hardware planning domains. I have worked at pre-IPO initiatives like revenue accuracy, SOX reporting and compliance... at UDMY, SOND. I have worked as Consultant deployed on client sites, working on a schedule, built greenfield data stacks and been part of live migrations across stack with ~$200mn ARR analytic products as Amazon Brand Analytics - ARA Basic, ARA Premium. I often wear multiple hats in a team, with my techno-functional skills I can effectively partner business and technology teams with analytics engineering efforts. Here's a brief samples of my coursework efforts.

Story-telling using Data Visualizations

Project aims at learning visualization and presentation techniques. Being able to talk through a business case, connecting data points to be able to present a compelling story through graphical plots and visualizations. PowerBI dashboards, Visualizations using R and Python packages are some of the tools used

Churn Prediction

Explore factors leading to customer churn in a banking context. Project goal is to develop a prediction model to answer some questions like...
- is the customer going to churn ?
- what is the probability of churn ?
- can the churn redressal be actioned timely? what would be required to enable such?

Explore Jobs, Roles, Titles

There are quite a few similar sounding roles which can be performed using the skills gained in this program. A degree in data science enables candidacy to roles like Data Scientist, Data Engineer, BI Engineer, Data Analyst, ML Engineer, etc. With this project, perform a job ~ role analysis for -
* what jobs or titles get higher salaries ?
* what companies pay more ?
* does location play any role in salary variations ?

Donald Trump Tweet Study

Trump Tweets were quite a feature of his presidency. The project aims to understand Trump tweets.
~ who are his targets ? is there a pattern ?
~ what is the core message, sentiment ?
~ how does the verbiage, targets, sentiment change from campaign to presidency ?

Understanding Hadoop

Apache Hadoop is an open source framework that is used to efficiently store and process large datasets ranging in size from gigabytes to petabytes of data. This project intends academic study and poster presentation of various aspects of the data processing framework.

Churn Propensity Analysis

Acquiring a customer takes a lot more than retaining an existing one. Analyze for customer churn and propensity factors to switch to a different banking provider. Can these results be captured ahead/ in-time for remedial actions by business? Perform statistical analysis using R Packages.

Used Car Pricing Dynamics

Project intends to perform feature analysis for used car prices. This study is relevant for new entrants into US auto markets, product designers and engineers to identify and optimize on key selling features. Following are the questions we seek answers to in the project
§ What features impact car price?
§ How well do these features explain the price?

Data Pipelines QuickStart Guide

Follow the code and workings of this project to get started with a data pipeline infra. The workings result in an out-of-box ELT service, a data transformation framework, a scheduler / orchestrator and data visualization utility. The code snippets demonstrate a Salesforce object ingestion, and its $ value weekly trends for opportunity sizing. Airbyte, Airflow and Tableau are the various open-source utilities chosen for this project. The code deployment and the transformation framework is container based and runs using Python packages

Python Code Snippets - Programming 101

Learning python programming... here are some cool code snippets, workings from my learnings.
* generate itemized receipt
* billing and receipts for variable rates
* perform mathematical operations
* temperature conversion and listing
* count words in a file
* write word counts to a file
* capture response from an api
* cash register checkout
* "get your weather" app

Address

Fremont
California, 94536
United States of America