Close


Swapnil Phulse



Data, DataOps and Software Engineering

Big Data, Distributed Architecture, Microservice APIs, Cloud Computing

Database, ETL Pipelines, Cognitive Automation, Web Scraping, CLI tooling

Linkedin Github Blog

Get in touch for a resume (or a cup of coffee)?

About Me

Honestly speaking, these 'About me' sections are so hard to write; there's either so much to say that you don't know where to start, or there is nothing to say at all. But I'll give it a go.

I'm a data-enabled software engineering professional with focus on cross-functional enterprise applications and data acccess platforms. Architecting and building highly available, performant and scalable distributed systems. Industry experience with writing RESTful APIs, backend design, efficient data access layers, big data, modeling/warehousing, and building ETL pipeline solutions.

Having had the fortune of working on both software enineering and big data architecture, I bring in a diverse set of skills and an end-to-end expertise in clean and efficient coding, practical knowledge of design patterns(for APIs as well as database solutions) and test-driven development.

I'm also a chronic automator and web scraper, which blends in well with my innate DRY(Don't-Repeat-Yourself) personality type. Besides my work, I also work on creating pro-bono infomercial products aimed at teaching software development to the less priviledged.

Quite recently, I joined Thoughtworks Inc. as a Sr. Data Engineer(Remote). Super stoked to start this new journey!!

In the recent past, I was working with with Deloitte in their Analytics and Technology practice. The sheer exposure to range of technology at my workplace has made me comfortable working fullstack and back - right from digging into css, to writing APIs, to creating batch/streaming backend solutions. I write software backends that power performant data analytics platforms for leading banks across the globe. I also have previous experience working with top US Pharma giants, mainly on optimizing their data refineries & adjusting Incentive Compensation programmes. Banking and Pharma - needless to say, code that I commit has to respect the country specific boundaries, regulations and policies. But this complexity drives me to create simple yet maintainable solutions.

When I'm not fighting fire at home, I enjoy automating mundane tasks on the web (admittedly a life-hacker). I keep keen interest in shortform video editing - a skill I picked up while creating inhouse tutorial videos for firms I work(ed) at.Other interests include pondering about life, sometimes long enough to drop nuggets of wisdom - I'm my personal 'quote philosopher'. Besides that, I also have a penchant for humor, memes & staying woke.

Experience

ZS Associates

Senior Technology Analyst - Python Developer

Worked in large scale data refining and incentive compensation programs for sales representatives. Clients included Top 5 Pharma companies in USA and Europe.

Office of Graduate and Professional Studies - Texas A&M University

Graduate Assistant Developer

Development of confidential data and reporting platform on graduation completion and retention rates based on demographics - age, race & gender. End-users/Stakeholders incuded Board of Directors at Texas A&M University.

Deloitte

Senior Consultant - Analytics and Technology

Clients include top 3 banks in USA. Worked mainly in Transaction Monitoring and KYC(Know Your Customer) analysis. Dealt with Data Engineering, Software Architecture, Python automation, Machine Learning applications in FinTech as well as developing scalable backend for a data curation SAAS tool.

Thoughtworks

Sr. Data Engineer(Remote)

Education

College of Engineering, Pune

July 2008 - May 2012

Bachelor of Technology in Electronics and Telecommunications

Key accomplishments:

Texas A&M University

June 2015 - May 2017

Master of Science in Management Information Systems

Key core courses completed:

Projects

Finding fraudulent bot-culture in Amazon(semi-viral linkedin article: 700+likes, 100+shares)

One fine day during the last Christmas vacation, my broken earphone led me to investigate a notorious bot culture on Amazon.

Linkedin articleGithub linkNbviewer link

Evolving code using OOPs and Tests

Wrote a couple of blogposts to simplify understanding and highlight importance of OOPs at the backdrop of test driven development

Read part 1Read part 2

Scraping the entire Louvre artwork in one line of code

Using built in Unix tools to download all the artwork hosted by the newly opened Louvre website

Read BlogYT screencast

Apple Vs Microsoft : The race for market capitalization

This was the week when Microsoft overtaking Apple was the breaking news. I tried to source external data & plot this decade long game of catch-up.

Github linkNbviewer link

Scraping H1B data for Data Science/ML positions from h1binfo.org

A genuine request from my cousin to list all 'h1b sponsoring' companies for Data Science/ML positions. My very first UNPAID freelancing gig. Thanks a lot, cousin!

Github linkNbviewer link

AnalyticsVidhya loan default prediction practice problem - Leaderboard rank 17 of 47K participants

Weekend long hackathon conducted by Analytics Vidhya to predict loan defaults. Surprisingly only adding a couple of simple features worked superbly!

Github linkNbviewer link

Mr Obama Vs. Mr Trump : First 2 years in White House

Just a fun scraping project to get press briefings data from official state.gov website. Then plotting annual heatmaps to compare Obama & Trumps first 2 years in White house.

Github linkNbviewer link

Passion project - Humor Detection using Natural Language Processing(Word2Vec + Scikit)

This is still work in progress. Trying to classify whether a tweet is humorous or not. My capstone project for Machine Learning Guild.

Github linkNbviewer link

Ongoing Project for tricky SQL/Python questions - Leetcode, Hackerearth & other SPOJs

Created skeleton for now. Add more info here.

Github link

Skills (Daily-drivers)

Get in Touch