Building reliable data pipelines and scalable software. 3.5+ years of experience turning raw data into business value — and sharing the journey openly.
About
I'm a Data Engineer and Software Engineer with 3.5+ years of experience designing pipelines, building analytics infrastructure, and engineering backend systems.
Most recently, at Infinite Computer Solutions, I progressed from Software Engineer to Associate Data Analyst, working across PySpark, Spark Structured Streaming, Apache Airflow, HDFS, and Oracle Cloud.
I believe in learning in public — documenting wins, failures, and everything in between. You'll find that on my LinkedIn feed.
Skills
Data Engineering
Cloud & Databases
Languages
Tools & Libraries
Experience
Projects
Real-time data ingestion pipeline using Spark Structured Streaming, processing events from Kafka into HDFS with fault-tolerant checkpointing.
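For illustration only, here is a minimal PySpark sketch of this kind of pipeline; it is not the project's actual code. The function names, the broker address, the topic, and the paths are all hypothetical, and the sketch assumes the Spark Kafka connector (`spark-sql-kafka`) is available on the cluster. Checkpointing is what allows a restarted query to resume from the last committed Kafka offsets.

```python
from typing import Dict


def kafka_source_options(bootstrap_servers: str, topic: str) -> Dict[str, str]:
    """Build the options for Spark's Kafka source (hypothetical helper)."""
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        # Assumption: start from new events only; "earliest" would replay the topic.
        "startingOffsets": "latest",
    }


def start_ingestion(spark, bootstrap_servers, topic, output_path, checkpoint_path):
    """Start a streaming query that copies Kafka events into Parquet files.

    `spark` is an existing SparkSession. The paths would typically be
    hdfs:// URIs; the values used here are placeholders.
    """
    events = (
        spark.readStream.format("kafka")
        .options(**kafka_source_options(bootstrap_servers, topic))
        .load()
        # Kafka delivers key and value as binary, so cast them to strings.
        .selectExpr(
            "CAST(key AS STRING) AS key",
            "CAST(value AS STRING) AS value",
            "timestamp",
        )
    )
    return (
        events.writeStream.format("parquet")
        .option("path", output_path)
        # Offsets and sink metadata are tracked here so a restart can resume.
        .option("checkpointLocation", checkpoint_path)
        .outputMode("append")
        .start()
    )
```

A call might look like `start_ingestion(spark, "broker:9092", "events", "hdfs:///data/events", "hdfs:///checkpoints/events")`, where every argument is a placeholder.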
Exploratory data analysis on the Titanic dataset covering filtering, aggregation, missing value handling, and survival rate insights using Pandas.
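The steps named above can be sketched in a few lines of Pandas. This is an illustrative sketch rather than the project's code: the column names (`Survived`, `Pclass`, `Sex`, `Age`) are assumed to follow the common Kaggle Titanic layout, the helper names are hypothetical, and the six rows are made-up sample data, not real passenger records.

```python
import pandas as pd

# Hypothetical sample rows standing in for the real dataset.
sample = pd.DataFrame(
    {
        "Survived": [0, 1, 1, 0, 1, 0],
        "Pclass": [3, 1, 3, 3, 1, 2],
        "Sex": ["male", "female", "female", "male", "female", "male"],
        "Age": [22.0, 38.0, None, 35.0, None, 54.0],
    }
)


def fill_missing_age(df: pd.DataFrame) -> pd.DataFrame:
    """Missing value handling: replace missing ages with the median age."""
    out = df.copy()
    out["Age"] = out["Age"].fillna(out["Age"].median())
    return out


def survival_rate_by(df: pd.DataFrame, column: str) -> pd.Series:
    """Aggregation: the mean of a 0/1 column is the survival rate per group."""
    return df.groupby(column)["Survived"].mean()


cleaned = fill_missing_age(sample)
adults = cleaned[cleaned["Age"] >= 18]  # Filtering: keep passengers aged 18+
rates_by_sex = survival_rate_by(cleaned, "Sex")
print(rates_by_sex)
```

On the real dataset, the same `survival_rate_by` call could be repeated with `"Pclass"` to compare survival across ticket classes.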
Certifications
Contact
Open to new opportunities, collaborations, or just a good conversation about data engineering.