Hi, I'm Jatin Valecha
Data & Software Engineer

Building reliable data pipelines and scalable software. 3.5+ years of experience turning raw data into business value — and sharing the journey openly.

India · Open to Relocate · Data Engineering · Oracle Cloud Certified

I'm a Data Engineer and Software Engineer with 3.5+ years of experience designing pipelines, building analytics infrastructure, and engineering backend systems.

Most recently, I worked at Infinite Computer Solutions, where I progressed from Software Engineer to Associate Data Analyst, working across PySpark, Spark Structured Streaming, Apache Airflow, HDFS, and Oracle Cloud.

I believe in learning in public — documenting wins, failures, and everything in between. You'll find that on my LinkedIn feed.

3.5+
Years of experience
2
Oracle Cloud certifications
Pipelines debugged at 2am

Data Engineering

PySpark · Spark Streaming · Apache Airflow · HDFS · Kafka

Cloud & Databases

Oracle Cloud · OCI · SQL · PostgreSQL · Oracle DB

Languages

Python · SQL · Java · Shell

Tools & Libraries

Pandas · NumPy · Git · Linux · ReportLab

Oct 2024 – Oct 2025
Noida, India
Associate Data Analyst
Infinite Computer Solutions
  • Designed and maintained PySpark batch and streaming pipelines on HDFS
  • Automated workflow orchestration using Apache Airflow DAGs
  • Reduced pipeline latency by optimising Spark job configurations
  • Collaborated on Oracle Cloud AI Vector Search and Data Science modules

Jun 2022 – Sep 2024
Software Engineer
Infinite Computer Solutions
  • Built backend components and REST APIs for enterprise applications
  • Performed data migration and schema management tasks using SQL
  • Contributed to QA processes and internal tooling

Spark Streaming Pipeline

Real-time data ingestion pipeline using Spark Structured Streaming, processing events from Kafka into HDFS with fault-tolerant checkpointing.

PySpark · Kafka · HDFS · Airflow
Titanic EDA — Pandas Analysis

Exploratory data analysis on the Titanic dataset covering filtering, aggregation, missing value handling, and survival rate insights using Pandas.

Python · Pandas · Jupyter
Oracle Cloud Infrastructure — AI Vector Search
Oracle · 2024
Oracle Certified
Oracle Cloud Infrastructure — Data Science
Oracle · 2024
Oracle Certified

Let's connect

Open to new opportunities, collaborations, or just a good conversation about data engineering.