🚀 Architecting the Future of Data • Scalable Pipelines & Cloud Solutions
Senior Data Engineer

SONU PARMAR

Spark · Databricks · Cloud Platforms

Building scalable data pipelines and automation solutions that transform raw data into actionable insights.


About Me

Senior Data Engineer with 5+ years of experience building large-scale data pipelines and cloud-native platforms.

I specialize in designing and implementing scalable data solutions using Apache Spark, Databricks, and Azure cloud services. My work focuses on transforming complex data challenges into efficient, automated workflows.

Beyond traditional data engineering, I leverage Generative AI and LLMs to accelerate development cycles, automate documentation, and build intelligent data pipelines.


Work History

Recrosoft Technologies

Bhubaneshwar (Remote)
Data Engineer · 2024 - Present
  • Reduced pipeline processing time from 8+ hours to ~30 minutes
  • Saved ~135 engineering hours and ~5 TB storage through automated decommissioning
  • Achieved 100% write-storage savings with a data federation approach
  • Reduced data availability latency from ~2 hours to ~12 seconds
Databricks · Spark · Unity Catalog · AWS

IBM India Pvt. Ltd.

Bengaluru (Remote)
Data Engineer · 2021 - 2024
  • Reduced memory consumption from ~37 GB to ~500 MB
  • Improved pipeline performance by ~74%
  • Saved ~1,800 engineering hours and ~$44K through automation
  • Built SSL monitoring protecting ~$1M in business systems
Azure · Databricks · ADF · Spark · SQL

Technical Expertise

Programming

Python · PySpark · Spark SQL · SQL · JavaScript · PowerShell

Big Data & Processing

Apache Spark · Databricks · Azure Data Factory · Kafka · Airflow

Cloud Platforms

Azure · ADLS · Delta Lake · Synapse · AWS · Redshift · Unity Catalog

Data Engineering

ETL/ELT · Batch Processing · Streaming · Data Modeling · CDC

CI/CD & DevOps

GitHub · Azure DevOps · Azure Monitor · Log Analytics

Automation

n8n · Power Automate · Selenium · UiPath
Logic Apps

Certifications

Azure Solutions Architect Expert

Microsoft

Issued Jan 2024

Azure Administrator Associate

Microsoft

Issued Nov 2023

SAP Certified Application Associate - Data Integration

SAP

Issued Aug 2022

Power BI Data Analyst Associate

Microsoft

Issued Jul 2022

Azure Data Engineer Associate

Microsoft

Issued Jun 2022

AWS Certified Cloud Practitioner

Amazon Web Services

Issued Jul 2021

Azure Fundamentals

Microsoft

Issued Apr 2021

Featured Work

AI Platform

Autogeniee

End-to-end Telegram-based AI chatbot for multi-modal content generation. Integrates locally hosted diffusion models and LLMs via Ollama for text, image, and video generation.

n8n · Node.js · Ollama · LoRA · Telegram API

Desktop App

Xraktor

Offline Windows app for OCR-based text extraction, built with Python and Tesseract. Focuses on data privacy through local processing, with a modular architecture for batch workflows.

Python · Tesseract · OCR · Windows

Awards & Achievements

GEM Award 2025

Recro

Exceptional performance and commitment to client success

Performance Award 2024

IBM

Leadership skills and creative solutions for client success

Entrepreneur Award 2023

IBM

Exceptional contribution in delivering innovative solutions

Hall of Fame Q4 2022

Royal Dutch Shell

Innovative automation solutions saving time and resources

Performance Award 2021

IBM

Innovative and impactful solutions

Let's Connect

I'm always open to discussing new projects, opportunities, or just having a chat about data engineering.