Job Title: Sr. Data Engineer
Location: Mumbai
Experience: 2-4 Years
Employment Type: Full-time

Position Overview:

We are looking for a highly skilled and hands-on Senior Data Engineer to join our growing data engineering practice in Mumbai. This role requires deep technical expertise in building and managing enterprise-grade data pipelines, with a primary focus on Amazon Redshift, AWS Glue, and data orchestration using Airflow or Step Functions. You will be responsible for building scalable, high-performance data workflows that ingest and process multi-terabyte-scale data across complex, concurrent environments.
The ideal candidate thrives on solving performance bottlenecks, has led or participated in data warehouse migrations (e.g., Snowflake to Redshift), and is confident interfacing with business stakeholders to translate requirements into robust data solutions.

Key Responsibilities:

●      Design, develop, and maintain high-throughput ETL/ELT pipelines using AWS Glue (PySpark), orchestrated via Apache Airflow or AWS Step Functions.
●      Own and optimize large-scale Amazon Redshift clusters, and manage high-concurrency workloads for a very large user base.
●      Lead and contribute to migration projects from Snowflake or traditional RDBMS to Redshift, ensuring minimal downtime and robust validation.
●      Integrate and normalize data from heterogeneous sources including REST APIs, AWS Aurora (MySQL/Postgres), streaming inputs, and flat files.
●      Implement intelligent caching strategies and leverage EC2 and serverless compute (Lambda, Glue) for custom transformations and processing at scale.
●      Write advanced SQL for analytics, data reconciliation, and validation, demonstrating strong SQL development and tuning experience.
●      Implement comprehensive monitoring, alerting, and logging for all data pipelines to ensure reliability, availability, and cost optimization.
●      Collaborate directly with product managers, analysts, and client-facing teams to gather requirements and deliver insights-ready datasets.
●      Champion data governance, security, and lineage, ensuring data is auditable and well-documented across all environments.

Required Qualifications & Experience:


●      2-4 years of core data engineering experience, with a focus on hands-on Amazon Redshift performance tuning and large-scale cluster management.
●      Demonstrated experience handling multi-terabyte Redshift clusters, concurrent query loads, and managing complex workload segmentation and queue priorities.
●      Strong experience with AWS Glue (PySpark) for large-scale ETL jobs.
●      Solid understanding of, and implementation experience with, workflow orchestration using Apache Airflow or AWS Step Functions.
●      Strong proficiency in Python, advanced SQL, and data modeling concepts.
●      Familiarity with CI/CD pipelines, Git, DevOps processes, and infrastructure-as-code concepts.

Preferred/Bonus Skills:


●      Experience with Amazon Athena, Lake Formation, or S3-based data lakes.
●      Hands-on participation in Snowflake, BigQuery, or Teradata migration projects.
●      AWS Certifications such as:
○      AWS Certified Data Analytics – Specialty
○      AWS Certified Solutions Architect – Associate/Professional
●      Exposure to real-time streaming architectures or Lambda architectures.


Soft Skills & Expectations:


●      Excellent communication skills — must be able to confidently engage with both technical and non-technical stakeholders, including clients.
●      Strong problem-solving mindset and keen attention to performance, scalability, and reliability.
●      Demonstrated ability to work independently, lead tasks, and take ownership of large-scale systems.
●      Comfortable working in a fast-paced, dynamic, and client-facing environment.

What We Offer:


●      Opportunity to work on cutting-edge Generative AI projects across industries.
●      Collaborative, startup-like work environment with flexibility and ownership.
●      Exposure to full-stack AI/ML project lifecycle and client-facing roles.
●      Competitive compensation and learning opportunities in the AWS AI ecosystem.

About Oneture Technologies

Founded in 2016, Oneture is a cloud-first, full-service digital solutions company that helps clients harness the power of Digital Technologies and Data to drive transformations and turn ideas into business realities.

Our team is full of curious, full-stack, innovative thought leaders who are dedicated to providing outstanding customer experiences and building authentic relationships. We are compelled by our core values to drive transformational results from Ideas to Reality for clients across all company sizes, geographies, and industries. The Oneture team delivers full lifecycle solutions, from ideation, project inception, and planning through deployment to ongoing support and maintenance.

Our core competencies and technical expertise include cloud-powered Product Engineering, Big Data, and AI/ML. Our deep commitment to value creation for our clients and partners and our “Startups-like agility with Enterprises-like maturity” philosophy have helped us establish long-term relationships with our clients and enabled us to build and manage mission-critical platforms for them.