Job description
Job Description Summary

Data Architect with 13+ years of experience is accountable for driving productivity and ensuring successful delivery of web-based development initiatives across assigned products.

Key Responsibilities

Design and define enterprise-level data architecture aligned with business and technology strategy
Architect and implement cloud-native data platforms on AWS (data lakes, data warehouses, lakehouse architectures)
Lead design and optimization of batch and real-time data pipelines using Spark and related frameworks
Establish data modeling standards (conceptual, logical, physical) for analytical and operational use cases
Ensure data quality, governance, security, and compliance across platforms
Provide architectural guidance for ETL/ELT frameworks, metadata management, and lineage
Collaborate with data engineering, analytics, DevOps, and business stakeholders
Evaluate and recommend tools, frameworks, and best practices for data stack modernization
Mentor senior data engineers and review solution designs and code
Support performance tuning, cost optimization, and scalability of AWS data services

Required Skills & Experience

12–17 years of overall IT experience with 8+ years in data architecture / big data roles
Strong hands-on expertise with:
    Apache Spark (batch and streaming)
    Python (data processing, orchestration, automation)
    Advanced SQL (complex queries, performance tuning)
Extensive experience with AWS data stack, including:
    S3, Redshift, Glue, Athena, EMR
    Kinesis / MSK (streaming)
    Lambda, Step Functions
    IAM, CloudWatch, encryption, security best practices
Strong knowledge of data warehousing, data lakes, and lakehouse architectures
Experience with ETL/ELT tools and frameworks
Solid understanding of data governance, metadata, cataloging, and data quality frameworks
Proven ability to work with large-scale, high-volume, high-velocity data systems

Good to Have

Experience with Iceberg / Delta Lake / Hudi
Exposure to Snowflake, Databricks, or other modern analytics platforms
Knowledge of CI/CD for data pipelines and Infrastructure as Code (Terraform/CloudFormation)
Experience in domain-driven data architecture (banking, retail, healthcare, etc.)
AWS Certifications (Data Analyti...