ThoughtsWin Systems
Data Architect - Python/Spark
Job Location
Jaipur, India
Job Description
Job Title: Data Architect
Experience: 8 Years
Location: Jaipur
Employment Type: Full-Time

Job Summary:
We are looking for an experienced Data Architect with a strong background in data and analytics to design and implement scalable, high-performance data solutions. The ideal candidate must have extensive hands-on experience with Databricks, along with expertise in Azure, AWS, and modern data technologies. This role requires deep technical knowledge of big data processing, cloud architectures, and data engineering.

Key Responsibilities:
- Architect & Implement Data Solutions: Design scalable data lakes, data warehouses, and real-time streaming architectures.
- Databricks Expertise: Build and optimize data pipelines, Delta Lake architectures, and advanced analytics solutions using Databricks.
- Cloud Data Engineering: Develop and manage cloud-native data platforms on Azure (Synapse, Data Lake, Data Factory, Cosmos DB) and AWS (Redshift, Glue, S3, Athena).
- ETL/ELT Pipelines: Design and automate data ingestion, transformation, and processing workflows using Databricks, Apache Spark, and cloud-native ETL tools.
- Big Data Processing & Analytics: Work with Apache Spark, PySpark, Scala, Hadoop, and Kafka for large-scale data processing.
- Data Governance & Security: Implement data quality, lineage, access control, and compliance policies (GDPR, HIPAA, etc.).
- Performance Optimization: Fine-tune Spark workloads, storage optimization, and query performance in Databricks.
- Collaboration & Stakeholder Management: Partner with engineering, analytics, and business teams to align data strategies with business

Skills & Qualifications:
- 8 years of experience in data architecture, data engineering, or analytics.
- Mandatory hands-on experience with Databricks (Delta Lake, Spark, MLflow, Workflows, SQL Analytics, and Data Engineering Pipelines).
- Strong expertise in Azure Data Services (Synapse, Data Lake, Data Factory) and AWS (Redshift, Glue, S3, Athena, Kinesis).
- Proficiency in SQL, Python, and Scala for data engineering and transformation.
- Experience with NoSQL databases (MongoDB, DynamoDB) and real-time data streaming (Kafka, Event Hubs, Kinesis).
- Understanding of data governance, lineage, and compliance best practices.
- Knowledge of CI/CD for data pipelines, DevOps, and Infrastructure as Code (Terraform, CloudFormation).
- Strong analytical, problem-solving, and communication skills.

(ref:hirist.tech)
Location: Jaipur, IN
Posted Date: 5/8/2025
Contact Information
Contact: Human Resources, ThoughtsWin Systems