Architect, Cloudera Spark – Big Data Developer, Spark

Remote Full-time
Job Description: • Operate on high-complexity data engineering and data analytics projects • Ensure scalability and security of data architectures in enterprise environments • Develop and optimize distributed data pipelines in enterprise contexts Requirements: • Big Data Architect with deep experience in distributed environments and Cloudera technologies • Proven experience with Big Data architectures based on Cloudera Data Platform (CDP) and Apache Spark • Strong knowledge of HDFS, Hive, Impala, HBase, Kafka and NiFi • Proficiency with YARN, Ranger, Knox, Atlas and data security and governance tools • Experience in data modeling and design of ETL/ELT pipelines • Knowledge of Scala, Python and SQL • Good understanding of microservices, containerization (Docker, Kubernetes) and REST APIs • Familiarity with Linux/Unix environments and advanced scripting • Experience with monitoring tools and performance tuning for Spark and Cloudera • Experience in Public Administration or regulated environments is a plus • Big Data Developer with solid experience in Cloudera and Apache Spark environments • At least 3 years' experience developing applications on Apache Spark (Core, SQL, Streaming) • Deep knowledge of the Cloudera ecosystem (HDFS, Hive, Impala, Oozie, NiFi) • Strong proficiency in Scala and Python • Experience managing and optimizing Spark jobs in clustered environments • Knowledge of Kafka for real-time ingestion • Familiarity with Git, Jenkins, arenaflex/CD and DevOps best practices • Experience in query tuning, data ingestion pipelines and data transformation • Basic knowledge of Linux, shell scripting and distributed systems • Attention to detail and ability to work in structured environments • Good communication skills and a team-oriented attitude • Commitment to continuous improvement and adoption of quality standards Benefits: • Remote work Apply tot his job Apply tot his job
Apply Now →

Similar Jobs

Lead Big Data Engineer - PySpark

Remote

[Remote] Bilingual Customer Service Representative-SDU-Work From Home-TX ONLY

Remote

[Hiring] EHR Application II Analyst @BJC HealthCare

Remote

Blockchain Data Wizard, Analyst or Scientist

Remote

VDC/BIM Manager - HVAC - Remote Option

Remote

Senior/Lead Bioinformatics Scientist (Development)

Remote

Bioinformatics Developer 6314 Remote/Teleworker US

Remote

Native Japanese Chat Support Consultant, crypto; Remote

Remote

Tech Lead in Blockchain Consulting

Remote

Principal, Board Governance Advisor (Legal Counsel Support) – Hoag Hospital – Newport Beach, CA

Remote

Remote Data Entry Specialist – Join arenaflex for a Dynamic Career in Data Management and Enjoy the Flexibility of Working from Home

Remote

Advisor, Technical

Remote

Mid-level QA Automation Engineer, C#

Remote

Data Entry Clerk - Fully Remote Opportunity with a Dynamic Company - Accurate Data Management and Administrative Support

Remote

Experienced Remote Healthcare Customer Service Representative for Dynamic Government Client Engagement – Delivering Exceptional Service Experiences from Home

Remote

Experienced Part-Time Remote Data Entry Specialist – Accurate and Efficient Data Management Professional for arenaflex

Remote

Part Time blithequark Remote Careers (Remote Data Entry Jobs), blithequark Data Entry Jobs (Remote) $25/hour

Remote

Global 6000 Flight Attendant - Southern California – Amazon Store

Remote

**Experienced Database Analyst – Data Insights and Analytics at arenaflex**

Remote

Experienced Remote Sales and Business Development Consultant - Community Empowerment through Legal, Identity, and Insurance Solutions

Remote
← Back