الخبرة : 0-1 سنة
الراتب : not
المكان : egybt
We’re looking for a Cloudera Developer to design, build, and optimize data pipelines and processing jobs on our Cloudera platform.
This role focuses on leveraging NiFi, Spark, Hive, and Impala to deliver high-quality data flows.
Experience integrating with other enterprise data platforms (such as TIBCO) is a plus, but not mandatory.
Key Responsibilities
· Develop, test, and maintain data ingestion flows using Apache NiFi.
· Build Spark-based data transformation jobs (Scala or PySpark) for large-scale processing.
· Implement API-based integrations and data exchange with enterprise platforms.
· Ensure data security, quality, and compliance with governance standards (Ranger, Atlas, SDX).
· Monitor and troubleshoot data pipelines for performance and reliability.
· Contribute to automation and CI/CD processes for data pipelines.
· Document data flows, schemas, and transformations for ongoing support.
Required Skills & Experience
· 3+ years in data engineering or development on Cloudera/CDP platforms.
· Hands-on experience with Apache NiFi, Apache Spark (Scala or PySpark), Hive, Impala.
· Knowledge of data governance and security controls (Ranger, Atlas, SDX).
· Experience integrating with external platforms via REST APIs or message queues.
· Familiarity with version control (Git) and CI/CD pipelines.
· Ability to troubleshoot data flow, cluster, and performance issues.
Preferred Qualifications
· Familiarity with TIBCO Data Hub/Data Science or other enterprise data integration platforms (considered a plus).
· Cloudera Certified Associate (CCA) or Cloudera Data Platform certifications.
· Experience with cloud-native deployments or hybrid data platforms.
· Knowledge of containerization (Docker/Kubernetes) for data workloads.