- Home
- Remote Jobs
- Hadoop Developer
Hadoop Developer
Job summary
Work model
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
As we continue to grow, we're looking for a skilled Hadoop Developer to join our dynamic team and contribute to our mission of transforming business processes through technology.
This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.
Position Details
- Location: 100% Remote (Continental United States)
- Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
- Salary: $100K - $150K / Annum
- Experience: 5+ years
- Employment Type: Full-time, direct W2
- Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.
STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an experienced professional to design, build, and operate large-scale data processing pipelines and analytics platforms on Hadoop and related big-data ecosystems. In this role you will be responsible for ingesting, transforming, and analyzing massive volumes of structured and unstructured data to support enterprise analytics, machine learning, and reporting workloads.
Key Responsibilities
- Design, develop, and operate end-to-end big-data pipelines on Hadoop, ingesting data from a diverse mix of relational, file-based, streaming, and API-driven sources.
- Build robust ETL/ELT workflows using Apache Spark, Hive, Pig, and Sqoop, with strong attention to data quality, idempotency, error handling, and recoverability.
- Develop high-throughput streaming data pipelines using Kafka, Spark Streaming, or Flink.
- Optimize Spark and MapReduce jobs through careful tuning of partitioning, memory, serialization, and skew handling.
- Design and maintain data models and storage layouts on HDFS, Hive, HBase, and modern lakehouse formats (Parquet, ORC, Delta, Iceberg, Hudi).
- Implement data governance, lineage, and quality controls.
- Build robust monitoring, alerting, and logging strategies for big-data pipelines.
- Partner with data scientists and analysts to deliver curated, reliable, and well-documented datasets.
- Automate pipeline orchestration using Airflow, Oozie, or similar workflow engines.
- Continuously evaluate and adopt new technologies in the big-data and cloud ecosystem (EMR, Databricks, Snowflake, BigQuery).
- Lead performance reviews and architecture audits of existing pipelines.
- Document data architectures, schemas, pipeline behaviors, and operational runbooks.
- Mentor junior engineers and contribute to the team's engineering standards and best practices.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related technical discipline.
- Five or more years of professional experience designing and operating big-data pipelines on Hadoop.
- Strong hands-on expertise with Apache Spark (Scala, Python, or Java) in production environments.
- Solid experience with Hive, HDFS, Sqoop, HBase, and the broader Hadoop ecosystem.
- Hands-on experience with streaming data platforms such as Kafka, Spark Streaming, or Flink.
- Strong SQL skills and experience working with both relational and NoSQL data stores.
- Experience with workflow orchestration tools such as Airflow or Oozie.
- Solid understanding of distributed systems concepts, including partitioning, replication, and fault tolerance.
- Strong scripting skills in Python or Shell.
- Excellent troubleshooting, debugging, and documentation skills.
Preferred Qualifications
- Experience operating Hadoop on cloud platforms such as AWS EMR, Azure HDInsight, or Databricks.
- Familiarity with modern lakehouse formats (Delta, Iceberg, Hudi).
- Exposure to data governance tooling such as Apache Atlas or Collibra.
- Experience with Kubernetes-based data platforms (Spark-on-K8s, Trino).
- Hands-on experience with CI/CD and infrastructure-as-code in data engineering workflows.
How to Apply
For immediate consideration, please send your resume to [email protected] or contact us at (908) 505-3899. Learn more about Bright Vision Technologies at www.bvteck.com.
Bright Vision Technologies is an equal opportunity employer and place a high value on diversity and inclusion at our company.