Design and develop high-throughput, low-latency data processing pipelines to quickly ingest and make data available on the platform across various distributed data stores
Analyze large datasets to identify opportunities to tune and improve the system
Experiment with Hadoop frameworks such as Hive, Pig, and Scalding to identify the optimal approach for extracting valuable insights from massive datasets
Tools We Use:
Scala
Hadoop (Hive, Pig, Scalding, Spark)
Kafka
MySQL, Redis, Vertica, Aerospike
Qualifications
Requirements:
Bachelor's or Master's degree in Computer Science or a related field
3+ years of experience ingesting, processing, storing, and querying large datasets
Professional Hadoop ecosystem experience, including storage optimization and job performance tuning
Expertise in Java, Python, or a similar language; functional programming experience is a plus
Passion for code correctness and an intuition for which data values to expect in a business context
Additional Information
All your information will be kept confidential according to EEO guidelines.