Data Engineer

Posted 3 years ago

Design use cases and perform technical design and implementation. Work on batch and streaming applications for data processing using Apache Spark. Work with native AWS Cloud Services like Glue, Sqoop, Athena in loading the data into different databases such as Snowflake, Hive, Mongo DB and SQL databases. Design and develop the data ingestion pipelines to move the data from source to target using Python programming language. Analyze the data in a snowflake database with SQL queries and analyze the trends in the data. Implement the continuous integration and deployment pipelines (CI/CD) with Maven and Jenkins. Develop spark programs in Scala to cleanse the data in HDFS obtained from heterogeneous data sources in various formats such as JSON, Avro, and Parquet to make it suitable for ingestion into Hive schema for analysis. Will work in Manchester, CT and/or various unanticipated client sites throughout the U.S. Must be willing to travel and/or relocate.

Mail Resume to HR Dept., Cyma Systems, Inc., 360 Tolland Turnpike, Suite 2D, Manchester, CT 06042.

Apply For This Job