Object Technology Solutions Inc (OTSI) has an immediate opening for an Sr. Data Engineer
Location : Irving TXOnsite
JOB DESCRIPTION :
- Analyze and understand data sources & APIs
- Design and Develop methods to connect & collect data from different data sources
- Design and Develop methods to filter / cleanse the data
- Design and Develop SQL Hive queries APIs to extract data from the store
- Work closely with data Scientists to ensure the source data is aggregated and cleansed
- Work with product managers to understand the business objectives
- Work with cloud and data architects to define robust architecture in cloud setup pipelines and work flows
- Work with DevOps to build automated data pipelines
Total Experience Required
5 years of experience with Hadoop (Cloudera) / big data technologiesAdvanced knowledge of the Hadoop ecosystem and Big Data technologies Handson experience with the Hadoop ecosystem (HDFS MapReduce Hive Pig Impala Spark Kafka Kudu Solr)Experience on designing and developing Data Pipelines for Data Ingestion or Transformation using Java or Scala or Python.Experience with Spark programming (pyspark or scala or java)Expert level building pipelines using Apache Spark Familiarity with core provider services from AWS Azure or Google Cloud Platform preferably having supported deployments on one or more of these platformsHandson experience with Python / Pyspark / Scala and basic libraries for machine learning is required;Exposure to containerization and related technologies (e.g. Docker Kubernetes)Exposure to aspects of DevOps (source control continuous integration deployments etc.)Proficient in programming in Java or Python with prior Apache Beam / Spark experience a plus.System level understanding Data structures algorithms distributed storage & computeCando attitude on solving complex business problems good interpersonal and teamwork skillsPossess team management experience and have led a team of data engineers and analysts.Experience in Snowflake is a plus.Desirable Technical Skills
Familiarity with HTTP and invoking webAPIsExposure to machine learning engineeringExposure to NLP and text processingExperience with pipelines job scheduling and workflow managementPersonal Skills
Experienced in managing work with distributed teams
Experience working in SCRUM methodologyProven sense of high accountability and selfdrive to take on and see through big challengesConfident takes ownership willingness to get the job doneExcellent verbal communications and cross group collaboration skillsAbout us
OTSI is a leading global technology company offering solutions consulting and managed services for businesses worldwide since 1999. OTSI serves clients from its 15 offices across 6 countries around the globe with a FollowtheSun model. Headquartered in Overland Park Kansas we have a strong presence in North America Central America and AsiaPacific with a Global Delivery Center based in India. These strategic locations offer our customers the competitive advantages of onshore nearshore and offshore engagement and delivery options with 24 / 7 support. OTSI works with 100 enterprise customers of which many are Fortune ranked OTSI focuses on industry segments such as Banking Financial Services & Insurance Healthcare & Life Sciences Energy & Utilities Communications & Media Entertainment Engineering & Telecom Retail & Consumer Services Hitech Manufacturing Engineering transport logistics Government Defense & PSUs.
Our Center of Excellence :
Data & AnalyticsDigital TransformationQA & AutomationEnterprise ApplicationsDisruptive TechnologiesKey Skills
Apache Hive,S3,Hadoop,Redshift,Spark,AWS,Apache Pig,NoSQL,Big Data,Data Warehouse,Kafka,Scala
Employment Type : Full Time
Experience : 5 years
Vacancy : 1