Supports the development of the organizational data strategy and data management in line with strategic business objectives, culture, and values.
Supports DBA/Data Engineering/Informatica operations in the production and oversight of the big data platforms, following standards, practices, policies, and processes.
Promotes good administration practices and the management of data as a strategic asset.
The Hadoop/Informatica administrator is responsible for small development tasks, fixes, testing, and maintaining architectures such as NoSQL databases and Hadoop/Spark processing systems.
Oversees data acquisition, develops data set processes, and manages resources and security.
Troubleshoots application errors and ensures that they do not recur.
Minimum 3 years of direct experience in the administration of the Apache Hadoop framework (Spark, HBase, HDFS, Hive, Parquet, Sentry, Impala, and Sqoop), data warehouses, and Informatica, ideally in financial services.
Effectively maintaining a data pipeline architecture that accounts for security, scalability, maintainability, and performance.
Deploying and maintaining a Hadoop cluster, adding and removing nodes using cluster monitoring tools such as Ganglia, Nagios, or Cloudera Manager, configuring NameNode high availability, and keeping track of all running Hadoop jobs. Hadoop administration skills: Cloudera Manager, Cloudera Navigator, and HUE.
Strong Unix/Red Hat skills; Python scripting highly beneficial.
Excellent track record of administering systems.
Knowledge, Skills, and Attributes:
Knowledge and Skills
Good knowledge of Big Data platforms, Data Warehouse and Informatica frameworks, policies, and procedures.
Proficient understanding of distributed computing principles.
Good knowledge of Data Warehouse/RDBMS and Big Data querying tools such as Pig, Hive, and Impala.
Good knowledge of Informatica BDM, EDC, EDQ and Axon.
Experience with Cloudera, NoSQL databases such as HBase, and Big Data ML toolkits such as SparkML.
SQL knowledge beneficial.
Experience with cloud technologies such as AWS and Azure beneficial.