Cloudera Systems Administrator
PRINCIPAL ACCOUNTABILITIES: The incumbent s
accountabilities include, but are not limited to, the following:
System Maintenance: Independently installs and maintains Big Data (Cloudera, Horton Works, etc.) clusters in high available, load balanced configuration across multiple (Production, User Acceptance, Development) environments.
Systems Implementation: Independently implement complex technical system design changes for Big Data environments.
Procedural: Updates and maintains operations manuals, inventories, and written procedures relative to installing, maintaining, and using the Big Data environments. Works closely with Change management, Configuration Management and Security Management in developing
and maintaining said procedures.
Technology Consulting: Leads problem resolution and coordination with Level 2 support. Performs integration of component level systems into solutions as documented/directed by Lead Big Data Administrator. Liaison with Infrastructure, Security and application
Research and Development: Researches new technologies to benefit the business. Works with Lead/Expert Big Data Administrator to develop recommendations and provide implementation support.
Required: This position requires a BA/BS in Computer Science, Information Systems, Information Technology or related field with 3-5 years of prior experience in software development, Data Warehousing and Business Intelligence OR equivalent experience.
Administrator experience working with batch processing and tools in the Hadoop technical stack (e.G. MapReduce, Yarn, Pig, Hive, HDFS, Oozie)
Administrator experience working with tools in the stream processing technical stack (e.G. Spark, Storm, Sama, Kafka, Avro)
Administrator experience with NoSQL stores (e.G. ElasticSearch, Hbase, Cassandra, MongoDB, CouchDB)
Expert knowledge on AD/LDAP security integration with Big Data
Hands-on experience with at least one major Hadoop Distribution such as Cloudera, Horton Works, MapR or IBM Big Insights
Advanced experience with SQL and at least two major RDBMS s
Advanced experience as a systems integrator with Linux systems and shell scripting
Advanced experience doing data related benchmarking, performance analysis and tuning, troubleshooting
Excellent verbal and written communication skills
System usage and optimization tools such as Splunk
ETL solution experience, preferably on Hadoop
Experience with industry leading Business Intelligence tools
Big Data Administrator (certification)
Experience with Machine Learning and Artificial Intelligence