Experian Recruitment BigData Platform Engineer 2022

Experian Recruitment BigData Platform Engineer 2022
Selenium Online Training with Project (by GC Reddy)
Company: Experian
Job Role: BigData Platform Engineer
Eligible: 2021/2022 Batch
Location: Hyderabad, Telangana
Job Description For Experian Recruitment 2022
- Ascend Sandbox is an integrated BigData and analytics platform. Our clients, some of the America’s largest financial institutions, use this platform to understand consumer credit behavior and build
- models for credit decisioning, marketing and account review purposes. We are looking for an expert platform engineer with in-depth knowledge on BigData analytics & public cloud platforms to help us run this peta-byte scale BigData platform and provide the best possible experience for our clients.
What You’ll Do Here
- Responsible for continuous platform enhancements, upgrades, availability, reliability and security of the Ascend Sandbox platform.
- Provide end-to-end observability of our Ascend Sandbox platform.
- Responsible for resolving incidents reported by Sandbox users and take preventive actions.
- Help Sandbox users with troubleshooting failed MapReduce/Hive/Spark applications.
- Help Sandbox users to improve the performance and optimize their MapReduce/Hive/Sparkapplications.
- Participate in follow-the-sun on-call rotation to address any emergency production incidentsaffecting the Sandbox platform.
Skills Required For Experian Recruitment 2022
- Deep understanding of Linux, networking fundamentals and security.
- Solid professional coding experience with at least one scripting language – Shell, Python etc.
- Experience working with AWS cloud platform and infrastructure.
- Experience managing large BigData clusters in production (at least one of — Cloudera,Hortonworks, EMR)
- Excellent knowledge and solid work experience providing observability for BigData platformsusing tools like Prometheus, InfluxDB, Dynatrace, Grafana, Splunk etc.
- Experience managing BigData clusters with compute decoupled from storage (Eg: S3) on publiccloud platforms.
- Expert knowledge on Hadoop Distributed File System (HDFS) and Hadoop YARN.
- Decent knowledge of various Hadoop file formats like ORC, Parquet, Avro etc.
- Deep understanding of Hive (Tez), Hive LLAP, Presto and Spark compute engines.
- Ability to understand query plans and optimize performance for complex SQL queries on Hiveand Spark.
- Hands on experience supporting Spark with Python (PySpark) and R (SparklyR, SparkR)languages.
- Experience working with Data Analysts, Data Scientists and at least one of these relatedanalytical applications like SAS, R-Studio, JupyterHub, H2O etc.
- Able to read and understand code (Java, Python, R, Scala), but expertise in at least one scripting language.
- Experience managing JVM based applications in production.
- Excellent written and oral communication.Nice to have skills:
- Experience with workflow management tools like Airflow, Oozie etc.
- Implementation history of Terraform, Packer, Ansible, Chef, Jenkins or any other similar tooling.
- Prior working knowledge of Active Directory and Windows OS based VDI platforms like Citrix,AWS Workspaces etc.
- Professional coding experience in at least one programming language, preferably Java.
- Experience with other public cloud platforms like Azure and GCP is a bonus.
Selection Process For Experian Recruitment 2022
- Aptitude Written Test Online
- Group Discussion
- Technical Interview
- HR Interview