Site Reliability Engineer (SRE) | Bucureşti

Randstad Romania

Job description

Location: Bucureşti
Industry: IT / Telecom
Reference number: 1760 / 2421


Who are we?
- Our purpose is to make every customer experience personalized and profitable, delivering value to digital transformation, service, marketing, and compliance teams and enabling next-generation experiences in many countries.

- Our platform continually processes customer data from all systems, enriches it with real-time insights, and transforms it into a patented Micro-Database™, one for every business entity. To maximize performance, scale, and security, every micro-DB is compressed and individually encrypted. It is then delivered in milliseconds to fuel quick, effective, and pleasing customer interactions. Global 2000 companies, including AT&T, Vodafone, and Sky, deploy our platform in weeks to deliver outstanding multi-channel customer service, minimize churn, achieve hyper-segmentation, and assure data compliance.

In short, we're passionate about big data systems and data management platforms. We count on our site reliability engineers (SREs) to give our customers the infrastructure and monitoring tools they need to maintain high availability, reliability, and stellar performance while pursuing their objectives.
As we expand our customer deployments, we are currently seeking an SRE to deliver insights from massive-scale data in real time. Our SREs are responsible for creating, configuring, and maintaining monitoring environments and tools. They are experts in analyzing production system metrics, identifying the root cause of system performance issues, and taking reactive and proactive actions to keep the system in a healthy state.

What will your job look like?
The Site Reliability Engineer will work with the DevOps and Software Support Application (T2) teams, as well as Incident and Production Managers, and will be responsible for monitoring customer production systems to ensure that services are continuously available and meet their Service Level Agreement (SLA) targets.

The Site Reliability Engineer's responsibilities include (but are not limited to):
• Provide 24/7 monitoring of customers' production systems
• Create monitoring dashboards and set thresholds for tracking overall system health
• Provide SLA infrastructure and dashboards for service availability
• Generate periodic system health reports
• Identify system trends and prevent production failures
• Build the monitoring infrastructure, and measure and optimize production system performance
• For critical production issues, run initial triage and open an escalation bridge
• Investigate, record, and analyze production errors
• Run daily production processes
• Where applicable, restore the system to an operational state
• Support Tier 1 (First Level Application Support) and Tier 2 (Software Support Engineer) group investigations if required
• Support mPaaS (managed PaaS) installations for new customers
• Manage deployments for ongoing change requests
• Run various production investigations involving components such as Cassandra, GoldenGate, and Kafka

All you need is...
• Bachelor's degree in Computer Science, Information Systems, or Industrial Management, or equivalent experience
• Some experience with Linux and Windows operating systems
• Ability to program (structured and object-oriented) in one or more high-level languages (Java is an advantage)
• Ability to analyse and debug large and complicated systems
• A proactive approach to spotting problems, areas for improvement, and performance issues
• Some experience with SQL/NoSQL databases such as PostgreSQL, SQLite, etc.
• Willingness to work in shifts (including night shifts, Friday, and Saturday shifts)
• Excellent written and verbal communication in English
• Previous experience as an SRE for a SaaS/PaaS solution running in AWS, Azure, or GCP
• Experience in a Tier 1 customer support role

Why you will love this job:
You will be a key member of a technical, dynamic, and highly collaborative team with various possibilities for personal and professional development.
You will have the opportunity to work with the most advanced cloud technologies and tools (Kafka, Elasticsearch, Cassandra, AWS, Azure, Google) and to grow into a top expert in a multinational environment, working for global market leaders in their fields.
In addition, we believe in paying competitive salaries and offer a range of other attractive benefits:
• Career path development
• Share purchase plan
• Health insurance
• Meal vouchers
• Gym subscription
• Phone subscription
• Team-building events
• Company gifts for different anniversaries
• Annual performance bonus