Senior DevOps Engineer

Tech Ops, Bangalore, India

SENIOR DEVOPS ENGINEER

ABOUT LOGMEIN

LogMeIn simplifies how people connect with each other and the world around them to drive meaningful interactions, deepen relationships, and create better outcomes for individuals and businesses. One of the world’s top 10 public SaaS companies, and a market leader in communication & conferencing, identity & access, and customer engagement & support solutions, LogMeIn has millions of customers spanning virtually every country across the globe. LogMeIn is headquartered in Boston with additional locations in North America, Europe, Asia and Australia.

LogMeIn has a broad portfolio of products that include award winning products such as join.me, GoToMeeting, GoToWebinar, GoToTraining, Grasshopper, Rescue, Bold360, LogMeIn Pro,Central, GotoMyPC, LastPass and many more.

ABOUT ROLE

The Senior DevOps Engineer supports high-availability 24/7 production systems of moderate to high complexity and risk. The role performs ongoing application support for live production systems by diagnosing and resolving highly complex issues, identifying, recommending and implementations options for improving performance, maintainability and operability; update existing practices and procedures, as defined by supervisor. The role may also directly work with Architect and above technical staff architecting and designing technical solutions; work on projects requiring technical decision making and providing input to delivery schedules; normally receive basic or no instruction on routine work and new assignments; and have work periodically reviewed by more senior team members or supervisors for the soundness of technical judgment and overall accuracy.

RESPONSIBILITIES
  • Acts as the single point of contact for development and product teams into Technical Operations and as the primary subject matter expert within Technical Operations for a limited number of applications and services
  • Works independently with little guidance from the Architect team to ensure operational requirements (reliability, availability, scalability, performance, capacity etc.) are met, and recommends operational improvements to them
  • Knowledgeable of monitoring for owned applications and proactively monitors and manages the runstate of them; responsible and accountable for ensuring all issues are addressed and resolved in a timely and robust fashion
  • Performs technical work to accomplish tasks and projects within defined time lines and in a professional manner in alignment with active standards • Installs, upgrades, configures, repairs and monitors high-availability 24/7 application service, third-party appliances, and applications; assists with automation scripts and tools to improve the team’s overall efficiency
  • Works with others or independently on projects medium to high complexity with minimal or no cross team alignment needs
  • Works with team members on project tasks that derive from release requirements
  • Can develop a high-quality documentation of all the technical and non-technical deliverables
  • Performs maintenance and service functions to support production infrastructure, including system installation, systems administration, patching and configuration and software upgrades
  • Participates in 24x7 on-call rotation and responds to production alerts involving multiple software system components, using background, experience and established procedures to resolve issues and restore services as quickly as possible
  • Follows and updates the Tactical Run Books and SOP’s, which define the processes and procedures for L3-4 level support of Web Operations systems; continues troubleshooting beyond what is documented
  • Keeps abreast of technical trends, and develops and incorporates them within assignments as appropriate; recommends improvements and changes to supervisor and team

REQUIREMENTS
  • Bachelor’s Degree or equivalent in computer science, electrical engineering, or related field is preferred with 8+ years of directly related work experience
  • In depth working knowledge of Unix/Linux systems administration and troubleshooting, with good understanding of Linux internals
  • Strong working proficiency in one or more administrative languages such Python, Bash, Ruby, Java
  • Working knowledge in Machine Learning would be a plus
  • Expert in Automation
  • Experience with SQL and/or No-SQL database technologies (SQL, schema design)
  • Sound Knowledge and exposure to network protocols and tools
  • Good Knowledge and exposure to WAF security policies and procedure would be a plus
  • In good understanding of Security tools and procedures
  • Exposure in designing the monitoring infrastructure with commercial or open source monitoring systems (e.g. Nagios, AppDyamic, WaveFront, New Relics etc) a plus
  • Knowledge of distributed systems development, tools and designs •
  • Experience with failure mode analysis
  • Experience with distributed computing and Software Registration systems such as Cassandra/Zookeeper a plus
  • Understanding of software lifecycle process
  • Working knowledge with version control tools (Git, Perforce, SVN, etc)
  • Familiarity with common frameworks, languages and application servers for Webapps, Java, C/C++ or other languages a plus
  • Expert with hosting applications in Cloud, and principles of elasticity using third party cloud technologies (Amazon AWS,)
  • Knowledge and experience in the administration and operations of large-scale distributed computing environments; experience with standard system Operations methods and procedures; prior hosting experience a plus
  • A solid 5-7 years experience in Production Support of highly available online applications
  • Overall have a clear understanding on how Internet Ecosystem is built and works
OUR VALUES
Be Accountable - even when no-one is looking
Thrive Together - greatness comes from unlocking each other’s potential
Advance Confidently - we find opportunity and act on it
Collaborate Openly - our whole is greater than the sum of our parts
Engage Fearlessly - we speak up and listen