Senior member of a team with operational responsibility
for major parts of the production infrastructure: distributed
storage, job scheduling, a distributed locking service, and
automated machine management
Manage day-to-day operation of these services,
including monitoring, automation, and emergency response
Assist in testing and qualification of new Linux
kernels
Troubleshoot system-level issues across 80% of Google's
servers
Provide mentoring and training for new team members
Worked as part of a small worldwide team responsible for
some of the most critical servers in the Fixed Income,
Currency, and Commodities (FICC) division
Provided long-term system engineering and 24/7 operations
support for Solaris, Linux, and NetApp servers
Maintained the most widely deployed Linux distribution in
the firm