2017 - 2020
Software Quality Engineer at Red Hat, Bangalore, India
- Closely interacted with upstream Linux (Fedora, CentOS), Ansible, Ceph, OpenStack, Kubernetes, and Jenkins open source communities and contributed to new-feature and bug-fix discussions.
- Worked on Ceph, a unified distributed software-defined storage system with block, file, and object access from a single managed cluster on the backend.
- Ceph is often called the Linux of storage or the future of storage: an almost 2-million-line C++ project, a low-level distributed storage system at the kernel level that can scale to petabyte-level clusters.
- Contributed to multiple major & minor releases of RHCEPH 2 and 3 by automating critical test cases in the RADOS, iSCSI & file system components.
- Automated IOPS testing by including it in the sanity suites with tools like dd and fio, covering various stress/load/idle scenarios.
- Wrote test cases for all possible workloads and components of RHEL/Ceph & various new storage features, keeping regressions in mind.
- Implemented a container verification pipeline by configuring RabbitMQ for new container images built for multiple parallel releases of RHCEPH & RHEL; integrated layered-product interop testing of early RHEL 7.x and 8.x builds with Ceph.
- Actively collaborated in QE-CI initiative efforts across Red Hat's global QE organization, building common best-practice CI/CD pipelines and tools from scratch, leading to 95% implementation of CI best practices in the CephQE team.
- Enhanced test-metric analysis for each release by integrating the ReportPortal.io ML dashboard into the automation framework.
- Core maintainer of the CephCI and Teuthology test framework GitHub repositories, which are tightly coupled with pytest, apache-libcloud/gevent & Paramiko (SSH) libraries, where containerized Ceph is deployed on top of auto-provisioned bare metal/VMs.
- Analyzed and filed critical bugs in librados and the Ceph RADOS Gateway (RGW) component, a REST interface to the Ceph cluster providing object storage with Amazon S3 and OpenStack Swift API compatibility.
- Worked with consensus protocols like Paxos (in Ceph) and Raft (in Kubernetes), as well as Byzantine fault tolerance at the kernel level, applying the Unix philosophy that treats everything as a file. Also navigated the performance trade-offs involved in the CAP theorem via these consensus protocol states (Leader, Candidate, Follower) with timeouts (heartbeat or pseudo-random seed timeouts).
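The Leader/Candidate/Follower states and the heartbeat and randomized timeouts mentioned above can be sketched as a toy Python model. This is only an illustration under stated assumptions: time is counted in abstract ticks, the 150 to 300 tick timeout range is an arbitrary choice, and the `Role` and `Node` names are hypothetical. It is not code from Ceph, Kubernetes, or etcd, and it leaves out log replication and real vote requests.

```python
import random
from enum import Enum


class Role(Enum):
    FOLLOWER = "follower"
    CANDIDATE = "candidate"
    LEADER = "leader"


class Node:
    """Toy Raft-style node: role transitions only, no log replication."""

    def __init__(self, node_id, cluster_size, rng):
        self.node_id = node_id
        self.cluster_size = cluster_size
        self.rng = rng  # injected so runs can be made reproducible
        self.role = Role.FOLLOWER
        self.term = 0
        self.reset_election_timeout()

    def reset_election_timeout(self):
        # Randomized timeout (assumed range, in ticks) makes it less likely
        # that several nodes start an election at the same moment.
        self.timeout = self.rng.randint(150, 300)
        self.elapsed = 0

    def on_heartbeat(self, leader_term):
        # A heartbeat from a leader with a current term keeps (or makes)
        # this node a follower and restarts its election timer.
        if leader_term >= self.term:
            self.term = leader_term
            self.role = Role.FOLLOWER
            self.reset_election_timeout()

    def tick(self, ticks=1):
        # Leaders do not time out; everyone else counts silent ticks.
        if self.role is Role.LEADER:
            return
        self.elapsed += ticks
        if self.elapsed >= self.timeout:
            # No heartbeat in time: start a new election in a new term.
            self.term += 1
            self.role = Role.CANDIDATE
            self.reset_election_timeout()

    def on_votes(self, votes_granted):
        # A candidate needs a strict majority (its own vote included).
        if self.role is Role.CANDIDATE and votes_granted > self.cluster_size // 2:
            self.role = Role.LEADER


if __name__ == "__main__":
    node = Node(node_id=1, cluster_size=3, rng=random.Random(42))
    node.tick(300)      # a full timeout window with no heartbeat
    print(node.role)    # Role.CANDIDATE
    node.on_votes(2)    # 2 of 3 votes is a majority
    print(node.role)    # Role.LEADER
```

Passing in the random generator keeps the sketch deterministic for testing, while the randomized timeout is what keeps nodes from repeatedly splitting the vote.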
Certifications
Earned the following certs because I wanted to take on more responsibility and handle larger projects with a holistic, first-principles approach. Also, the training material was mostly free for employees, so why not :)
- Red Hat Certified Architect in Infrastructure (RHCA) Level 2
- ID: 190-012-484 in Aug 2019
- Red Hat Certified System Administrator (EX200K)
- Red Hat Certified Engineer (EX300K)
- Red Hat Certified Specialist in Ansible Automation (EX407V27K)
- Red Hat Certified Specialist in Virtualization (EX318V14K)
- Red Hat Certified Specialist in Security: Linux (EX415V75K)
- Red Hat Certified Specialist in Linux Diagnostics and Troubleshooting (EX342K)
- Red Hat Certified System Administrator in Red Hat OpenStack (EX210V13K)
- Red Hat Certified Specialist in Linux Performance Tuning (EX442V80K)
Some other tooling I have worked on: buildah (OCI-spec container build tool), podman, provisioning AWS resources with the boto SDK, OpenShift on AWS (Ceph backend storage class test cases), and the libvirt API/QEMU (with KVM) to emulate block storage test cases across architectures.
Red Hat got acquired by the big blue whale (IBM) in 2019.
Jensen made me GPU-poor again (for the third time) by announcing ... the "holy grail" of computer graphics: ray tracing.
It didn't work smoothly at launch; it needed lots of FLOPs and DLSS (Deep Learning Super Sampling). I thought DLSS was hacky back then, but based on recent advancements in AI it might be the future of gaming (e.g. the "Diffusion Models Are Real-Time Game Engines" paper).
Had to immigrate and cross oceans for those sweet mud puddle reflections 😂