BCM HGSC Hackathon

Published on
BCM HGSC Hackathon

“Innovation distinguishes between a leader and a follower.” - Steve Jobs

Introduction

In October 2022, I participated remotely in the BCM_HGSC Hackathon, where I focused on developing and optimizing MetScale, an open-source tool for querying genomic databases.

Progress

Initial Setup

My initial task was to understand the existing codebase and identify potential areas for optimization. The tool was initially set up for basic data retrieval but required enhancements for efficiency and scalability.

Code Optimization

I restructured the Python code to improve its performance and maintainability. This included refining the database queries and integrating more robust error handling mechanisms.

Documentation and Testing

Using Jupyter Notebooks, I documented the entire process, providing clear examples and instructions for future users. I also developed test cases to ensure the tool’s reliability.

Results

Header Image

The optimization efforts resulted in a significant improvement in query execution times and overall data handling. The tool now supports more complex queries and can handle larger datasets efficiently. Explore the final tool and its capabilities here.

Technology Learned/Used

This project allowed me to deepen my expertise in several areas:

  • Python for backend development
  • SQLite for efficient data management
  • Jupyter Notebooks for documentation and presentation

Conclusion

The BCM_HGSC Hackathon was a valuable experience in applying practical coding skills to real-world genomic research challenges. The enhancements made to the MetScale tool not only improved its performance but also contributed to its potential application in genomic studies.

References