About NIH
The mission of the Department of Health and Human Services (HHS) is to enhance the health and well-being of all Americans by providing for effective health and human services and by fostering sound, sustained advances in the sciences underlying medicine, public health, and social services.
Within HHS, the National Institutes of Health’s (NIH) National Center for Biotechnology Information (NCBI)—a vital component of the National Library of Medicine (NLM) under NIH—has been a pioneering force in biomedical information since its inception in 1988. Functioning as a premier online biomedical archive, NCBI conducts cutting-edge research, leveraging computational methods to address fundamental molecular-level biomedical challenges and builds production software and databases to support these analyses. With over 6 million daily users, NCBI manages key databases that receive hundreds of millions of monthly visits, including the Sequence Read Archive (which contains more than 25 petabytes of data), PubMed, and PubMed Central. PubMed Central (PMC) is the second most visited U.S. government website, with an average of over 4 million users accessing it daily.
The challenge
The existing PMC search interface was built on an older technology stack that had reached end-of-life. Over time, its infrastructure introduced several limitations:
- Components had become less reliable and increasingly difficult to maintain, which affected performance and made troubleshooting more complex;
- Accumulated technical debt slowed development and made onboarding new team members more challenging;
- The legacy user interface no longer aligned with modern accessibility or usability best practices;
- Inconsistent search behavior, including an autocomplete system that no longer reflected current user search patterns.
NIH needed a fully modernized search system: one that would be cloud-ready, accessible, user-friendly, maintainable for developers, and aligned with the architecture used by PubMed’s search platform.
The approach
Since January 2025, a U.S. Digital Corps Software Engineering Fellow has served as the sole front-end engineer designing and building a new search interface for PMC from the ground up. The Fellow rebuilt PMC Search as a modern cloud-native Django application integrated with multiple backend services, including a new Solr-based search infrastructure, to pursue the following improvements:
- Modernizing the codebase: To ensure long-term maintainability and avoid framework lock-in, the Fellow implemented a Django + VanillaJS + ViteJS approach and refactored major portions of legacy Perl code into Python.
- Redesigning the interface: The Fellow redesigned the entire interface with a mobile-first, user-centered approach. New features included faceted filters, sorting, pagination, a “Results by Year” timeline, persistent user preferences, enhanced accessibility, and completely rewritten autocomplete and filter generation services. The Fellow advocated for and implemented U.S. Web Design System (USWDS) components to ensure compliance, consistency, and reduced maintenance burden. User testing confirmed substantial improvements in clarity, usability, and visual design.
- Performance engineering: Because PMC serves millions of daily visitors, the Fellow led extensive performance engineering. He optimized front-end performance using Lighthouse, profiled backend behavior with Apache Bench, implemented connection pooling, and tested an experimental asynchronous uWSGI configuration to increase concurrent request handling. To validate real-world scalability, the Fellow used K6-based load testing with datasets representing actual PMC traffic patterns. He tuned CPU and memory settings, pod counts, and autoscaling strategies, and collaborated with the engineer managing the Solr backend to ensure full pipeline performance alignment.
The impact
The modernization of PMC Search dramatically improves access to trusted biomedical information for the public and scientific community. The Fellow completed the work more than two months ahead of schedule, allowing for extended beta testing and reducing the overall project cost by a third.
For the public and research community, the new system provides:
- A faster, more reliable, and more intuitive search experience;
- Modern features that help users locate research findings more efficiently;
- Improved accessibility and mobile usability;
- More accurate autocomplete suggestions and clearer search workflows.
These improvements streamline the work of scientists, clinicians, students, journalists, and millions of Americans who rely on PMC daily to conduct research or make health-related decisions.
For NIH, the modernization delivers meaningful, long-term institutional benefits:
- Reduced system maintenance costs through modern infrastructure and codebase modernization;
- A dramatically more maintainable and scalable platform aligned with NIH’s cloud strategy;
- A simplified onboarding experience for new developers;
- System reliability improvements that replace fragile legacy components;
- Architectural consistency with PubMed Search, enabling shared best practices and reducing divergence.
The Fellow’s work positions PMC for the next decade of growth while ensuring millions of users can access high-quality biomedical information quickly and reliably.
digitalcorps.gsa.gov
An official website of GSA’s Technology Transformation Services