Conexiant
Login
  • The Analytical Scientist
  • The Cannabis Scientist
  • The Medicine Maker
  • The Ophthalmologist
  • The Pathologist
  • The Traditional Scientist
The Analytical Scientist
  • Explore

    Explore

    • Latest
    • News & Research
    • Trends & Challenges
    • Keynote Interviews
    • Opinion & Personal Narratives
    • Product Profiles
    • App Notes

    Featured Topics

    • Mass Spectrometry
    • Chromatography
    • Spectroscopy

    Issues

    • Latest Issue
    • Archive
  • Topics

    Techniques & Tools

    • Mass Spectrometry
    • Chromatography
    • Spectroscopy
    • Microscopy
    • Sensors
    • Data & AI

    • View All Topics

    Applications & Fields

    • Clinical
    • Environmental
    • Food, Beverage & Agriculture
    • Pharma & Biopharma
    • Omics
    • Forensics
  • People & Profiles

    People & Profiles

    • Power List
    • Voices in the Community
    • Sitting Down With
    • Authors & Contributors
  • Business & Education

    Business & Education

    • Innovation
    • Business & Entrepreneurship
    • Career Pathways
  • Events
    • Live Events
    • Webinars
  • Multimedia
    • Video
Subscribe
Subscribe

False

The Analytical Scientist / Issues / 2021 / Jul / Small Molecule Discovery – and Make It Snappy!
Mass Spectrometry Pharma and Biopharma Data and AI

Small Molecule Discovery – and Make It Snappy!

By Lauren Robertson 07/20/2021 1 min read

Share

Researchers across the life sciences face the crucial challenge of correctly identifying small molecules in a sample. Historically, natural product drug discovery has been a low-throughput process that depends a lot on luck – just think of how penicillin was discovered! Though recent decades have shown significant advances in genomics and high-throughput MS-based data collection, trawling databases for this information is difficult and takes time. To add to this, existing approaches are based on chemical domain knowledge and often fail to explain many of the peaks found in mass spectra.

Now, a team of researchers from Pittsburgh’s Carnegie Mellon University and Russia’s St. Petersburg State University have created an MS-based algorithm that can quickly and accurately identify whether a particular molecule is truly new or has previously been discovered.

“When we started this study, efficient and accurate methods for identification of small molecules from their mass spectra were not available,” says Hosein Mohimani, part of the research team. “We had previously developed scalable methods (such as Dereplicator and Dereplicator+) for identifying small molecules, but they failed to correctly identify a large portion.” MolDiscovery builds on these previous attempts by combining machine learning and expert knowledge to create theoretical MS fragmentation patterns from the molecular structures and scoring these against query mass spectra.

“Our results showed that molDiscovery outperforms state-of-the-art methods in accuracy and efficiency. Additionally, unlike existing machine learning methods, molDiscovery generalizes well to unseen data,” says Mohimani. In fact, the paper reports that molDiscovery identified six times more unique small molecules than previous methods.

But the researchers don’t plan to stop there. They are already working on various extensions to molDiscovery and plan to incorporate expert knowledge from analytical chemistry literature into their model to further improve accuracy. “We are also working on more complex models that automatically learn unknown small molecule fragmentation rules. We also plan to integrate molDiscovery and its derivatives into our computational pipelines for high-throughput natural products discovery from multi-omic data, such as NRPminer and MetaMiner,” says Mohimani. “We believe MolDiscovery and its derivatives will play a crucial role in shaping the future of data-driven natural product drug discovery.”

Newsletters

Receive the latest analytical science news, personalities, education, and career development – weekly to your inbox.

Newsletter Signup Image

References

  1. L Cao et al., Nat Comms, 12, 3718 (2021). DOI: 10.1038/s41467-021-23986-0.

About the Author(s)

Lauren Robertson

By the time I finished my degree in Microbiology I had come to one conclusion – I did not want to work in a lab. Instead, I decided to move to the south of Spain to teach English. After two brilliant years, I realized that I missed science, and what I really enjoyed was communicating scientific ideas – whether that be to four-year-olds or mature professionals. On returning to England I landed a role in science writing and found it combined my passions perfectly. Now at Texere, I get to hone these skills every day by writing about the latest research in an exciting, creative way.

More Articles by Lauren Robertson

False

Advertisement

Recommended

False

False

The Analytical Scientist
Subscribe

About

  • About Us
  • Work at Conexiant Europe
  • Terms and Conditions
  • Privacy Policy
  • Advertise With Us
  • Contact Us

Copyright © 2025 Texere Publishing Limited (trading as Conexiant), with registered number 08113419 whose registered office is at Booths No. 1, Booths Park, Chelford Road, Knutsford, England, WA16 8GS.