lingo.lol is one of the many independent Mastodon servers you can use to participate in the fediverse.
A place for linguists, philologists, and other lovers of languages.

Server stats:

55
active users

#aiinscience

0 posts0 participants0 posts today

New Preprint Alert!

We're excited to share our latest work on #ChemRxiv! MARCUS (Molecular Annotation and Recognition for Curating Unravelled Structures) is a web-based platform for extracting chemical information from scientific papers.

📄 Preprint: doi.org/10.26434/chemrxiv-2025

🔗 Try it out: marcus.decimer.ai

ChemRxivMARCUS: Molecular Annotation and Recognition for Curating Unravelled StructuresThe exponential growth of chemical literature necessitates the development of automated tools for extracting and curating molecular information from unstructured scientific publications into open-access chemical databases. Current optical chemical structure recognition (OCSR) and named entity recognition solutions operate in isolation, which limits their scalability for comprehensive literature curation. Here we present MARCUS (Molecular Annotation and Recognition for Curating Unravelled Structures), a tool to aid curators in performing literature curation in the field of natural products. This integrated web-based platform combines automated text annotation, multi-engine OCSR, and direct submission capabilities to the COCONUT database. MARCUS employs a fine-tuned GPT-4 model to extract chemical entities and utilises an ensemble approach integrating DECIMER, MolNexTR, and MolScribe for structure recognition. The platform aims to streamline the data extraction workflow from PDF upload to database submission, significantly reducing curation time. MARCUS bridges the gap between unstructured chemical literature and machine-actionable databases, enabling FAIR data principles and facilitating AI-driven chemical discovery. Through open-source code, accessible models, and comprehensive documentation, the web application enhances accessibility and promotes community-driven development. This approach facilitates unrestricted use and encourages the collaborative advancement of automated chemical literature curation tools. We dedicate MARCUS to Dr Marcus Ennis, the longest-serving curator of the ChEBI database, on the occasion of his 75th birthday.

Four years after publishing my work on machine-driven parameter screen of biochemical reactions, it still has zero citations. That makes me sad; I am really proud of this work. So I asked ChatGPT to suggest me hashtags, for another chance to reach readers. academic.oup.com/nar/article/4 #MolecularBiology #BiochemicalReactions #ReverseTranscription #RNASequencing #LabAutomation #GeneExpression #SingleCellAnalysis #HighThroughput #AIinScience #ComputationalBiology #BiotechInnovation #DataDrivenResearch