Scientists at the Swiss Federal Institute of Technology in Lausanne (EPFL) have recently released Meditron, a free-to-use large language model fine-tuned for the healthcare sector. It is designed to help healthcare professionals make more accurate medical decisions.

Large Language Models (LLMs) are artificial intelligence algorithms trained on very large text datasets to learn a vast number of relationships within language; these relationships are quantified as ‘parameters.’ Such algorithms commonly power conversational agents like OpenAI’s ChatGPT and the language models that Google uses for Bard. The largest models to date have several hundred billion parameters, and their training costs are correspondingly high.

General-purpose LLMs like ChatGPT are extremely versatile, helping with tasks that range from drafting emails to writing poetry. Focusing on a single knowledge domain, such as medicine, makes it possible to build smaller, more efficient, and more accessible models. LLMs trained carefully on medical literature could broaden access to scientific, evidence-based medical information that supports clinical decision-making.

Significant progress has been made in harnessing and improving these models’ medical knowledge and reasoning ability. Yet the most capable AI solutions (like MedPaLM and GPT-4) remain proprietary, while the alternatives are limited to approximately 13 billion parameters, which restricts their usability and reach.

To advance ubiquity and inclusivity in the medical AI realm, EPFL’s team within the School of Computer and Communication Sciences has developed two versions of Meditron, called MEDITRON 7B and MEDITRON 70B. These iterations showcase 7 billion and 70 billion parameters respectively and are specifically tuned for the medical environment. These models were introduced through a research paper published on the preprint platform arXiv, under the title, “MEDITRON-70B: Scaling Medical Pretraining for Large Language Models.”
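To give a feel for why parameter count matters for accessibility, here is a minimal back-of-the-envelope sketch that estimates the memory needed just to hold each model's weights. It assumes the usual approximation of parameters multiplied by bytes per parameter; the numeric precisions listed are common formats chosen for illustration, not details reported in the article, and the function name `weight_memory_gb` is hypothetical.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# The precisions below are common numeric formats picked for illustration;
# the article does not say which precision Meditron is served in.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}

# Parameter counts as stated in the article (7 billion and 70 billion).
MODELS = {"MEDITRON 7B": 7_000_000_000, "MEDITRON 70B": 70_000_000_000}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Return memory for the weights alone, in gigabytes (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for name, num_params in MODELS.items():
        for dtype in BYTES_PER_PARAM:
            size = weight_memory_gb(num_params, dtype)
            print(f"{name} in {dtype}: about {size:.0f} GB of weights")
```

The estimate covers weights only; activations, caches, and framework overhead add to it, so real requirements are higher.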

Utilizing the foundational Llama-2 model from Meta as a starting point, and incorporating ongoing feedback from medical experts and biomedical scholars, Meditron was meticulously trained using a selection of high-quality medical texts. This corpus included authoritative medical publications accessible through public databases like PubMed and an assortment of clinical guidelines representing various regions, healthcare institutions, and global entities.

Zeming Chen, the lead author and a doctoral student in EPFL’s Natural Language Processing Lab, which is directed by Professor Antoine Bosselut, reported that when evaluated on four major medical benchmarks, Meditron outperformed all other free-to-use models and came close to the performance of proprietary models like GPT-3.5 and Med-PaLM. MEDITRON 70B nearly matched GPT-4 and Med-PaLM-2, the top-performing proprietary models currently applied to medical knowledge.

In an era where the swift development of AI is met with skepticism and even trepidation, Professor Martin Jaggi, who leads EPFL’s Machine Learning and Optimization Laboratory, highlighted the significance of Meditron’s transparent, open-source configuration. This transparency extends to both the processes and data involved in training the model, spurring researchers worldwide to engage in its examination and enhancement toward a safer, more robust technology – a set of opportunities not available with proprietary systems crafted by large tech corporations.

Professor Mary-Anne Hartley, a physician and director of the Laboratory for Intelligent Global Health Technologies, which is jointly operated by EPFL’s Machine Learning and Optimization Laboratory (MLO) and Yale School of Medicine, leads the medical side of the project. She emphasized that Meditron was built with safety in mind from the outset. What sets it apart is that it encodes medical knowledge drawn from reliable, transparent sources of evidence. The critical next step is to ensure that the model can apply this knowledge safely and effectively.

The International Committee of the Red Cross (ICRC), known for its medical practice guidelines, is one such trusted source of evidence. Dr. Javier Elkin, who heads the ICRC’s Digital Health Program, noted that new healthcare tools are rarely attuned to humanitarian needs. The organization is enthusiastic about the collaboration with EPFL, which incorporates the ICRC’s guidelines into the model, because it is consistent with humanitarian principles.

A collaborative workshop scheduled for early December in Geneva aims to delve into the prospects, constraints, and potential hazards posed by this innovative AI, with a focus on Meditron as presented by its creators.

Professor Bosselut expressed a fundamental belief that underpinned the development of Meditron: the universal entitlement to medical knowledge. EPFL’s hope is that Meditron acts as a springboard for researchers to safely tailor and confirm the efficacy of this technology within their clinical settings.
