Statistician raises red flag about reliability of machine learning techniques

Share

Machine learning is everywhere in science and technology: powering facial recognition, picking your recommendations on Netflix, and controlling self-driving cars. But how reliable are machine learning techniques really? A statistician says that the answer is “not very,” arguing that questions of accuracy and reproducability of machine learning have not been fully addressed.

Dr Genevera Allen, associate professor of statistics, computer science, and electrical and computer engineering Rice University in Houston, Texas has discussed this topic at a press briefing and at a scientific conference, the 2019 Annual Meeting of the American Association for the Advancement of Science (AAAS). She warned that researchers in the field of machine learning have spent so much time developing predictive models that they have not devoted enough attention to checking the accuracy of their models, and that the field must develop systems which can assess the accuracy of their own findings.

“The question is, ‘Can we really trust the discoveries that are currently being made using machine-learning techniques applied to large data sets?’” Allen said in a statement. “The answer in many situations is probably, ‘Not without checking,’ but work is underway on next-generation machine-learning systems that will assess the uncertainty and reproducibility of their predictions.”

As an example, recently machine learning has been used to study patients with cancer. To study the disease, scientists use machine learning to identify genetically similar individuals so that drug therapies can then be targeted to these specific genomes. But when comparing across different studies, the clusters identified by machine learning are completely different from each other.

The problem is that machine learning techniques do not have a way to say “I don’t know” or “It’s not clear.” The techniques will generally always produce an answer — in the example of the cancer patients, they will always identify a group in some way — but this answer may not be as certain or accurate as it is believed to be. The techniques are able to find a pattern that exists in the data set, even if only dimly, but the pattern may not hold in the real world.

“There is general recognition of a reproducibility crisis in science right now,” Allen told BBC News. “I would venture to argue that a huge part of that does come from the use of machine learning techniques in science.”

Editors’ Recommendations

What is artificial intelligence? Here’s everything you need to know
Learn something new, from coding to marketing, with these free online courses
Google Assistant will alert you if it thinks your flight will be delayed
Nvidia’s new A.I. creates entire virtual cities by watching dash cam videos
Gmail blocks 100 million spam messages daily with its A.I., Google says

News

Company:

How an ERP Software Can Help Your Biz

Google just changed forever

Google lays off more workers and fires protestors in tumultuous week

Report claims the Snapdragon 8 Gen 4 might require larger batteries

The war between PC and console is about to heat up again

Spigen Ultra Hybrid Samsung Galaxy S24 case review: Should you buy it?

Razer Kishi Ultra review: Should you buy it?

The Asus ROG Zephyrus G16 completely challenged my expectations

CUKTECH 20 Power Bank review: Should you buy it?

Smartish Wallet Slayer Vol 1 Samsung Galaxy S24 case review: Should you buy it?

I’ve worn two of the best smart rings. Here’s which one you should buy

I did a camera test with two $1,800 phones. Then something annoying happened

Google Pixel 7a vs. Pixel 7: don’t buy the wrong Pixel

This is the most unusual Galaxy S23 Ultra camera test I’ve ever done

I tested the Galaxy S23 Ultra and iPhone 14 Pro cameras. Only one is a winner

How to search ChatGPT conversations

How to set up Windows 11 without a Microsoft account

How to transfer a Wear OS smartwatch from one phone to another

How to type an em dash in Windows

Ask Jerry: How to fight email spam

8 iPhone browser apps you should use instead of Safari

Are Facebook and Instagram still down? Here’s what we know

Are Facebook and Instagram still down? Here’s what we know

The 1Password Android app just got a huge upgrade

I never knew I needed this mini Mac app, but now I can’t live without it

Statistician raises red flag about reliability of machine learning techniques

Editors’ Recommendations

Table of contents

How an ERP Software Can Help Your Biz

Google just changed forever

Google lays off more workers and fires protestors in tumultuous week

Report claims the Snapdragon 8 Gen 4 might require larger batteries

The war between PC and console is about to heat up again

More News

How an ERP Software Can Help Your Biz

Google just changed forever

Google lays off more workers and fires protestors in tumultuous week

Report claims the Snapdragon 8 Gen 4 might require larger batteries