What Is Machine Learning? | Extremetech

AI-powered systems have swept through business, surfing a rising wave of occasionally justified hype. When they’re good, they’re really good—take, for example, a neural net designed to help Japanese bakery customers get through the checkout faster, which ended up being fabulous for detecting skin cancer. The ability to use image recognition to distinguish the unique outlines of baked goods turned out to be a transferable skill, because skin cancers have characteristic shapes and outlines that change over time.

The trouble with all this power is that it doesn’t necessarily come with accuracy. Imagine a rear-wheel-drive car with a V8 driving in the snow. Approximation is like dead reckoning, and if the heading is off, an AI can get way off course. The ability to learn isn’t automatically included in the structure of a neural net—and learning is a huge advantage when it comes to getting good results.

Machine Learning 101

So, what is machine learning in the first place? And if the machines are so smart, why are they still so dumb?

The point of learning is to improve results. For the best results, a model needs to be both powerful and accurate. Machine learning is a field of study within artificial intelligence, concerned with making algorithms that can learn from experience and thus improve their own performance.

Tom M. Mitchell, a machine learning expert from Carnegie Mellon University, put it like this: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P—if its performance at tasks in T, as measured by P, improves with experience E.”
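To make Mitchell's E, T, and P concrete, here is a minimal sketch in Python. Everything in it is invented for illustration: the task T is guessing whether a number sits above a hidden cutoff, the experience E is a list of labeled examples, and the performance measure P is accuracy on a fixed set of test points. The cutoff value and the function names are assumptions, not anything from Mitchell.

```python
import random

TRUE_THRESHOLD = 0.62  # hidden rule the learner must discover (illustrative value)

def label(x):
    """Task T: say whether x is at or above the hidden cutoff."""
    return 1 if x >= TRUE_THRESHOLD else 0

def learn_threshold(examples):
    """Experience E: labeled (x, y) pairs. Returns a guessed cutoff."""
    highest_zero = max((x for x, y in examples if y == 0), default=0.0)
    lowest_one = min((x for x, y in examples if y == 1), default=1.0)
    # Guess halfway between the largest "no" and the smallest "yes" seen so far.
    return (highest_zero + lowest_one) / 2

def accuracy(threshold):
    """Performance P: fraction of evenly spaced test points classified correctly."""
    test_points = [i / 1000 for i in range(1001)]
    correct = sum((1 if x >= threshold else 0) == label(x) for x in test_points)
    return correct / len(test_points)

rng = random.Random(0)
experience = [(x, label(x)) for x in (rng.random() for _ in range(500))]

# More experience narrows the gap around the hidden cutoff.
for n in (0, 5, 50, 500):
    guess = learn_threshold(experience[:n])
    print(n, round(guess, 3), round(accuracy(guess), 3))
```

With no examples the program can only guess the middle of the range. As examples pile up, the guess is squeezed toward the real cutoff and the accuracy score climbs, which is the "improves with experience" part of the definition.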

Machine learning has two big jobs: pattern recognition and prediction. Effective learning uses past experience to successfully generalize, predict, or project the outcome of a new situation. To make an accurate prediction, an AI needs to know what parts of the problem are significant, and what relationships exist between them. Only then can it start making decisions.

Neural nets have to pick their way through a landscape of choices, choosing their path based on which relationships carry the greatest weight. During training, that path-finding is done with a technique called gradient descent. It’s like how a watershed works: water always flows downhill. Steep hills shed water faster, and deep valleys collect it. Many systems, including big names like ChatGPT, Copilot, and Google’s Vertex AI, do this navigation in the mathematical language of vectors and tensors, which describe and compare positions and directions in an n-dimensional space. Machine learning allows an AI to change the topography of its decision landscape.
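The watershed picture can be sketched in a few lines of Python. This is a toy, not how any of the named products work internally: the bowl-shaped `height` function, its low point at (3, -1), the learning rate, and the step count are all made up for the example.

```python
def height(x, y):
    """An illustrative bowl-shaped landscape whose lowest point is (3, -1)."""
    return (x - 3) ** 2 + 2 * (y + 1) ** 2

def slope(x, y):
    """Gradient of height: how steeply the ground rises along x and along y."""
    return 2 * (x - 3), 4 * (y + 1)

def gradient_descent(x, y, learning_rate=0.1, steps=100):
    for _ in range(steps):
        dx, dy = slope(x, y)
        # Step against the gradient, i.e. downhill. Steeper ground means a bigger step.
        x -= learning_rate * dx
        y -= learning_rate * dy
    return x, y

x, y = gradient_descent(0.0, 0.0)
print(round(x, 4), round(y, 4), round(height(x, y), 8))
```

Starting from (0, 0), each step moves a little way downhill, and the point settles near the bottom of the bowl. In a real neural net the "position" is millions of weights rather than two coordinates, but the move-downhill rule is the same idea.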

An AI is software: an algorithm or logical model, written in a programming language. (Often it’s Python, usually with a framework like PyTorch.) Those models combine a repetitive, iterative process with a kind of gigantic spreadsheet called a matrix. But machine learning is more than just saving a file. When an AI learns, it changes its own assumptions or even its process.

The most common training algorithm for neural nets (at least, as of 2024) is called back-propagation. In back-propagation, the error measured at the end of a process is passed backward to earlier steps, so that each step can adjust its weights and produce a better result on the next pass.
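Here is a deliberately tiny sketch of that backward pass in Python: a two-step "network" with one weight per step, trained on a single made-up example. The weights, input, target, and learning rate are arbitrary illustrative values. Real frameworks automate this bookkeeping across millions of weights.

```python
import math

def forward(w1, w2, x):
    hidden = math.tanh(w1 * x)   # early step
    output = w2 * hidden         # late step
    return hidden, output

def backward(w1, w2, x, target):
    """Return (loss, dloss/dw1, dloss/dw2), computed with the chain rule."""
    hidden, output = forward(w1, w2, x)
    loss = (output - target) ** 2
    d_output = 2 * (output - target)          # error signal at the output
    d_w2 = d_output * hidden                  # share of blame for the late weight
    d_hidden = d_output * w2                  # error passed back one step
    d_w1 = d_hidden * (1 - hidden ** 2) * x   # share of blame for the early weight
    return loss, d_w1, d_w2

w1, w2 = 0.5, 0.5
x, target, learning_rate = 1.0, 0.5, 0.1

initial_loss, _, _ = backward(w1, w2, x, target)
for _ in range(200):
    loss, d_w1, d_w2 = backward(w1, w2, x, target)
    # Gradient descent update: nudge each weight against its share of the error.
    w1 -= learning_rate * d_w1
    w2 -= learning_rate * d_w2
final_loss, _, _ = backward(w1, w2, x, target)

print(initial_loss, final_loss)
```

The key line is `d_hidden = d_output * w2`: the mistake measured at the output is carried back to the earlier step, which is how the early weight finds out it needs to change.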

Approaches to Machine Learning

Training a neural net is the process of teaching it how to perform its task. It’s a maximalist thing; bigger datasets are usually better. Powerful AIs like Copilot and ChatGPT are so flexible because they rely on logical models encompassing an entire language to understand the problems they’re given. Language, in turn, is defined by the relationships between elements in the system, such as words and grammar. Words that are used more commonly together have relationships that a neural net will judge to have a greater importance or weight.
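As a rough illustration of words that travel together earning more weight, the Python sketch below counts adjacent word pairs in a three-sentence toy corpus. This is a big simplification: real language models learn their weights through training rather than by counting pairs, and the corpus and function name here are invented for the example.

```python
from collections import Counter

# A made-up corpus, far smaller than anything a real model would train on.
corpus = [
    "machine learning improves with data",
    "machine learning needs data",
    "deep learning is machine learning",
]

def cooccurrence_weights(sentences):
    """Give each adjacent word pair a weight equal to its share of all pairs."""
    counts = Counter()
    for sentence in sentences:
        words = sentence.split()
        counts.update(zip(words, words[1:]))  # neighbors: (word, next word)
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}

weights = cooccurrence_weights(corpus)
strongest = max(weights, key=weights.get)
print(strongest, round(weights[strongest], 3))
```

The pair that shows up most often ends up with the largest weight, which is the intuition behind "used more commonly together" in the paragraph above.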

Different machine learning approaches may be more or less useful, depending on the type of problem a given AI is trying to solve.

Thomas Malone | MIT Sloan. See: https://bit.ly/3gvRho2, Figure 2.
Credit: Thomas Malone, Laura Wentzel/MIT Sloan

Generally speaking, there are three types of machine learning, each with its own angle of approach and best use cases. Before it can do its work, a neural net has to be trained, so that it has enough context to generalize beyond the examples it has seen. If it’s a goal-directed system, it needs some way to judge whether its own output met the standard, so it can get better results next time around. And sometimes, problems are too complex to handle without paring them down a little.

Supervision

While an AI is in training, its directive is to find patterns in its training data set. When that process has guidance (such as from humans who helpfully label and sanitize data), it’s called supervised learning. On the other hand, unsupervised learning looses an AI on a data set to draw its own conclusions.

When it comes to data sets, bigger is usually better, but there’s a caveat: tons of messy data is not necessarily better than a smaller but well-organized data set. Unsupervised learning has a unique power: finding subtle patterns that humans may not even have perceived, let alone explained. But purely unsupervised algorithms can rapidly spin off into chaos. As a result, many AIs find success in a semi-supervised approach.
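The difference between the two approaches is easier to see side by side. In this Python sketch, the same six made-up numbers are handled twice: once with human-supplied labels (a nearest-average classifier) and once with no labels at all (a two-group k-means). Both methods are stand-ins chosen for brevity, and the data, labels, and function names are assumptions for illustration.

```python
from statistics import mean

points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9]
labels = ["small", "small", "small", "large", "large", "large"]  # supplied by a human

def train_supervised(points, labels):
    """Supervised: average the points filed under each human label."""
    return {name: mean(p for p, l in zip(points, labels) if l == name)
            for name in set(labels)}

def classify(centers, x):
    """Pick the label whose average is closest to x."""
    return min(centers, key=lambda name: abs(centers[name] - x))

def train_unsupervised(points, rounds=10):
    """Unsupervised: two-group k-means. No labels, only 'which points sit together?'"""
    centers = [min(points), max(points)]  # crude starting guesses
    for _ in range(rounds):
        groups = [[], []]
        for p in points:
            nearest = min((0, 1), key=lambda i: abs(centers[i] - p))
            groups[nearest].append(p)
        # Move each center to the middle of its group (keep it if the group is empty).
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers

labeled_centers = train_supervised(points, labels)
print(classify(labeled_centers, 1.1), classify(labeled_centers, 4.0))
print(train_unsupervised(points))
```

The unsupervised version finds the same two clumps, but it has no idea that one is "small" and the other "large." Attaching meaning to the groups still takes a human, or at least a handful of labels, which is part of the appeal of the semi-supervised middle ground.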

Simplification

Real-world problems can be tough to solve because data from the real world is usually messy. There’s just a lot going on in any given system, living or not. Complicated data with a lot of variables is sometimes described as having “high dimensionality,” because each variable can be treated as its own dimension.

To make problems solvable within a reasonable period of time, we humans often simplify our models. For example, a lot of math and physics problems about objects in motion contain the phrase, “ignore wind resistance.” (One affectionate physics in-joke notes that physicists will often “assume a spherical cow.”) The official name of this type of simplification is “dimensionality reduction.”
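One crude way to "ignore wind resistance" in code is to drop the variable that barely changes from one measurement to the next. The Python sketch below does exactly that on a made-up table. It is only one simple flavor of dimensionality reduction (techniques such as principal component analysis are more common in practice), and the column meanings and numbers are hypothetical.

```python
from statistics import pvariance

# Hypothetical measurements: (speed, mass, wind_resistance)
rows = [
    (10.0, 2.0, 0.010),
    (25.0, 3.5, 0.011),
    (40.0, 1.0, 0.010),
    (55.0, 4.0, 0.012),
]

def drop_least_varying(rows):
    """Remove the column with the smallest variance; return (new_rows, dropped_index)."""
    columns = list(zip(*rows))
    # Caveat: variance depends on units, so real pipelines usually rescale columns first.
    variances = [pvariance(col) for col in columns]
    dropped = variances.index(min(variances))
    reduced = [tuple(v for i, v in enumerate(row) if i != dropped) for row in rows]
    return reduced, dropped

reduced_rows, dropped_index = drop_least_varying(rows)
print(dropped_index, reduced_rows)
```

Three dimensions become two, and any model downstream has less to chew on. Whatever the dropped column was quietly contributing is now gone, and that loss is the cost of the simplification.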

Deepfakes and self-driving are amazing, but for all the sophistication of AI, we've circled back to COUNT THE FINGERS LEST YE DEAL UNKNOWING WITH A FAE

Performance artist James Bridle “trapped” this self-driving car inside a salt circle of “‘do not pass’ sigils.”
Credit: James Bridle

Dimensionality reduction can make problems much less complicated, but that simplification comes at a price. For example, self-driving cars use software that makes some informed assumptions about their surroundings. But AI is still trapped by the iron triangle: good, fast, or cheap, pick two. Using shortcut assumptions is like trying to watch video in potato resolution: it sacrifices accuracy for speed, which is mostly fine.

Mostly.

Reinforcement

Artificial intelligence has so much power that when it fails, it’s sometimes surprising to see how close the roots are to the surface. But when an AI gets it wrong, at best, the results are just meh—and sometimes they’re inaccurate, super cursed, or even dangerous. The problem is worse with generative AI than with the kind of AI meant only to identify patterns or solve an equation. For example, AI-generated recipes are notoriously terrible.

Whenever a man tells me he's looking for a real woman, I scurry away, because I'm actually three raccoons in a trench coat, and HE MUSTN'T FIND OUT

Calling this cursed is being generous.
Credit: AI Weirdness

Doesn’t it feel like, with a little more depth of understanding, this could be great? But it still needs some hand-holding, and maybe an ex post facto sanity check. That’s where reinforcement learning comes in.

Better, Faster, Stronger

Artificial intelligence is the study of systems that mimic some aspect of human intelligence or behavior. Once a goal-directed AI has made its choices, it can apply feedback to improve its results on the next iteration. Association is the fundamental unit of learning. In a way, this mimics simple conditioning, like Pavlov and his dogs. And the analogy holds, even in failure. Humans may be the most intelligent creatures on the planet, but we are capable of truly astonishing stupidity.
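That feedback loop can be sketched as a tiny trial-and-error learner in Python. The setup is invented for illustration: two possible actions with hidden typical rewards, a little noise on each reward, and an "epsilon-greedy" rule that mostly repeats what has worked but occasionally tries something else. The action names, reward values, and exploration rate are all assumptions.

```python
import random

def run_bandit(steps=500, explore_rate=0.1, seed=0):
    rng = random.Random(seed)
    typical_reward = {"left": 0.2, "right": 0.8}  # hidden from the learner
    estimates = {action: 0.0 for action in typical_reward}
    counts = {action: 0 for action in typical_reward}

    def try_action(action):
        reward = typical_reward[action] + rng.uniform(-0.1, 0.1)  # noisy feedback
        counts[action] += 1
        # Nudge the running average toward the reward just received.
        estimates[action] += (reward - estimates[action]) / counts[action]

    for action in typical_reward:  # try every action once to get a first impression
        try_action(action)
    for _ in range(steps):
        if rng.random() < explore_rate:
            action = rng.choice(list(typical_reward))   # explore: pick at random
        else:
            action = max(estimates, key=estimates.get)  # exploit: pick the best guess
        try_action(action)
    return estimates, counts

estimates, counts = run_bandit()
print(estimates, counts)
```

Nobody tells the learner which action is correct. It only ever sees a reward after the fact, and the association between action and reward is what steers it toward the better choice, much like the conditioning analogy above.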

When it is cute, it is very, very cute. But when it is not, it is nightmare fuel (right). By way of apology we have included additional adorable raccoons in a trench coat (left).

AI image generators lean hard on a generative AI’s ability to hallucinate. Sometimes that’s awesome, but sometimes it’s nightmare fuel.
Credit: Extremetech/Craiyon

Many AI chatbots are built on large language models (LLMs), which train on text pulled from a huge number of internet sources, like Wikipedia and the natural-language exchanges between posters in the Ubuntu Help Forums. Consequently, AI chatbots can write with as much grammatical nuance as a human, but the prose is often wooden and repetitive. AI image generation gets the number of fingers and teeth wrong way too often. Deepfakes and self-driving are amazing, but for all the sophistication of AI, we’ve circled back to “COUNT THE FINGERS LEST YE DEAL UNKNOWING WITH A FAE.”

In the end, better algorithms build on one another. These AI chatbots still feel like three raccoons in a trench coat, trying desperately to play it cool. But they also show just how much we stand on the shoulders of giants.
