Foundation Models - A hidden revolution in enterprise Artificial Intelligence | HackerNoon

France Nouvelles Nouvelles

Foundation Models - A hidden revolution in enterprise Artificial Intelligence | HackerNoon
France Dernières Nouvelles,France Actualités
  • 📰 hackernoon
  • ⏱ Reading Time:
  • 77 sec. here
  • 3 min. at publisher
  • 📊 Quality Score:
  • News: 34%
  • Publisher: 51%

An introductory article to bring a preliminary cognizance on the broadening prospects of foundation models in the AI industry. - artificialintelligence machinelearning

“Why can't we train a complex AI model once for creating a generic representation of training data that is not specific to any downstream task, store it in a data store and later consume this trained model for solving multiple diversified problems by just tweaking the trained model?”The term was coined for the first time in a white paper namedpublished by some esteemed researchers at Stanford University.

“A foundation model is any model that is trained on broad data that can be adapted to a wide range of downstream tasks.”Let's consider a simple example. A model named “Alexnet” was developed for image recognition problems. A lot of trailblazing and sophisticated techniques such as Batch normalisation, Drop out, Relu activation etc were part of the optimization and development of this model.

This neural network model has eight layers with learnable parameters. The model consists of five layers with a combination of max pooling followed by 3 fully connected layers and they use Relu activation in each of these layers except the output layer. It is trained on a dataset called “Imagenet” that has almost 14 million images across a thousand classes. Later in a competition, it was proved that Alexnet can provide a state of the art outcomes for solving various problems.

If we have got a problem based on image processing, why can't we use a trained version of the Alex net model and tweak some of its parameters according to our downstream task?Let's deep dive into one more real-time example regarding natural language processing.. It is a process used for converting raw text into mathematical vectors. This is because a machine learning model will not be able to directly consume an input in text format for the various internal computational processes.

The embedding process carried out by algorithms such as Word2vec, Glove, etc is called word embedding or static embedding.

Nous avons résumé cette actualité afin que vous puissiez la lire rapidement. Si l'actualité vous intéresse, vous pouvez lire le texte intégral ici. Lire la suite:

hackernoon /  🏆 532. in US

France Dernières Nouvelles, France Actualités

Similar News:Vous pouvez également lire des articles d'actualité similaires à celui-ci que nous avons collectés auprès d'autres sources d'information.

MTA to use artificial intelligence tech to keep buses from breaking downOfficials plan to install the technology on 1,500 buses in December.
Lire la suite »

Tips for buying the best artificial Christmas treeTips for buying the best artificial Christmas treeArtificial Christmas trees are now more realistic than ever before. Here's how to find the most authentic tree and get the most bang for your buck.
Lire la suite »

Foundation to mark Deep Ellum’s 150th birthday in 2023 with new community center and moreFoundation to mark Deep Ellum’s 150th birthday in 2023 with new community center and moreThe yearlong jubilee includes a black-tie reception, music festival and parade.
Lire la suite »

Izabel Goulart models ‘love’ look for Germany’s Kevin Trapp at 2022 World CupIzabel Goulart models ‘love’ look for Germany’s Kevin Trapp at 2022 World CupOn Sunday, the Sports Illustrated Swimsuit alum, 38, gave fans a sneak peek at her World Cup ensemble for Germany’s Group E match versus Spain, where her fiancé, German goalkeeper Kevin Trapp…
Lire la suite »

Artificial neural network: Here's everything you need to know about black box of AIArtificial neural network: Here's everything you need to know about black box of AIArtificial neural networks (ANNs) are computing systems inspired by the way biological neural networks work in the human, or other animals, brains.
Lire la suite »



Render Time: 2025-03-10 23:33:32