Last month, Meta released data2vec, its first high-performance self-supervised algorithm that works across multiple modalities. The moniker data2vec is a pun on "word2vec," a Google technology for language embedding released in 2013. Word2vec is an example of a neural network built for one specific kind of input, in this case text, since it predicted how words cluster together. Meta AI has released the data2vec source code and pre-trained models for speech and natural-language processing on GitHub under the permissive MIT license. In the paper "Data2vec: A General Framework for Self-supervised Learning in Speech, Vision, and Language," the research team explained that data2vec is trained by predicting the model representations of the full input data given a partial view of the input.

Earlier, AI machines were made to learn from labeled data. That changed with the advent of self-supervised learning, which allows machines to learn about their surroundings by observing them and then decoding the structure of images, speech, or text. This technique lets computers tackle new, complicated data-related jobs more efficiently, such as understanding text in a wider range of spoken languages. However, most existing models are proficient at performing only a single task. For example, a facial recognition system cannot generate textual content, nor can a credit-card fraud detection system help detect tumors in patients. In simpler words, while we have built state-of-the-art machines for particular applications, each is confined to its niche, and its AI prowess may not be transferable.

Self-supervised learning research today nearly always concentrates on a single modality. As a result, researchers who work on one modality frequently use a completely different technique from those who specialize in another. This deficit motivated Meta to develop data2vec, which not only unifies the learning process but also trains a single neural network to recognize images, text, or speech.

data2vec employs a single model but offers two modes: teacher and student. The student mode learns from the teacher mode and updates the model parameters at each time step. (Image Credit: Meta)

data2vec was tested on the popular GLUE text benchmark suite and found to be on par with RoBERTa, a reimplementation of BERT. It outperformed two prior Meta AI self-supervised speech algorithms, wav2vec 2.0 and HuBERT, and surpassed existing approaches across a range of model sizes on the primary ImageNet computer vision benchmark.
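The training objective described above can be sketched in a few lines. The following is a minimal, illustrative NumPy toy, not Meta's implementation: the "encoder" is a single linear map, the student sees a masked view of the input and regresses onto the teacher's representation of the full input, and the teacher's weights track the student's as an exponential moving average. All names and hyperparameters here are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "encoder": one linear layer mapping 8-dim inputs to 4-dim representations.
W_student = rng.normal(size=(8, 4)) * 0.1
W_teacher = W_student.copy()  # the teacher starts as a copy of the student

def encode(W, x):
    return x @ W

tau = 0.999  # EMA decay for the teacher weights (illustrative value)
lr = 0.1     # learning rate for the student (illustrative value)

x = rng.normal(size=(16, 8))       # a batch of full inputs
mask = rng.random(x.shape) < 0.5   # zero out roughly half of each input
x_masked = np.where(mask, 0.0, x)

losses = []
for step in range(100):
    target = encode(W_teacher, x)       # teacher sees the full input
    pred = encode(W_student, x_masked)  # student sees only the partial view
    err = pred - target
    losses.append(np.mean(err ** 2))
    # Gradient step on the student's regression loss.
    grad = x_masked.T @ err / len(x)
    W_student -= lr * grad
    # The teacher slowly follows the student via an exponential moving average.
    W_teacher = tau * W_teacher + (1 - tau) * W_student
```

The key design choice, mirrored from the paper's description, is that the prediction target is a continuous representation produced by the model itself rather than a modality-specific token, pixel, or audio label, which is what lets one recipe cover speech, vision, and text.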