OpenAI introduces data partnerships to facilitate deep training of AI models


OpenAI introduces data partnerships to facilitate deep training of AI models
OpenAI, led by Sam Altman, has launched data partnerships to work with organizations in producing both public and private datasets for training AI models. The company is focused on collaborating with organizations to help AI models comprehend a wide range of subject matters, industries, cultures, and languages, which necessitates a comprehensive training dataset.
"Data Partnerships are intended to enable more organizations to help steer the future of AI and benefit from more useful models by including content they care about", the company said in a statement. The ChatGPT developer expressed interest in large-scale datasets reflecting human society that are not publicly accessible online.
"We can work with any modality, including text, images, audio, or video. We’re particularly looking for data that expresses human intention (e.g., long-form writing or conversations rather than disconnected snippets) across any language, topic, and format", the company noted.
OpenAI recently announced that it possesses the capability to work with data in almost any form and is equipped with advanced in-house AI technology that can assist individuals in digitizing and organizing their data. The company has top-notch optical character recognition (OCR) technology that can convert files like PDFs into digital format and automatic speech recognition (ASR) that can transcribe spoken words. Additionally, OpenAI is looking for partners to collaborate with in order to create an open-source dataset that can be utilized for training language models.
"This dataset would be public for anyone to use in AI model training. We would also explore using it to safely train additional open-source models ourselves. We believe open-source plays an important role in the ecosystem", said OPenAI. "We are also preparing private datasets for training proprietary AI models, including our foundation models and fine-tuned and custom models", it added.
Source: IANS