Hugging Face Launches ChatGPT Clone for Just $300

Alongside the code, the model weights have also been made public.

After Stanford University launched Alpaca, a ChatGPT clone trained for about $600, Hugging Face today unveiled Vicuna-13B, an open-source alternative to ChatGPT that reportedly achieves 90% of ChatGPT’s quality at a training cost of around $300. The model was fine-tuned from LLaMA on user-shared conversations gathered from ShareGPT.

Check out the GitHub repository here.

Additionally, the model weights were made publicly available along with the launch. Hugging Face’s latest innovation is expected to garner significant interest from businesses and individuals looking to leverage cutting-edge technology for natural language processing applications.

Early assessments using GPT-4 as a judge suggest that Vicuna-13B attains over 90% of the quality of both OpenAI’s ChatGPT and Google’s Bard, whilst outperforming other models such as LLaMA and Stanford Alpaca in more than 90% of cases. The results have generated considerable interest in the field of natural language processing, particularly among businesses seeking to leverage the latest advancements in AI. The researchers noted in a blog post, however, that building an evaluation system for chatbots remains an open question requiring further research.
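To make the two kinds of figure quoted above more concrete, here is a minimal Python sketch of how judge scores could be turned into a "share of quality" and an "outperforms in X% of cases" number. It assumes, hypothetically, that the judge assigns each answer a numeric score per question and that relative quality is the ratio of total scores. The function names and all scores are invented for illustration and are not the researchers' data or code.

```python
from typing import Sequence


def relative_quality(candidate: Sequence[float], reference: Sequence[float]) -> float:
    """Return the candidate's total judge score as a fraction of the reference's."""
    if len(candidate) != len(reference):
        raise ValueError("both models must be scored on the same questions")
    total_reference = sum(reference)
    if total_reference == 0:
        raise ValueError("reference total must be non-zero")
    return sum(candidate) / total_reference


def win_rate(candidate: Sequence[float], other: Sequence[float]) -> float:
    """Return the fraction of questions on which the candidate scored strictly higher."""
    if len(candidate) != len(other):
        raise ValueError("both models must be scored on the same questions")
    if not candidate:
        raise ValueError("at least one question is required")
    wins = sum(1 for c, o in zip(candidate, other) if c > o)
    return wins / len(candidate)


# Invented per-question judge scores (hypothetical 1-10 scale).
candidate_scores = [8, 7, 9, 8]
reference_scores = [9, 8, 9, 9]
weaker_scores = [5, 7, 6, 4]

print(f"Relative quality: {relative_quality(candidate_scores, reference_scores):.0%}")
print(f"Win rate: {win_rate(candidate_scores, weaker_scores):.0%}")
```

Ties are counted as non-wins here, which is one of several reasonable choices. A real evaluation would also need to control for effects such as the order in which answers are shown to the judge.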

The researchers have made some bold claims about Vicuna-13B’s capabilities in natural language processing, and it remains to be seen how it will stack up against models like ChatGPT. While the two models share some similarities, Vicuna-13B is said to differentiate itself through its efficiency and customisation capabilities, which could make it a strong contender in the NLP space. Industry experts are closely monitoring its performance, anticipating that it will set new benchmarks for AI-powered language processing.

Despite its impressive capabilities, the model has certain limitations. For example, it struggles with tasks that require reasoning or mathematical computation, and there may be some shortcomings in ensuring factual accuracy in its outputs.

Additionally, the model has yet to be thoroughly optimised to guarantee safety or to mitigate potential toxicity or bias. To address these concerns, the developers use OpenAI’s moderation API to filter out inappropriate user inputs in their online demo.
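The filtering step described above amounts to a gate placed in front of the model. The sketch below shows that pattern in Python under stated assumptions: `is_flagged` and `generate` are hypothetical stand-ins for a call to a moderation service and to the chatbot respectively, and the toy implementations exist only so the example runs offline. It is not the demo's actual code.

```python
from typing import Callable

REFUSAL = "Your message was flagged by the moderation filter."


def guarded_reply(
    user_input: str,
    is_flagged: Callable[[str], bool],
    generate: Callable[[str], str],
) -> str:
    """Run the moderation check first; only unflagged inputs reach the model."""
    if is_flagged(user_input):
        return REFUSAL
    return generate(user_input)


def toy_is_flagged(text: str) -> bool:
    """Hypothetical stand-in for a moderation service: flags a fixed word list."""
    blocked = {"badword"}
    return any(word in blocked for word in text.lower().split())


def toy_generate(text: str) -> str:
    """Hypothetical stand-in for the chatbot: echoes the input."""
    return f"echo: {text}"


print(guarded_reply("hello there", toy_is_flagged, toy_generate))
print(guarded_reply("some badword here", toy_is_flagged, toy_generate))
```

Passing the check in as a callable keeps the gate independent of any particular moderation provider, so a wrapper around a real service could be swapped in without changing `guarded_reply`.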

Tasmia Ansari
Tasmia is a tech journalist at AIM, looking to bring a fresh perspective to emerging technologies and trends in data science, analytics, and artificial intelligence.
