Databricks launches Dolly 2.0 a global Open Instruction-Tuned LLM

Facebook
Twitter
LinkedIn
Artificial Intelligence-Image By Gerd Altmann From Pixabay.com

Databricks, the lakehouse company, recently announced the release of Dolly 2.0, the world’s first open-source, instruction-following large language model (LLM), fine-tuned on a human-generated instruction dataset licensed for commercial use.

Industry sources added that it follows the initial release of Dolly in March 2023, an LLM trained for less than USD$30 to exhibit ChatGPT-like human interactivity (aka instruction-following). Dolly 2.0 is a 12B parameter language model based on the EleutherAI Pythia model family and fine-tuned exclusively on a new high-quality human-generated instruction-following dataset, crowdsourced among Databricks employees.

Databricks is an open-sourcing of the entirety of Dolly 2.0, including the training code, the dataset, and the model weights, all suitable for commercial use. This enables any organization to create, own, and customize powerful LLMs that can talk to people without paying for API access or sharing data with third parties.

Ali Ghodsi, CEO, of Databricks

“Dolly 2.0 is a game changer as it enables all organizations around the world to build their own bespoke models for their particular use cases to automate things and make processes much more productive in the field they’re in. With Dolly 2.0, any organization can create, own, and customize a powerful LLM to create a competitive advantage for their business,” stated Ali Ghodsi, CEO, of Databricks.

Industry sources further revealed that creating the databricks-dolly-15k dataset contains 15,000 high-quality human-generated prompt or response pairs specifically designed for instruction tuning large language models. Under the licensing terms for databricks-dolly-15k (Creative Commons Attribution-ShareAlike 3.0 Unported License), anyone can use, modify, or extend this dataset for any purpose, including commercial applications.

This dataset was created to address the limitations of existing well-known instruction-following models that prohibit commercial use due to their training data. It is the world’s first open-source, human-generated instruction dataset specifically designed to make large language models exhibit the magical interactivity of ChatGPT.

databricks-dolly-15k was authored by over 5,000 Databricks employees during March and April 2023. These training records are natural, expressive, and designed to represent a wide range of behaviors, from brainstorming and content generation to information extraction and summarization.

Press Release received on Mail

Share.

RELATED POSTS

ESET, a global leader in cybersecurity, today announced that its ESET PRIVATE portfolio will be available to demo at RSAC 2026. Image courtesy: ESET
ESET PRIVATE Showcases Security Solutions at RSAC 2026
Sanjay Kaul, Chief Revenue Officer at Circles (left) and Alex Kang, Huawei Cloud Ecosystem President (right) sign the strategic collaboration agreement at MWC26 (Image Courtesy: PRNewswire)
Circles partners with Huawei to launch AI-Native telecom solutions
Armor Dash gives C-suite and board leaders a real-time view of security posture, compliance, and AI readiness — pulled directly from source systems, with nothing to assemble. (Image Courtesy: PRNewswire)
Armor Unveils Dash for unfiltered view of Cybersecurity and AI risk
  • ADFX honored as the "Best Forex Broker Global 2025" by International Business Magazine, recognizing our gold-standard protection and global vision. Image Courtesy: ADFX

LATEST POSTS

Dubai Taxi Company and Baidu’s Apollo Go fully driverless taxi. Image Courtesy: Dubai Taxi Company
Ajman Bank has launched "Talahom", a dedicated initiative aimed at supporting frontline personnel in the UAE, in recognition of their vital role in serving the community and contributing to its stability. Image courtesy: Ajman Bank
Sky Innovo Developments has announced the launch of “Citystars Park St.”, a landmark mixed-use development in New Cairo, representing a total development value exceeding EGP 100 billion. Image courtesy: Sky Innovo Developments
The Abu Dhabi Department of Energy (DoE) has announced the launch of the second phase of its Solar Energy Self-Supply Policy, expanding its scope to include the residential sector for the first time in Abu Dhabi. Image courtesy: DoE