{"id":74518,"date":"2023-10-05T09:00:38","date_gmt":"2023-10-05T16:00:38","guid":{"rendered":"https:\/\/github.blog\/?p=74518"},"modified":"2024-05-20T07:34:59","modified_gmt":"2024-05-20T14:34:59","slug":"a-developers-guide-to-open-source-llms-and-generative-ai","status":"publish","type":"post","link":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/","title":{"rendered":"A developer&#8217;s guide to open source LLMs and generative AI"},"content":{"rendered":"<!DOCTYPE html PUBLIC \"-\/\/W3C\/\/DTD HTML 4.0 Transitional\/\/EN\" \"http:\/\/www.w3.org\/TR\/REC-html40\/loose.dtd\">\n<html><body><p>We all know that AI is changing the world. But what happens when you combine AI with the power of open source?<\/p>\n<p>Over the past year, there has been <a href=\"https:\/\/github.blog\/2023-06-27-the-economic-impact-of-the-ai-powered-developer-lifecycle-and-lessons-from-github-copilot\/\">an explosion of open source generative AI projects on GitHub<\/a>: by our count, more than 8,000. 
They range from commercially backed large language models (LLMs) like Meta&rsquo;s <a href=\"https:\/\/github.com\/facebookresearch\/llama\">LLaMA<\/a> to experimental open source applications.<\/p>\n<p>These projects offer many benefits to open source developers and the machine learning community&mdash;and are a great way to start building new AI-powered features and applications.<\/p>\n<p>In this article, we&rsquo;ll explore:<\/p>\n<ul>\n<li>The differences between open source LLMs and closed source pre-trained models<\/li>\n<li>Best practices for fine-tuning LLMs<\/li>\n<li>The open source LLMs available today<\/li>\n<li>What the future holds for the rapidly evolving world of generative AI<\/li>\n<\/ul>\n<p>Let&rsquo;s jump in.<\/p>\n<div data-target=\"content-table-wrap.container\" class=\"content-table-wrap\"><content-table-wrap><table style=\"border: 1px black\">\n<tbody>\n<tr>\n<td><strong>Interested in building with LLMs?<\/strong> <a href=\"https:\/\/github.blog\/2023-07-17-prompt-engineering-guide-generative-ai-llms\/\">Check out our guide on prompt engineering &gt;<\/a><\/td>\n<\/tr>\n<\/tbody>\n<\/table><\/content-table-wrap><\/div>\n<h2 id=\"open-source-vs-closed-source-llms\" id=\"open-source-vs-closed-source-llms\" ><a class=\"heading-link\" href=\"#open-source-vs-closed-source-llms\">Open source vs. closed source LLMs<span class=\"heading-hash pl-2 text-italic text-bold\" aria-hidden=\"true\"><\/span><\/a><\/h2>\n<p>By now, most of us are familiar with LLMs: <a href=\"https:\/\/github.blog\/2023-04-14-how-generative-ai-is-changing-the-way-developers-work\/\">neural network-based language models trained on vast quantities of data<\/a> to mimic human behavior by <a href=\"https:\/\/github.blog\/2023-04-07-what-developers-need-to-know-about-generative-ai\/#what-is-generative-ai-used-for\">performing various downstream tasks<\/a>, like question answering, translation, and summarization. 
LLMs have disrupted the world with the introduction of tools like ChatGPT and <a href=\"https:\/\/github.com\/features\/copilot\">GitHub Copilot<\/a>.<\/p>\n<p>Open source LLMs differ from their closed counterparts regarding the source code (and sometimes other components, as well). With closed LLMs, the source code&mdash;which explains how the model is structured and how the training algorithms work&mdash;isn&rsquo;t published.<\/p>\n<p>&ldquo;When you&rsquo;re doing research, you want access to the source code so you can fine-tune some of the pieces of the algorithm itself,&rdquo; says <a href=\"https:\/\/github.com\/whatsinfinitum\">Alireza Goudarzi<\/a>, a senior researcher of machine learning at GitHub. &ldquo;With closed models, it&rsquo;s harder to do that.&rdquo;<\/p>\n<p>Open source LLMs help the industry at large: because so many people contribute, they can be developed faster than closed models. They can also be more effective for edge cases or specific applications (like local language support), can <span style=\"font-weight: 400\">contain bespoke security controls, and can run on local models.<\/span><\/p>\n<p>But closed models&mdash;often built by larger companies&mdash;have advantages, too. For one, they&rsquo;re embedded in systems with filters for biased information, inappropriate language, and other questionable content. They also frequently have security measures baked in. Plus, they don&rsquo;t need fine-tuning, a specialized skill set requiring dedicated people and teams.<\/p>\n<p>&ldquo;Closed, off-the-shelf LLMs are high quality,&rdquo; notes <a href=\"https:\/\/github.com\/eaftan\">Eddie Aftandilian<\/a>, a principal researcher at GitHub. 
&ldquo;They&rsquo;re often far more accessible to the average developer.&rdquo;<\/p>\n<h2 id=\"how-to-fine-tune-open-source-llms\" id=\"how-to-fine-tune-open-source-llms\" ><a class=\"heading-link\" href=\"#how-to-fine-tune-open-source-llms\">How to fine-tune open source LLMs<span class=\"heading-hash pl-2 text-italic text-bold\" aria-hidden=\"true\"><\/span><\/a><\/h2>\n<p>Fine-tuning open source models is typically done on the cloud provider that hosts the LLM, such as AWS, Google Cloud, or Microsoft Azure. Fine-tuning allows you to optimize the model by creating more advanced language interactions in applications like virtual assistants and chatbots. This can improve model accuracy by anywhere from five to 10 percent.<\/p>\n<p>As for best practices? Goudarzi recommends being careful about data sampling and being clear about the specific needs of the application you&rsquo;re trying to build. <span style=\"font-weight: 400\">The curated data should match your needs exactly since the models are pre-trained on anything you can find online.<\/span><\/p>\n<p>&ldquo;You need to emphasize certain things related to your objectives,&rdquo; he says. &ldquo;Let&rsquo;s say you&rsquo;re trying to create a model to process TV and smart home commands. You&rsquo;d want to preselect your data to have more of a command form.&rdquo;<\/p>\n<p>This will help optimize model efficiency.<\/p>\n<aside class=\"p-4 p-md-6 post-aside--large\"><p class=\"h5-mktg gh-aside-title\">Looking to fine-tune your open source LLM? Try LoRA.<\/p><p>Microsoft offers the open source LoRA (Low-Rank Adaptation of Large Language Models) project <a href=\"https:\/\/github.com\/microsoft\/LoRA\">on GitHub<\/a>, which can be a useful tool for fine-tuning LLMs.<\/p>\n<ul>\n<li>LoRA is a training method that uses a mathematical trick to decompose large matrices into smaller ones. 
This leads to fewer parameters and more storage efficiency, resulting in quicker processing time.<\/li>\n<li>Techniques like LoRA can help you deploy LLMs to many customers, since they only require saving small matrices.<\/li>\n<li>Other techniques for fine-tuning LLMs include <a href=\"https:\/\/magazine.sebastianraschka.com\/p\/understanding-parameter-efficient\">hard tuning, soft tuning, and prefix tuning<\/a>.<\/li>\n<\/ul>\n<\/aside>\n<\/p><h2 id=\"choosing-your-model\" id=\"choosing-your-model\" ><a class=\"heading-link\" href=\"#choosing-your-model\">Choosing your model<span class=\"heading-hash pl-2 text-italic text-bold\" aria-hidden=\"true\"><\/span><\/a><\/h2>\n<p>Which open source model is best for you? Aftandilian recommends focusing on models&rsquo; performance benchmarks against different scenarios, such as reasoning, domain-specific understanding of law or science, and linguistic comprehension.<\/p>\n<p>However, don&rsquo;t assume that the benchmark results are correct or meaningful.<\/p>\n<p>&ldquo;Rather, ask yourself, how good is this model at a particular task?&rdquo; he says. &ldquo;It&rsquo;s pretty easy to let benchmarks seep into the training set due to lack of deep understanding, skewed performance, or limited generalization.&rdquo;<\/p>\n<p>When this happens, the model is trained on its own evaluation data. &ldquo;Which would make it look better than it should,&rdquo; Aftandilian says.<\/p>\n<p>You should also consider how much the model costs to run and its overall latency. A large model, for instance, might be exceptionally powerful. <span style=\"font-weight: 400\">But if it takes minutes rather than seconds to generate responses, there may be better options. 
<\/span>(For example, the models that power GitHub Copilot in the IDE feature a latency rate of less than ten milliseconds, which is well-suited for getting quick suggestions.)<\/p>\n<figure id=\"attachment_74520\"  class=\"wp-caption aligncenter mx-0\"><a href=\"https:\/\/resources.github.com\/newsletter\/\"><img data-recalc-dims=\"1\" decoding=\"async\" loading=\"lazy\" class=\"width-fit size-large wp-image-74520 width-fit\" src=\"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/insider-signup.png?w=1024&#038;h=256\" alt=\"\"  srcset=\"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/insider-signup.png?w=1600 1600w, https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/insider-signup.png?w=300 300w, https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/insider-signup.png?w=768 768w, https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/insider-signup.png?w=1024 1024w, https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/insider-signup.png?w=1536 1536w\" sizes=\"(max-width: 1000px) 100vw, 1000px\" \/><\/a><figcaption class=\"text-mono color-fg-muted mt-14px f5-mktg\">Supercharge your productivity with our monthly developer newsletter.<\/figcaption><\/figure>\n<h2 id=\"open-source-llms-available-today\" id=\"open-source-llms-available-today\" ><a class=\"heading-link\" href=\"#open-source-llms-available-today\">Open source LLMs available today<span class=\"heading-hash pl-2 text-italic text-bold\" aria-hidden=\"true\"><\/span><\/a><\/h2>\n<p>There are several open source commercially licensed models available. 
These include:<\/p>\n<ul>\n<li><strong><a href=\"https:\/\/github.com\/openlm-research\/open_llama\">OpenLLaMA<\/a>:<\/strong> <span style=\"font-weight: 400\">An open source reproduction of Meta&rsquo;s LLaMA model, developed by <a href=\"https:\/\/github.com\/UCBAIR\">Berkeley AI Research<\/a><\/span><span style=\"font-weight: 400\">, this project provides permissively licensed models with 3B, 7B, and 13B parameters, and is trained on one trillion tokens.<\/span> <span style=\"font-weight: 400\">OpenLLaMA models have been evaluated on tasks using the <a href=\"https:\/\/foundation.mozilla.org\/en\/blog\/evaluation-harness-is-setting-the-benchmark-for-auditing-large-language-models\/\">lm-evaluation-harness<\/a><\/span><span style=\"font-weight: 400\">&nbsp;and perform comparably to the original LLaMA and GPT-J across most tasks. But because of the tokenizer&rsquo;s configuration, the models aren&rsquo;t great for code generation tasks with empty spaces. <\/span><\/li>\n<li><strong><a href=\"https:\/\/github.com\/huggingface\/blog\/blob\/main\/falcon.md#the-falcon-models\">Falcon-Series<\/a>:<\/strong> Developed by <a href=\"https:\/\/github.com\/tiiuae\">Abu Dhabi&rsquo;s Technology Innovation Institute (TII)<\/a>, Falcon-Series consists of two models: Falcon-40B and Falcon-7B. <span style=\"font-weight: 400\">The series has a unique training data pipeline that extracts content with deduplication and filtering from web data. The models also use multi-query attention, which improves the scalability of inference. <\/span><span style=\"font-weight: 400\">Falcon can generate human-like text, translate languages, and answer questions.<\/span><\/li>\n<li><strong><a href=\"https:\/\/github.com\/mosaicml\/llm-foundry\">MPT-Series<\/a>:<\/strong> A set of decoder-only large language models, MPT-Series models have been trained on one trillion tokens spanning code, natural language text, and scientific text. 
Developed by <a href=\"https:\/\/github.com\/mosaicml\">MosaicML<\/a>, these models come in two specific versions: MPT-Instruct, designed to be task-oriented, and MPT-Chat, which provides a conversational experience. <span style=\"font-weight: 400\">The series is most suitable for virtual assistants, chatbots, and other interactive user engagement tools. <\/span><\/li>\n<li><strong><a href=\"https:\/\/huggingface.co\/lmsys\/fastchat-t5-3b-v1.0\">FastChat-T5<\/a>:<\/strong> A large transformer model with three billion parameters, FastChat-T5 is a chatbot model developed by the <a href=\"https:\/\/github.com\/lm-sys\/FastChat\">FastChat<\/a> team through fine-tuning the Flan-T5-XL model. <span style=\"font-weight: 400\">Trained on 70,000 user-shared conversations, it generates responses to user inputs autoregressively and is primarily intended for commercial applications. It&rsquo;s a strong fit for applications that need language understanding, like virtual assistants, customer support systems, and interactive platforms.&nbsp;<\/span><\/li>\n<\/ul>\n<h2 id=\"the-future-of-open-source-llms\" id=\"the-future-of-open-source-llms\" ><a class=\"heading-link\" href=\"#the-future-of-open-source-llms\">The future of open source LLMs<span class=\"heading-hash pl-2 text-italic text-bold\" aria-hidden=\"true\"><\/span><\/a><\/h2>\n<p>There&rsquo;s been a flurry of activity in the open source LLM world.<\/p>\n<p>&ldquo;Developers are very active on some of these open source models,&rdquo; Aftandilian says. &ldquo;They can optimize performance, explore new use cases, and push for new algorithms and more efficient data.&rdquo;<\/p>\n<p>And that&rsquo;s just the start.<\/p>\n<p>Meta&rsquo;s LLaMA model is now available for commercial use, allowing businesses to create their own AI solutions.<\/p>\n<p>Goudarzi&rsquo;s team has been thinking about how they can distill open source LLMs and reduce their size. 
If smaller, the models could be installed on local machines, and you could have your own mini version of GitHub Copilot, for instance. But for now, open source models often need financial support due to their extensive infrastructure and operating costs.<\/p>\n<p>One thing that surprised Goudarzi: originally, the machine learning community thought that more advanced generative AI would require more advanced algorithms. But that hasn&rsquo;t been the case.<\/p>\n<p>&ldquo;The simple algorithm actually stays the same, regardless of how much it can do,&rdquo; he says. &ldquo;Scaling is the only change, which is completely mind-blowing.&rdquo;<\/p>\n<p>Who knows how open source LLMs will revolutionize the developer landscape.<\/p>\n<p>&ldquo;I&rsquo;m excited that we&rsquo;re seeing so many open source LLMs now,&rdquo; Goudarzi says. <span style=\"font-weight: 400\">&ldquo;When developers start building with these models, the possibilities are endless.&rdquo;<\/span><\/p>\n<div class=\"post-content-cta\"><p>Interested in how generative AI can help optimize your productivity? 
<a href=\"\/\/github.blog\/2023-06-08-developer-experience-what-is-it-and-why-should-you-care\/\">Read our guide on developer experience &gt;<\/a><\/p>\n<\/div>\n<\/body><\/html>\n","protected":false},"excerpt":{"rendered":"<p>Open source generative AI projects are a great way to build new AI-powered features and apps.<\/p>\n","protected":false},"author":2072,"featured_media":74519,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_gh_post_show_toc":"yes","_gh_post_is_no_robots":"no","_gh_post_is_featured":"no","_gh_post_is_excluded":"no","_gh_post_is_unlisted":"no","_gh_post_related_link_1":"","_gh_post_related_link_2":"","_gh_post_related_link_3":"","_gh_post_sq_img":"","_gh_post_sq_img_id":"","_gh_post_cta_title":"","_gh_post_cta_text":"","_gh_post_cta_link":"","_gh_post_cta_button":"Click Here to Learn More","_gh_post_recirc_hide":"no","_gh_post_recirc_col_1":"gh-auto-select","_gh_post_recirc_col_2":"77524","_gh_post_recirc_col_3":"65303","_gh_post_recirc_col_4":"65316","_featured_video":"","_gh_post_additional_query_params":"","_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_wpas_customize_per_network":false,"_links_to":"","_links_to_target":""},"categories":[3293,3296],"tags":[2837,3241,3028,3064],"coauthors":[2922],"class_list":["post-74518","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-and-ml","category-llms","tag-ai","tag-ai-insights","tag-generative-ai","tag-llm"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.3 (Yoast SEO v27.3) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>A developer&#039;s guide to open source LLMs and 
generative AI - The GitHub Blog<\/title>\n<meta name=\"description\" content=\"Open source generative AI projects are a great way to build AI-powered features and apps.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"A developer&#039;s guide to open source LLMs and generative AI\" \/>\n<meta property=\"og:description\" content=\"Open source generative AI projects are a great way to build AI-powered features and apps.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/\" \/>\n<meta property=\"og:site_name\" content=\"The GitHub Blog\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-05T16:00:38+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-05-20T14:34:59+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"630\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Gwen Davis\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Gwen Davis\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/\"},\"author\":{\"name\":\"Gwen Davis\",\"@id\":\"https:\\\/\\\/github.blog\\\/#\\\/schema\\\/person\\\/96c3a97beb33cf89e366d250b3442d7d\"},\"headline\":\"A developer&#8217;s guide to open source LLMs and generative AI\",\"datePublished\":\"2023-10-05T16:00:38+00:00\",\"dateModified\":\"2024-05-20T14:34:59+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/\"},\"wordCount\":1373,\"image\":{\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/github.blog\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/AI-DarkMode-2.png?fit=1200%2C630\",\"keywords\":[\"AI\",\"AI Insights\",\"generative AI\",\"LLM\"],\"articleSection\":[\"AI &amp; ML\",\"LLMs\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/\",\"url\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/\",\"name\":\"A developer's guide to open source LLMs and generative AI - The GitHub 
Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/github.blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/github.blog\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/AI-DarkMode-2.png?fit=1200%2C630\",\"datePublished\":\"2023-10-05T16:00:38+00:00\",\"dateModified\":\"2024-05-20T14:34:59+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/github.blog\\\/#\\\/schema\\\/person\\\/96c3a97beb33cf89e366d250b3442d7d\"},\"description\":\"Open source generative AI projects are a great way to build AI-powered features and apps.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#primaryimage\",\"url\":\"https:\\\/\\\/github.blog\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/AI-DarkMode-2.png?fit=1200%2C630\",\"contentUrl\":\"https:\\\/\\\/github.blog\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/AI-DarkMode-2.png?fit=1200%2C630\",\"width\":1200,\"height\":630},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/a-developers-guide-to-open-source-llms-and-generative-ai\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/github.blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI &amp; 
ML\",\"item\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"LLMs\",\"item\":\"https:\\\/\\\/github.blog\\\/ai-and-ml\\\/llms\\\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"A developer&#8217;s guide to open source LLMs and generative AI\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/github.blog\\\/#website\",\"url\":\"https:\\\/\\\/github.blog\\\/\",\"name\":\"The GitHub Blog\",\"description\":\"Updates, ideas, and inspiration from GitHub to help developers build and design software.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/github.blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/github.blog\\\/#\\\/schema\\\/person\\\/96c3a97beb33cf89e366d250b3442d7d\",\"name\":\"Gwen Davis\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50d04ba55630d5126102ddfe3df0462eb60feef832a532f22a31b4494e84df37?s=96&d=mm&r=g54563047c71119b06d00499b2d34167d\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50d04ba55630d5126102ddfe3df0462eb60feef832a532f22a31b4494e84df37?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50d04ba55630d5126102ddfe3df0462eb60feef832a532f22a31b4494e84df37?s=96&d=mm&r=g\",\"caption\":\"Gwen Davis\"},\"description\":\"Gwen Davis is a senior content strategist at GitHub, where she writes about developer experience, AI-powered workflows, and career growth in tech.\",\"url\":\"https:\\\/\\\/github.blog\\\/author\\\/purpledragon85\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. 
-->","yoast_head_json":{"title":"A developer's guide to open source LLMs and generative AI - The GitHub Blog","description":"Open source generative AI projects are a great way to build AI-powered features and apps.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/","og_locale":"en_US","og_type":"article","og_title":"A developer's guide to open source LLMs and generative AI","og_description":"Open source generative AI projects are a great way to build AI-powered features and apps.","og_url":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/","og_site_name":"The GitHub Blog","article_published_time":"2023-10-05T16:00:38+00:00","article_modified_time":"2024-05-20T14:34:59+00:00","og_image":[{"width":1200,"height":630,"url":"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630","type":"image\/png"}],"author":"Gwen Davis","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Gwen Davis","Est. 
reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#article","isPartOf":{"@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/"},"author":{"name":"Gwen Davis","@id":"https:\/\/github.blog\/#\/schema\/person\/96c3a97beb33cf89e366d250b3442d7d"},"headline":"A developer&#8217;s guide to open source LLMs and generative AI","datePublished":"2023-10-05T16:00:38+00:00","dateModified":"2024-05-20T14:34:59+00:00","mainEntityOfPage":{"@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/"},"wordCount":1373,"image":{"@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#primaryimage"},"thumbnailUrl":"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630","keywords":["AI","AI Insights","generative AI","LLM"],"articleSection":["AI &amp; ML","LLMs"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/","url":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/","name":"A developer's guide to open source LLMs and generative AI - The GitHub 
Blog","isPartOf":{"@id":"https:\/\/github.blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#primaryimage"},"image":{"@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#primaryimage"},"thumbnailUrl":"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630","datePublished":"2023-10-05T16:00:38+00:00","dateModified":"2024-05-20T14:34:59+00:00","author":{"@id":"https:\/\/github.blog\/#\/schema\/person\/96c3a97beb33cf89e366d250b3442d7d"},"description":"Open source generative AI projects are a great way to build AI-powered features and apps.","breadcrumb":{"@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#primaryimage","url":"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630","contentUrl":"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630","width":1200,"height":630},{"@type":"BreadcrumbList","@id":"https:\/\/github.blog\/ai-and-ml\/llms\/a-developers-guide-to-open-source-llms-and-generative-ai\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/github.blog\/"},{"@type":"ListItem","position":2,"name":"AI &amp; ML","item":"https:\/\/github.blog\/ai-and-ml\/"},{"@type":"ListItem","position":3,"name":"LLMs","item":"https:\/\/github.blog\/ai-and-ml\/llms\/"},{"@type":"ListItem","position":4,"name":"A developer&#8217;s guide to open source LLMs and generative 
AI"}]},{"@type":"WebSite","@id":"https:\/\/github.blog\/#website","url":"https:\/\/github.blog\/","name":"The GitHub Blog","description":"Updates, ideas, and inspiration from GitHub to help developers build and design software.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/github.blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/github.blog\/#\/schema\/person\/96c3a97beb33cf89e366d250b3442d7d","name":"Gwen Davis","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/50d04ba55630d5126102ddfe3df0462eb60feef832a532f22a31b4494e84df37?s=96&d=mm&r=g54563047c71119b06d00499b2d34167d","url":"https:\/\/secure.gravatar.com\/avatar\/50d04ba55630d5126102ddfe3df0462eb60feef832a532f22a31b4494e84df37?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/50d04ba55630d5126102ddfe3df0462eb60feef832a532f22a31b4494e84df37?s=96&d=mm&r=g","caption":"Gwen Davis"},"description":"Gwen Davis is a senior content strategist at GitHub, where she writes about developer experience, AI-powered workflows, and career growth in 
tech.","url":"https:\/\/github.blog\/author\/purpledragon85\/"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/github.blog\/wp-content\/uploads\/2023\/10\/AI-DarkMode-2.png?fit=1200%2C630","jetpack_shortlink":"https:\/\/wp.me\/pamS32-jnU","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/posts\/74518","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/users\/2072"}],"replies":[{"embeddable":true,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/comments?post=74518"}],"version-history":[{"count":19,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/posts\/74518\/revisions"}],"predecessor-version":[{"id":74580,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/posts\/74518\/revisions\/74580"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/media\/74519"}],"wp:attachment":[{"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/media?parent=74518"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/categories?post=74518"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/tags?post=74518"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/github.blog\/wp-json\/wp\/v2\/coauthors?post=74518"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}