Smaller, more performant models such as LLaMA enable others in the research community who don’t have access to large amounts of infrastructure to study these models, further democratizing access in this important, fast-changing field. https://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA-13BはほとんどのベンチマークでGPT-3(175B)を上回り、LLaMA-65Bは最高のモデルであるChinchilla-70BとPaLM-540Bに匹敵する性能を持っている。