/lmg/ - a general dedicated to the discussion and development of local language models.
Previous threads: >>99710043 & >>99702452
►News
>(03/28) Jamba 52B MoE released with 256k context: https://huggingface.co/ai21labs/Jamba-v0.1
>(03/27) Databricks releases 132B MoE model: https://huggingface.co/collections/databricks/dbrx-6601c0852a0cdd3c59f71962
>(03/23) Mistral releases 7B v0.2 base model with 32k context: https://models.mistralcdn.com/mistral-7b-v0-2/mistral-7B-v0.2.tar
>(03/23) Grok support merged: https://github.com/ggerganov/llama.cpp/pull/6204
>(03/17) xAI open sources Grok: https://github.com/xai-org/grok
>(03/15) Control vector support in llama.cpp: https://github.com/ggerganov/llama.cpp/pull/5970
►FAQ: https://wikia.schneedc.com
►Glossary: https://archive.today/E013q | https://rentry.org/local_llm_glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png
►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers
►Benchmarks
General Purpose:
https://hf.co/spaces/HuggingFaceH4/open_llm_leaderboard
https://hf.co/spaces/lmsys/chatbot-arena-leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/GGUF-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling/index.xhtml
►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp