15 Best Local LLM For Coding [SOTA]

Large language models (LLMs) are a type of artificial intelligence (AI) that are trained on massive datasets of text and code. This allows them to generate text, translate languages, write different kinds of creative content, and answer your questions in an informative way.

Contents

In recent years, there has been a growing interest in developing LLMs for coding applications. This is because LLMs can be used to generate code, debug code, and even write entire programs.

Wavecoder-ultra-6.7b

Out of all the code-crunching champs we tested, WaveCoder-Ultra-6.7B by Microsoft is a real powerhouse. This bad boy uses a fancy way of learning (instruction-following, they call it) to tackle those pesky coding problems. Trained on a bunch of super-useful code snippets, WaveCoder-Ultra-6.7B can handle four major coding tasks like a boss:

Code Generation: Need some fresh code written? Just tell this thing what you want, and it’ll whip it up for you in no time.
Code Summary: Don’t have time to untangle a giant mess of code? WaveCoder-Ultra-6.7B can break it down into a clear and short summary for you.
Code Translation: Talking to a computer in the wrong language? No problem! This LLM can translate your code from one programming language to another.

Code Repair: Got a bug in your code acting like a gremlin? WaveCoder-Ultra-6.7B can find and fix those errors for you, like a code-cleaning superhero.

The results show that WaveCoder-Ultra-6.7B scores a super high 79.9 on this “HumanEval” thing, which basically means it’s really good at understanding code just like a human would. It also does well in different areas, like explaining code (scoring a 45.7) and fixing it (with a score of 52.3).

Sure, it might not be the absolute best at everything (looking at you, GPT-4), but WaveCoder-Ultra-6.7B is a great option because it focuses specifically on code.

Feature	Description
Model Name	WizardCoder-Python-34B-V1.0
Specialization	Python
Training Data	100 billion tokens of Python code
Benchmark Performance	Second place with a score of 73.2, outperforming GPT-4, ChatGPT-3.5, and Claude2
Utility	Efficient and accurate code generation for Python-based projects
Use Cases	Developers and AI practitioners for precise coding solutions

Feature	Description
Model Name	Phind-CodeLlama-34B-v1
Specialization	None specified
Performance	67.6% pass rate at rank 1 on HumanEval, 69.5% for CodeLlama-34B-Python
Training Data	Fine-tuned on proprietary datasets
Reliability	Decontamination methodology for credibility
Training Details	Trained for two epochs on 80,000 programming problems and solutions
Technology Used	DeepSpeed ZeRO 3 and Flash Attention 2 for efficient training

Feature	Description
Model Name	WizardCoder-15B
Specialization	Coding tasks
Training Method	Evol-Instruct method
Foundation	Fine-tuned from StarCoder
Performance	Comparable to ChatGPT, Bard, and Claude for coding tasks
Use Cases	Code generation, understanding programming concepts, debugging

Feature	Description
Model Name	Code Llama (7B, 13B, 34B)
Parameter Configurations	7B, 13B, 34B
Training Data	500 billion tokens of code and code-related data
Features	FIM capabilities for code insertion and completion
Specializations	Code Llama – Python and Code Llama – Instruct
Specialization Details	Code Llama – Python fine-tuned on 100 billion tokens of Python code

Wavecoder-ultra-6.7b

CodeQwen1.5-7B-Chat

Deepseek Coder

WizardCoder-Python-34B-V1.0 (7 to 34B)

Phind-CodeLlama-34B-v1

Moe-2x7b-QA-Code

WizardCoder-15B

Stable Code 3B

Code Llama (7, 13, 34b)

OctoCoder

Redmond-Hermes-Coder 15B

phi-1

DeciCoder 1B

StableCode-Instruct-Alpha-3B

CodeGen2.5-7B

Recent

Hallucination in LLM is Advantage

Best Open Source TTS

8 Best LLM For Low End Smartphone (1 – 4 GB RAM)

6 Best Mamba Based LLM (Open Source)

Where imagination meets innovation