🔥 DeepSeek-V2 is a 236-billion-parameter model built on a mixture-of-experts (MoE) architecture.
💡 It has 160 routed experts, of which only a small subset is activated for each token, and a 128K-token context window.
🚀 DeepSeek-V2 is reported to perform on par with or better than GPT-4 and Claude on several benchmarks.
💰 The model is open source, and API usage costs about $0.28 per 1 million tokens.
🌍 It supports English and Chinese.
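
As a rough illustration of the mixture-of-experts idea in the list above, the sketch below shows generic top-k routing: a router scores every expert for a token, and only the k highest-scoring experts are run, with their outputs weighted by a softmax over the selected scores. The expert count of 160 comes from the summary; the function name `top_k_route`, the random router scores, and `TOP_K = 6` are assumptions made for illustration, and this is not presented as DeepSeek-V2's actual router.

```python
import math
import random


def top_k_route(scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    # Indices of the k largest router scores (ties keep their original order).
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    # Softmax over the selected scores only, so the k weights sum to 1.
    peak = max(scores[i] for i in top)
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


NUM_EXPERTS = 160  # expert count taken from the summary
TOP_K = 6          # illustrative assumption: experts run per token

# Hypothetical router scores for a single token (random stand-ins).
rng = random.Random(0)
scores = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

routes = top_k_route(scores, TOP_K)
for expert, weight in routes:
    print(f"expert {expert:3d} weight {weight:.3f}")
```

Because only `TOP_K` of the `NUM_EXPERTS` experts run for a given token, the compute per token is much smaller than the total parameter count suggests, which is the main appeal of this kind of architecture.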