DeepSeek’s chatbot with the R1 model is a stunning release from the Chinese startup. While it’s an innovation in training efficiency, hallucinations still run rampant.
One possible answer being floated in tech circles is distillation, an AI training method in which a larger "teacher" model is used to train a smaller, faster "student" model.
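To make the teacher-and-student idea concrete, here is a minimal sketch of one common way distillation is formulated: the student is penalized for diverging from the teacher's temperature-softened output probabilities. It is an illustration only, not a description of how DeepSeek or any other company trained its models. The function names, the example logits and the temperature value are all assumptions chosen for the example.

```python
import math

def log_softmax(logits, temperature=1.0):
    """Log-probabilities from raw scores, softened by a temperature (hypothetical helper)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_sum for x in scaled]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the teacher's softened distribution to the student's.

    A value of zero means the student reproduces the teacher's output
    distribution exactly; larger values mean a worse match.
    """
    teacher_log_p = log_softmax(teacher_logits, temperature)
    student_log_p = log_softmax(student_logits, temperature)
    return sum(
        math.exp(t) * (t - s)
        for t, s in zip(teacher_log_p, student_log_p)
    )

# Made-up scores over three possible answers, for illustration only.
teacher = [2.0, 1.0, 0.1]
close_student = [1.9, 1.1, 0.2]
far_student = [0.1, 1.0, 2.0]

print(distillation_loss(teacher, close_student))  # small: student nearly agrees
print(distillation_loss(teacher, far_student))    # larger: student disagrees
```

In a real training run this quantity would be minimized by gradient descent over many examples, typically with a deep learning framework rather than plain Python, and often combined with a standard loss on the true labels.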
Investing.com - The emergence of Chinese start-up DeepSeek's cut-price artificial intelligence model suggests the expenses related to training similar models are set to decline "substantially", ...