Fedor Zhdanov

VP, Head of Applied AI | AI & ML Advisor | ex-AWS Principal Scientist

AI Innovations: How DeepSeek Changes the Game in Business

The responsible use of AI in business applications is a nuanced process that requires strategic planning, skilled team composition, and informed decision-making. Developing AI solutions involves multiple stages: from prototyping to creating a Minimum Viable Product (MVP) and scaling into a stable, production-ready system. Each stage demands careful resource allocation and iterative testing to ensure the AI solution aligns with business goals while remaining cost-effective. 

Team composition is equally critical. Successful AI projects typically often require a multidisciplinary team that includes Machine Learning Engineers (MLEs), Data Engineers, ML Researchers, DevOps specialists, AI engineers, and domain experts. MLEs act as versatile contributors who bridge research and production, while researchers focus on model innovation, and DevOps ensures smooth deployment pipelines. Well-planned hiring process—such as starting with generalists like MLEs and scaling with specialists as needed—can optimize both costs and efficiency.

Strategic choices, such as whether to build on top of proprietary models or adopt open-source alternatives, also play a pivotal role. Open-source models like the Llama family, or DeepSeek R1 for reasoning tasks, offer transparency, cost-efficiency, and local hosting options for enhanced security and data privacy. However, closed-source models often provide better alignment with safety standards and some advanced capabilities in narrow areas. Businesses must weigh these trade-offs based on their specific needs. With the release of DeepSeek R1, the industry has shown that high-performance AI can be achieved at much lower costs than previously believed—estimated by some to cost only $6 million for training, compared to the hundreds of millions spent by competitors. This achievement is partly due to a new Multi-Head Attention mechanism and the use of “pure” reinforcement learning (RL) fine-tuning on synthetic data before starting the alignment phase. The use of synthetic data was undoubtedly a key factor in enabling such an efficient training process. While the $6 million figure is highly debated—prior infrastructure and research investments suggest a much higher estimate of around $500 million—the open-sourcing of DeepSeek R1’s model weights and paper provides businesses with a cost-effective option to choose between “build” and “buy,” allowing them to differentiate their products in terms of both capabilities and data privacy.

In conclusion, businesses looking to integrate AI should focus on building agile teams, adopting iterative development practices, and making informed strategic decisions about technology adoption. By doing so, they can harness AI’s potential while navigating its complexities responsibly.