Solution Overview
This solution helps you quickly set up the DeepSeek-R1 distilled models on a Huawei Cloud Flexus X instance (Elastic Cloud Server, ECS). DeepSeek-R1 is a high-performance AI reasoning model focused on mathematical, coding, and natural language reasoning tasks. By deploying the DeepSeek-R1 distilled models on the cloud server through Ollama, you can quickly create a personal AI assistant suited to the following applications:
1. Natural Language Processing (NLP): Understands and generates natural language text, suitable for tasks such as dialogue, translation, and summarization.
2. Text Generation: Produces coherent and logically sound text, applicable to content creation and story writing.
3. Question Answering System: Answers user queries, ideal for customer service and knowledge base searches.
4. Sentiment Analysis: Analyzes the emotional tone of text, useful for market research and public opinion monitoring.
5. Text Classification: Categorizes text, applicable to spam filtering and news categorization.
6. Information Extraction: Extracts key information from text, suitable for data mining and knowledge graph construction.
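Once the models are served through Ollama, any of the tasks above can be sent to the model over HTTP. The following is a minimal sketch, not part of the solution itself. It assumes that Ollama listens on its default port 11434 on the server, and that a distilled model was pulled under the tag `deepseek-r1:7b`; the tag, the URL, and the helper names `build_request` and `ask` are illustrative assumptions, so adjust them to match your deployment.

```python
import json
import urllib.request

# Assumptions (not stated on the solution page): Ollama runs on its default
# port 11434, and a distilled model is available under the tag "deepseek-r1:7b".
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL_TAG = "deepseek-r1:7b"


def build_request(prompt, model=MODEL_TAG, url=OLLAMA_URL):
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt, timeout=120):
    """Send the prompt to Ollama and return the generated text."""
    with urllib.request.urlopen(build_request(prompt), timeout=timeout) as resp:
        reply = json.loads(resp.read().decode("utf-8"))
    # With "stream": False, Ollama returns one JSON object whose
    # "response" field holds the full generated text.
    return reply["response"]
```

Calling `ask("Classify the sentiment of: 'The delivery was late again.'")` on the server would, under these assumptions, return the model's answer as a single string. Streaming is disabled here only to keep the sketch short.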
Solution Architecture
This solution helps you quickly set up the DeepSeek-R1 distilled models on a Huawei Cloud Flexus X instance (Elastic Cloud Server, ECS).

Building a DeepSeek Inference System
Version: 1.0.0
Last Updated: February 2025
Built By: Huawei Cloud
Time Required for Deployment: About 10 minutes
Time Required for Uninstallation: About 5 minutes
Solution Description
- This solution creates a Flexus X instance (Elastic Cloud Server, ECS) on which the DeepSeek-R1 distilled models are set up.
- This solution creates one Elastic IP (EIP) for internal and external communication.
- This solution creates a security group and configures security group rules to protect the cloud server.
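After deployment, it can be useful to confirm which ports the security group rules actually expose on the EIP. The sketch below is one hedged way to do that with plain TCP connection attempts. The address `203.0.113.10` is a documentation placeholder rather than a real EIP, and the ports listed (22 for SSH, 11434 for Ollama's default port) are assumptions about what a deployment might open; the solution page does not list the rules.

```python
import socket


def is_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def reachable_ports(host, ports, timeout=3.0):
    """Map each port to whether it accepted a TCP connection."""
    return {port: is_port_open(host, port, timeout) for port in ports}


# Hypothetical values for illustration only; replace with your own EIP
# and the ports your security group rules are expected to allow.
EXAMPLE_EIP = "203.0.113.10"
EXAMPLE_PORTS = [22, 11434]
```

Running `reachable_ports(EXAMPLE_EIP, EXAMPLE_PORTS)` from a machine outside the cloud network shows which of the listed ports accept connections. A port reported as closed may be blocked by the security group or may simply have no service listening on it.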