Tagged articles

586 articles

Page 4 of 6

Mar 5, 2025 · Industry Insights

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

The article analyzes DeepSeek's recent releases, the V3 dialogue model and the R1 reasoning model, covering their launch dates, rapid surge in popularity, and R1's reinforcement‑learning‑based design for code and math tasks, with links to related Peking University technical reports.

AI · DeepSeek · Large Language Models

0 likes · 3 min read

Java Architect Essentials

Mar 5, 2025 · Artificial Intelligence

How to Seamlessly Integrate DeepSeek AI into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys and models, and using the AI assistant for code generation, completion, explanation, and troubleshooting directly within IntelliJ IDEA, while also showing usage statistics.

AI code assistant · CodeGPT · DeepSeek

0 likes · 8 min read

Model Perspective

Mar 5, 2025 · Artificial Intelligence

Can AI Really Crack NP‑Hard Problems? Inside the DeepSeek‑R1 Breakthrough

Researchers from Nanjing University of Aeronautics, Nanjing University of Technology and Oxford show that high‑instruction prompts dramatically boost large language models' mathematical reasoning, enabling DeepSeek‑R1 and Qwen2.5 to solve complex polynomial tasks and even produce a new counterexample to Hilbert's 17th problem.

AI · DeepSeek · Mathematical Reasoning

0 likes · 6 min read

Tencent Cloud Developer

Mar 5, 2025 · Artificial Intelligence

DeepSeek Series Overview: Core Technologies, Model Innovations, and Product Highlights

The article delivers a PPT‑style deep dive into the DeepSeek series—from the original LLM through DeepSeek‑MoE, Math, V2, V3 and R1—highlighting core innovations such as Multi‑Head Latent Attention, fine‑grained MoE, GRPO reinforcement learning, Multi‑Token Prediction, DualPipe parallelism and FP8 training that together achieve high performance at a fraction of traditional costs, and notes their integration into Tencent’s OlaChat intelligent assistant.

AI · DeepSeek · FP8 training

0 likes · 21 min read

Open Source Linux

Mar 5, 2025 · Artificial Intelligence

How DeepSeek‑R1 Redefines Prompt Engineering and Real‑World AI Deployment

The article analyzes DeepSeek‑R1’s low‑cost inference architecture, Chinese language optimizations, novel prompt‑engineering techniques, and the practical challenges of deploying large domestic models, offering insights into vertical AI applications and the evolving open‑source ecosystem in China.

AI deployment · DeepSeek · Large Language Model

0 likes · 8 min read

Data Thinking Notes

Mar 4, 2025 · Artificial Intelligence

Unlock AI-Powered Research: The DeepSeek‑R1 & DeepResearch Guide

Compiled by Tsinghua University experts, this guide systematically analyzes the DeepSeek‑R1 reasoning model and the DeepResearch platform, offering multi‑model comparisons, real‑world case studies, and end‑to‑end AI‑driven solutions from data collection to report generation for researchers.

AI research · Data Automation · DeepSeek

0 likes · 6 min read

Big Data Tech Team

Mar 4, 2025 · Industry Insights

100 Real-World DeepSeek Scenarios: How AI Is Reshaping Industries

The article analyzes DeepSeek's open‑source model launch, its rapid user growth, and presents a comprehensive list of 100 practical AI use cases across sectors—grouped by frequency and adoption stage—to illustrate the model's market impact and future potential.

AI applications · Artificial Intelligence · DeepSeek

0 likes · 16 min read

Alibaba Cloud Developer

Mar 4, 2025 · Artificial Intelligence

Build a Smart Knowledge Base with DeepSeek R1 and Alibaba Cloud Low‑Code

This tutorial guides you through creating an AI‑powered, customizable knowledge space by integrating DeepSeek R1 via Alibaba Cloud Bailian's Model‑as‑a‑Service with the low‑code Mobinext platform, covering setup, configuration, deployment, and future expansion for multi‑tenant use.

AI · Alibaba Cloud · DeepSeek

0 likes · 12 min read

JD Tech Talk

Mar 4, 2025 · Artificial Intelligence

Building a Local Personal Knowledge Base with Ollama, DeepSeek‑R1, AnythingLLM and Integrating Continue into VSCode

This guide walks through setting up a local personal knowledge base using Ollama, DeepSeek‑R1, and AnythingLLM, and demonstrates how to integrate the Continue AI code assistant into VSCode, covering installation, configuration, and usage tips for efficient, secure development.

AI integration · AnythingLLM · DeepSeek

0 likes · 10 min read

JD Cloud Developers

Mar 4, 2025 · Artificial Intelligence

Build a Local AI Knowledge Base with Ollama, DeepSeek‑R1, AnythingLLM & VSCode

This guide walks you through setting up a powerful local AI knowledge base using Ollama, DeepSeek‑R1, and AnythingLLM, and shows how to integrate the Continue extension into VSCode for seamless, secure, and efficient development workflows.

AI knowledge base · AnythingLLM · DeepSeek

0 likes · 12 min read

Huawei Cloud Developer Alliance

Mar 4, 2025 · Artificial Intelligence

Build a RAG Vector Database with DeepSeek on a Cloud Host – Step‑by‑Step Guide

This tutorial explains how to deploy the DeepSeek‑r1:1.5b model on a cloud server using Ollama, create a retrieval‑augmented generation (RAG) vector database with the mxbai‑embed‑large embedding model, and build an interactive AI application that answers questions from uploaded PDFs.

AI · DeepSeek · Ollama

0 likes · 6 min read

Java Web Project

Mar 4, 2025 · Artificial Intelligence

How to Seamlessly Integrate DeepSeek AI into IntelliJ IDEA for Java Development

This step‑by‑step guide shows Java developers how to prepare their environment, install the CodeGPT plugin, configure DeepSeek with an API key and model settings, and then use the assistant for code generation, completion, explanation, question answering, and usage monitoring within IntelliJ IDEA.

AI code assistant · CodeGPT · DeepSeek

0 likes · 8 min read

Alibaba Cloud Big Data AI Platform

Mar 4, 2025 · Artificial Intelligence

Deploy a High‑Performance RAG Service with Hologres, DeepSeek, and PAI‑EAS

This guide walks you through building a Retrieval‑Augmented Generation (RAG) system by integrating Alibaba Cloud's Hologres vector store, the Proxima high‑performance vector engine, and DeepSeek large language models via PAI‑EAS, covering prerequisites, deployment steps, configuration, and inference verification.

AI deployment · DeepSeek · Hologres

0 likes · 12 min read

Architect

Mar 3, 2025 · Artificial Intelligence

Unlocking Reasoning LLMs: Methods, DeepSeek R1 Insights, and Cost‑Effective Strategies

This article examines how to build and improve reasoning‑capable large language models, explains the definition and use‑cases of reasoning models, details DeepSeek‑R1’s training pipeline, compares four key enhancement methods—including inference‑time scaling, pure RL, SFT + RL, and distillation—and offers budget‑friendly advice.

AI research · DeepSeek · Inference Scaling

0 likes · 27 min read

AI Algorithm Path

Mar 3, 2025 · Artificial Intelligence

DeepSeek‑R1 Model Performance: Comparing 32B, 70B, and R1

This article evaluates DeepSeek‑R1’s 32B and 70B distilled models alongside the original R1 on a range of reasoning and coding tasks, detailing hardware setup, test methodology, per‑task results, and a comparative analysis of their strengths and weaknesses.

32B · 70B · DeepSeek

0 likes · 6 min read

DataFunSummit

Mar 3, 2025 · Artificial Intelligence

DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training

The DeepSeek open‑source week introduced seven breakthrough technologies—FlashMLA, DeepGEMM, DeepEP, DualPipe, EPLB, 3FS, and Smallpond—that together overhaul data flow, algorithmic complexity, hardware utilization, MoE communication, and resource balancing, dramatically improving large‑model training efficiency and lowering entry barriers for the AI industry.

AI hardware · DeepSeek · Large Models

0 likes · 17 min read

Alibaba Cloud Developer

Mar 3, 2025 · Mobile Development

Build a WeChat Mini‑Program Without Writing Code Using AI

This article demonstrates how a non‑programmer can use the DeepSeek‑powered “AI Programmer” mode in Tongyi Lingma to generate, modify, and deploy a functional WeChat mini‑program entirely through natural language prompts, complete with screenshots of each step.

AI programming · DeepSeek · Mobile Development

0 likes · 5 min read

macrozheng

Mar 3, 2025 · Artificial Intelligence

Integrate DeepSeek with Spring AI: Step‑by‑Step Spring Boot Guide

This tutorial walks you through integrating DeepSeek via Spring AI into a Spring Boot project, covering Spring AI basics, obtaining an API key, adding dependencies and configuration, implementing controller endpoints, testing with Postman, and accessing the full source code.

AI integration · Chatbot · DeepSeek

0 likes · 7 min read

AI Large Model Application Practice

Mar 3, 2025 · Artificial Intelligence

Can DeepSeek‑R1 Unlock True “Deep Thinking” for Enterprise RAG?

This article examines how swapping in DeepSeek‑R1 enhances Retrieval‑Augmented Generation with deeper reasoning, outlines its benefits and pitfalls—including slower inference, higher compute costs, and hallucinations—provides a simple hallucination test, and proposes an Agentic RAG research assistant to balance accuracy and creativity.

AI reasoning · DeepSeek · LLM

0 likes · 10 min read

Java Architect Essentials

Mar 2, 2025 · Artificial Intelligence

Zero‑Code Local Deployment of DeepSeek LLM on Consumer GPUs Using Ollama

This guide explains why DeepSeek is a compelling GPT‑4‑level alternative, provides hardware recommendations for various model sizes, and walks through a three‑step Windows deployment using Ollama, including installation, environment configuration, model download, performance tuning, and common troubleshooting tips.

AI · DeepSeek · GPU

0 likes · 8 min read

Architects' Tech Alliance

Mar 1, 2025 · Artificial Intelligence

Decoding DeepSeek: A Four‑Tier Capability Framework for Multimodal AI

The article outlines DeepSeek's four‑level capability hierarchy—basic multimodal data fusion and dynamic governance, intermediate domain modeling with causal reasoning and multi‑objective optimization, advanced complex system modeling with digital twins and multi‑agent coordination, and ultimate autonomous evolution features such as concept‑space exploration and self‑programming.

Artificial Intelligence · DeepSeek · Digital Twin

0 likes · 5 min read

Rare Earth Juejin Tech Community

Mar 1, 2025 · Artificial Intelligence

Predicting Movie Box Office with Playwright Data Scraping and DeepSeek AI

This article demonstrates how to combine Playwright web‑scraping of multiple Chinese movie platforms with the DeepSeek AI model to automatically collect data and generate a scientific prediction of the box‑office revenue for the film "Ne Zha 2".

AI prediction · Data Analysis · DeepSeek

0 likes · 12 min read

ITPUB

Mar 1, 2025 · Artificial Intelligence

Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested

This article examines DeepSeek, a Chinese AGI‑focused AI model, explains prompt‑engineering techniques, and evaluates its performance across database architecture, development, and operations tasks through concrete Q&A examples, SQL plan analysis, and shell‑script generation, while also discussing its broader impact on professionals, vendors and enterprises.

AI · Database · DeepSeek

0 likes · 10 min read

Architects' Tech Alliance

Feb 28, 2025 · Artificial Intelligence

DeepSeek V3 & R1: How Their Training Costs Compare to Llama 3.1

The article analyzes DeepSeek’s latest V3 conversational model and R1 reasoning model, detailing their MoE architecture and V3's training on H800 GPUs at a reported cost of about $5.58 million, comparing compute expenses to Meta’s Llama 3.1, and showing that their API pricing is roughly one‑tenth of GPT‑4o for dialogue and one‑twentieth of OpenAI o1 for reasoning.

AI model analysis · DeepSeek · Large Language Model

0 likes · 4 min read

Alibaba Cloud Developer

Feb 28, 2025 · Artificial Intelligence

How to Deploy a Full‑Power DeepSeek R1 Model on Alibaba Cloud Without Rate Limits

This guide walks you through deploying a private DeepSeek R1 inference service on Alibaba Cloud using CAP and AgentCraft, covering architecture, one‑click deployment, database and vector model configuration, UI customization, cleanup tips, and FAQs for seamless, unlimited AI inference.

Alibaba Cloud · DeepSeek

0 likes · 8 min read

Alibaba Cloud Developer

Feb 28, 2025 · Artificial Intelligence

How DeepSeek’s RL‑Powered Time Scaling Is Redefining AI Model Training

DeepSeek’s rapid rise is examined through its RL‑based Time Scaling paradigm, cost‑effective architecture, innovative training pipeline, open‑source strategy, and security challenges, highlighting how these breakthroughs disrupt traditional AI model development, lower resource demands, and influence industry dynamics.

AI model training · DeepSeek · cost‑efficient AI

0 likes · 13 min read

Fun with Large Models

Feb 28, 2025 · Frontend Development

Build a Personal Website in 5 Minutes with Free VS Code + DeepSeek AI Coding

This step‑by‑step guide shows how to set up VS Code with the Cline and Continue extensions, configure the free DeepSeek API, generate website code using AI, and deploy the site via GitHub Pages, enabling anyone to create a personal website without paying for AI tools.

AI coding · Cline · DeepSeek

0 likes · 11 min read

Baidu Intelligent Cloud Tech Hub

Feb 27, 2025 · Artificial Intelligence

Deploy and Extend Baidu DeepSeek Enterprise Suite in Minutes

This guide walks you through quickly deploying Baidu's DeepSeek‑R1 model, accessing its WebUI, and enabling key enterprise extensions such as web search, file upload, OCR, and content moderation to integrate AI capabilities into production workflows.

AI extensions · AI model deployment · DeepSeek

0 likes · 6 min read

Alibaba Cloud Native

Feb 27, 2025 · Cloud Native

Build AI-Powered Code Review with Alibaba Cloud Flow and DeepSeek

This guide walks you through creating a custom Cloud Native Flow step that calls DeepSeek to automatically review code in Alibaba Cloud Codeup, covering token creation, API key setup, step publishing, pipeline configuration, and viewing AI‑generated review comments.

Alibaba Cloud · DeepSeek · DevOps

0 likes · 7 min read

IT Services Circle

Feb 27, 2025 · Artificial Intelligence

DeepSeek Announces FlashMLA: An Efficient Multi‑Head Latent Attention Decoding Kernel for Hopper GPUs

DeepSeek’s OpenSourceWeek introduced FlashMLA, a GPU‑optimized MLA decoding kernel for Hopper GPUs that leverages FlashAttention and CUTLASS to dramatically improve large‑model inference performance, with early adoption showing up to 30% higher compute utilization and doubled speed in some scenarios.

Artificial Intelligence · DeepSeek · FlashMLA

0 likes · 3 min read

JavaEdge

Feb 27, 2025 · Artificial Intelligence

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

This guide walks through deploying the full‑feature DeepSeek V3+R1 model on Tencent Cloud, configuring a smart knowledge‑base application, importing documentation, enabling internet search, tuning retrieval parameters, and publishing the app for public use, all without writing code.

AI · DeepSeek · Knowledge Base

0 likes · 6 min read

Model Perspective

Feb 27, 2025 · Artificial Intelligence

Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand

The article explains how DeepSeek’s low‑cost large‑language‑model training reduces GPU price pressure, yet paradoxically fuels greater demand for Nvidia hardware by lowering entry barriers, illustrating the modern Jevons paradox and its broader economic and societal implications.

AI hardware · DeepSeek · GPU demand

0 likes · 8 min read

NewBeeNLP

Feb 27, 2025 · Industry Insights

How DeepSeek’s Open‑Source Tools Exploit China‑Specific H800 GPUs to Boost AI Performance

The article analyzes DeepSeek’s three open‑source projects—FlashMLA, DeepEP, and DeepGEMM—showing how they optimize for the China‑only NVIDIA H800 GPU, contrast this with the abundant hardware resources of Western AI firms, and highlight the growing demand for talent that masters both AI models and GPU hardware.

AI hardware · DeepEP · DeepGEMM

0 likes · 7 min read

Tencent Cloud Developer

Feb 27, 2025 · Artificial Intelligence

DeepSeek LLM Series (V1‑V3, R1) Technical Overview and Analysis

The DeepSeek technical overview details the evolution from the dense 67 B V1 model through the 236 B MoE‑based V2 and 671 B V3 with FP8 training, to the RL‑only R1 series that learns reasoning without supervision, highlighting innovations such as Grouped‑Query Attention, Multi‑Head Latent Attention, auxiliary‑loss‑free load balancing for MoE, Multi‑Token Prediction, and knowledge distillation, and reporting state‑of‑the‑art benchmark results and open‑source reproduction projects.

AI research · DeepSeek · Mixture of Experts

0 likes · 37 min read

Architects' Tech Alliance

Feb 27, 2025 · Artificial Intelligence

How Inspur Metabrain R1 Server Enables 1000+ Concurrent Users for DeepSeek 671B via SGLang Optimization

The Inspur Metabrain R1 inference server, equipped with FP8 acceleration and a 1128 GB HBM3e memory pool, has been tightly integrated with SGLang 0.4.3 to run the 671‑billion‑parameter DeepSeek R1 model, delivering over 1,000 concurrent user sessions and up to 3,976 tokens/s throughput.

AI server · DeepSeek · Inference Optimization

0 likes · 5 min read

Architecture & Thinking

Feb 27, 2025 · Artificial Intelligence

How to Seamlessly Integrate DeepSeek AI into VSCode with the Cline Plugin

This step‑by‑step guide shows you how to obtain a DeepSeek API key, install VSCode and the open‑source Cline extension, configure the plugin to connect to DeepSeek, and use AI‑powered code assistance while covering common pitfalls and alternative models.

AI integration · API key · Cline

0 likes · 6 min read

Alibaba Cloud Big Data AI Platform

Feb 27, 2025 · Artificial Intelligence

Fine‑Tune DeepSeek‑R1 with MaxCompute & DataWorks on Alibaba Cloud

This step‑by‑step guide explains how to use Alibaba Cloud's MaxCompute, DataWorks, and the AI platform PAI to build a custom dataset, fine‑tune the DeepSeek‑R1 distilled model, and deploy the resulting model for practical applications.

Alibaba Cloud · Artificial Intelligence · DataWorks

0 likes · 5 min read

IT Architects Alliance

Feb 26, 2025 · Artificial Intelligence

DeepSeek Large Model: Core Architecture, Key Technologies, and Training Strategies

The article provides an in‑depth overview of DeepSeek’s large language model, detailing its mixture‑of‑experts and Transformer foundations, novel attention mechanisms, load‑balancing, multi‑token prediction, FP8 mixed‑precision training, and various training regimes such as knowledge distillation and reinforcement learning.

DeepSeek · FP8 · Knowledge Distillation

0 likes · 18 min read

Tencent Technical Engineering

Feb 26, 2025 · Artificial Intelligence

Engineers' Perspectives on DeepSeek: Technical Innovations and Implications

Thirteen engineers praise DeepSeek’s open‑source, reinforcement‑learning‑driven architecture—using FP8 storage and SFT‑free training—to deliver GPT‑4‑level reasoning at one‑twentieth the cost, enabling single‑GPU deployment, lowering barriers for academia and startups, and prompting notable market reactions that could democratize advanced AI.

AI cost reduction · DeepSeek · FP8

0 likes · 9 min read

Selected Java Interview Questions

Feb 26, 2025 · Artificial Intelligence

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys and models, and using the AI assistant for code generation, completion, explanation, and troubleshooting within IntelliJ IDEA, complete with screenshots and sample code.

AI code assistant · CodeGPT · DeepSeek

0 likes · 8 min read

58UXD

Feb 26, 2025 · Artificial Intelligence

How AI Tools Like DeepSeek Transform Design Workflow

This article shows designers how to combine AI services such as DeepSeek, JiMeng, Tripo, Tongyi and Jianying to accelerate 3D modeling, PPT creation and short‑video production, turning lengthy manual tasks into fast, creative processes.

3D modeling · AI · DeepSeek

0 likes · 5 min read

Architecture Digest

Feb 26, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models

DeepSeek4j 1.4 introduces a Java‑native framework that fully preserves DeepSeek's chain‑of‑thought and billing features, adds reactive streaming support, and provides a Spring Boot starter for effortless integration, accompanied by quick‑start code, configuration examples, and a built‑in debugging UI.

AI · API · DeepSeek

0 likes · 5 min read

Java Architecture Diary

Feb 26, 2025 · Databases

Build a Private LLM Knowledge Base with Redis and DeepSeek4J in 10 Minutes

This tutorial shows how to harness Redis's dual role as a high‑performance cache and a vector database, guiding you through Docker setup, vector storage methods, and Java Lettuce integration to build a private large‑language‑model knowledge base with DeepSeek4J.

AI · DeepSeek · Lettuce

0 likes · 6 min read

Architecture & Thinking

Feb 26, 2025 · Artificial Intelligence

Unlocking DeepSeek: A Comprehensive Guide to China’s Cutting-Edge AI Chat Model

This article provides an in‑depth overview of DeepSeek, covering its core multimodal and multilingual features, long‑context capabilities, domain optimizations, security, main functions, diverse application scenarios, and practical usage via web interface or API integration.

AI chatbot · Artificial Intelligence · DeepSeek

0 likes · 6 min read

Architect

Feb 25, 2025 · Artificial Intelligence

DeepSeek R1: Multi‑Stage Reinforcement Learning, Reward Modeling, and Distillation for a High‑Performance LLM

DeepSeek R1 builds on the DeepSeek V3 base model using a multi‑stage reinforcement learning pipeline—including GRPO optimization, rule‑based reward modeling, supervised fine‑tuning, language‑consistency rewards, rejection sampling, and distillation—to produce a high‑performing, aligned LLM capable of accurate reasoning.

DeepSeek · LLM training · Reward Modeling

0 likes · 24 min read

Architects' Tech Alliance

Feb 25, 2025 · Artificial Intelligence

What Makes DeepSeek‑R1 a Game‑Changer in AIGC? Insights from Peking University

This article summarizes a Peking University lecture on DeepSeek‑R1, detailing its core concepts, advantages, and historical significance, then explains the underlying mechanisms of large‑model AI and AIGC tools, and finally offers practical guidance for selecting and efficiently applying AI solutions.

AI model analysis · AIGC · DeepSeek

0 likes · 5 min read

Alibaba Cloud Big Data AI Platform

Feb 25, 2025 · Artificial Intelligence

Accelerate DeepSeek‑V2‑Lite Deployment with FlashMLA: A Step‑by‑Step Guide

This tutorial walks users through installing FlashMLA, integrating it with the vLLM framework, downloading the DeepSeek‑V2‑Lite‑Chat model, benchmarking various MLA implementations, and running a local inference demo that shows FlashMLA’s speed advantage on long‑sequence generation.

DeepSeek · FlashMLA · InferenceOptimization

0 likes · 16 min read

Alibaba Cloud Big Data AI Platform

Feb 25, 2025 · Artificial Intelligence

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

This step‑by‑step guide shows how to assemble a Retrieval‑Augmented Generation (RAG) system using Alibaba Cloud Milvus vector search, the DeepSeek large language model, and PAI LangStudio, covering instance creation, data upload, model deployment, connection setup, flow design, and service invocation.

AI Tutorial · DeepSeek · LLM

0 likes · 9 min read

Architecture Digest

Feb 25, 2025 · Artificial Intelligence

DeepSeek Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

DeepSeek’s distillation technology combines data and model distillation to transfer knowledge from large teacher models to compact student models, detailing its definitions, principles, key innovations, architecture, training methods, performance gains, and challenges, especially in multimodal contexts.

AI research · DeepSeek · Knowledge Distillation

0 likes · 16 min read

Full-Stack Internet Architecture

Feb 25, 2025 · Artificial Intelligence

Getting Started with Spring AI: Building a Hello‑World Application Using DeepSeek

This tutorial explains what Spring AI is, walks through creating a Spring Boot project with Maven, adding the necessary dependencies, writing a simple controller that forwards user messages to a local DeepSeek model, configuring the application, and testing the AI‑powered endpoint.

AI integration · Chatbot · DeepSeek

0 likes · 10 min read

Java Web Project

Feb 25, 2025 · Artificial Intelligence

How DeepSeek4j 1.4 Solves Spring AI’s Chain‑of‑Thought and Streaming Gaps

The article explains why existing Java AI frameworks struggle with DeepSeek R1’s chain‑of‑thought and streaming features, introduces DeepSeek4j 1.4 as a targeted solution, details its core capabilities, and provides a step‑by‑step guide to integrate it with Spring Boot and Project Reactor.

AI integration · DeepSeek · Java

0 likes · 5 min read

JD Cloud Developers

Feb 25, 2025 · Artificial Intelligence

How to Access Free DeepSeek AI Models on China’s Supercomputing Center

This guide explains how to obtain free API keys for DeepSeek‑R1:7B, 14B, and 32B models from the National Supercomputing Center, walks through the purchase steps, and provides a Python example for calling the models via the provided endpoint.

AI · API · DeepSeek

0 likes · 3 min read

Baobao Algorithm Notes

Feb 25, 2025 · Artificial Intelligence

FlashMLA vs FlashInfer: DeepSeek Inference Performance Benchmarks Revealed

The author benchmarks DeepSeek's FlashMLA against FlashInfer and several Triton-based implementations, detailing setup challenges, decode‑only bandwidth results, and observations that the official DeepSeek version leads while Triton optimizations show mixed performance across different head sizes.

AI · DeepSeek · FlashMLA

0 likes · 6 min read

Efficient Ops

Feb 25, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: A Step‑by‑Step Guide for AI Enthusiasts

This guide explains what DeepSeek R1 is, compares its full and distilled versions, details hardware requirements for Linux, Windows, and macOS, and provides step‑by‑step instructions for local deployment using Ollama, LM Studio, Docker, and visual interfaces like Open‑WebUI and Dify.

AI model · DeepSeek · Dify

0 likes · 9 min read

Tencent Cloud Developer

Feb 25, 2025 · Artificial Intelligence

Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide

This guide walks developers through the full lifecycle of using DeepSeek—choosing the right deployment method (API, local machine, or private cloud), selecting model sizes based on hardware, configuring Tencent Cloud services, building AI applications, and integrating the model into development tools and mini‑programs.

AI application development · AI model deployment · API integration

0 likes · 12 min read

CSS Magic

Feb 25, 2025 · Artificial Intelligence

Two Simple Ways to Access DeepSeek API for Free

This guide shows how to obtain free DeepSeek API access through GitHub Models and SiliconFlow, detailing the required API base URL, key, and model name, how to register, create keys, verify usage with a web chat tool, and compare model choices and platform limits.

API · DeepSeek · Free access

0 likes · 7 min read

DevOps

Feb 24, 2025 · Artificial Intelligence

AI‑Powered Full‑Stack Development with DeepSeek and ClinePRO: A 12× Efficiency Case Study

During the Chinese New Year break, the author used DeepSeek and AISE ClinePRO to build a complete full‑stack product in only 20 hours, demonstrating a twelve‑fold productivity boost over traditional development while showcasing AI‑driven code generation, multilingual support, automated documentation, and DevOps integration.

AI coding · ClinePRO · DeepSeek

0 likes · 17 min read

AI Algorithm Path

Feb 24, 2025 · Artificial Intelligence

Flash-MLA: Boosting LLM Inference Speed on Nvidia Hopper GPUs

Flash-MLA is an open‑source decoding kernel for Multi‑head Latent Attention (MLA), optimized for Nvidia Hopper GPUs. MLA compresses the KV cache, cutting memory usage by up to 93.3%, and the kernel delivers up to 580 TFLOPS of compute, dramatically accelerating large‑language‑model inference while lowering cost.

DeepSeek · Flash-MLA · GPU optimization

0 likes · 8 min read

21CTO

Feb 24, 2025 · Artificial Intelligence

From Transformers to DeepSeek-R1: Evolution of Large Language Models

This article chronicles the rapid development of large language models since the 2017 introduction of the Transformer architecture, covering BERT, the GPT series, multimodal systems, and the cost‑effective DeepSeek‑R1, and highlights key innovations, scaling trends, alignment techniques, and their transformative impact across AI research and industry.

AI evolution · DeepSeek · LLM History

0 likes · 23 min read

Alibaba Cloud Native

Feb 24, 2025 · Cloud Native

Build a Real‑Time AI Search‑Enabled Q&A System with Higress and DeepSeek

This guide shows how open‑source LLMs like DeepSeek can power cost‑effective intelligent Q&A services, and how the cloud‑native Higress API gateway adds real‑time web search, routing, security, and observability to create a production‑grade solution in just a few steps.

DeepSeek · Higress · LLM

0 likes · 6 min read

Baobao Algorithm Notes

Feb 24, 2025 · Artificial Intelligence

How to Build a Breakfast Shop AI Agent with Baidu Wenxin and DeepSeek R1

This article provides a step‑by‑step guide to creating a breakfast‑shop reception AI agent on Baidu's Wenxin Intelligent Agent platform, highlighting its core features, model selection with DeepSeek R1, and practical tips for configuring personas, knowledge bases, and plugins.

AI Agent · Baidu Wenxin · DeepSeek

0 likes · 7 min read

Full-Stack Internet Architecture

Feb 24, 2025 · Artificial Intelligence

Deploying the DeepSeek Large Language Model Locally with Ollama on Windows

This guide explains how to install Ollama on a Windows machine, configure its environment, and use it to download and run the DeepSeek‑R1 1.5B large language model locally, enabling offline AI interactions without relying on remote servers.

AI model deployment · DeepSeek · Ollama

0 likes · 4 min read

Selected Java Interview Questions

Feb 24, 2025 · Artificial Intelligence

Deploying Ollama on Windows and Linux and Integrating with SpringBoot

This guide explains how to download, install, and configure Ollama on Windows and Linux, set up environment variables, select a DeepSeek model, and call the Ollama API from a SpringBoot application with example code snippets.

API · DeepSeek · Deployment

0 likes · 6 min read

Architects' Tech Alliance

Feb 24, 2025 · Artificial Intelligence

NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington

The NSA (Native Sparse Attention) mechanism introduces a hardware‑optimized sparse attention architecture with three branches (token compression, token selection, and sliding window), combined with learnable gating to balance global and local context, dramatically improving inference speed and efficiency for long‑context large language models.

AI Architecture · DeepSeek · Large Language Models

0 likes · 5 min read

Huawei Cloud Developer Alliance

Feb 24, 2025 · Artificial Intelligence

Generate Game Code Instantly with DeepSeek V3 on Huawei Cloud

This tutorial walks you through configuring a Huawei Cloud host, installing the AutoGen framework, setting up DeepSeek V3 model API keys, and using the model to automatically generate Python code for a graphical two‑player battle game, complete with step‑by‑step instructions and sample commands.

AI Code Generation · AutoGen · DeepSeek

0 likes · 9 min read

AI Large Model Application Practice

Feb 24, 2025 · Artificial Intelligence

How Web Agents Combine LLMs and Browser Automation to Perform Real‑World Tasks

This article explains what Web Agents are, their ReAct‑style reasoning loop, key implementation technologies such as observation parsing, multimodal models, and browser control tools like Selenium and Playwright, and demonstrates building a DeepSeek‑powered Web Agent with the Browser‑use framework, including code samples and performance insights.

DeepSeek · LLM · Playwright

0 likes · 11 min read

Alibaba Cloud Developer

Feb 24, 2025 · Artificial Intelligence

How to Build a Local Chatbot with Web Search Using DeepSeek, Ollama, and Dify

Learn how to create a locally hosted chatbot powered by DeepSeek R1 32B using Ollama and Docker, integrate Dify for model management, and add web‑search capability through SearXNG, covering environment setup, search logic, content extraction, testing, and optimization tips.

Chatbot · DeepSeek · Dify

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

Feb 24, 2025 · Artificial Intelligence

How to Distill and Fine‑Tune DeepSeek R1 with Qwen on Alibaba Cloud PAI

This guide walks you through the complete workflow of preparing instruction data, deploying the DeepSeek‑R1 teacher model, using Alibaba Cloud PAI to generate teacher responses, distilling a smaller Qwen2.5‑7B‑Instruct student model, fine‑tuning it, and deploying the final service, with performance comparisons on several math‑reasoning benchmarks.

Alibaba Cloud PAI · DeepSeek

0 likes · 17 min read

Java Web Project

Feb 23, 2025 · Artificial Intelligence

Build Your First AI Chatbot with Spring Boot and DeepSeek LLM

This guide walks you through creating a Spring Boot project, configuring DeepSeek's large language model via SiliconFlow, setting up OpenAI‑compatible parameters, and implementing a REST controller that returns weather forecasts using the model, complete with step‑by‑step code snippets, configuration files, and deployment instructions.

AI · Chatbot · DeepSeek

0 likes · 7 min read

Open Source Linux

Feb 23, 2025 · Artificial Intelligence

How Chinese Universities Are Rapidly Deploying DeepSeek AI Models on Campus

After a winter break surge, DeepSeek AI models have been swiftly adopted across Chinese universities, enabling local deployments for teaching, research, and campus services, while facing bans and security concerns abroad, highlighting both rapid domestic integration and international challenges.

AI models · Artificial Intelligence · China

0 likes · 13 min read

Su San Talks Tech

Feb 23, 2025 · Artificial Intelligence

How DeepSeek’s Distillation Breaks AI Model Limits: Core Principles & Performance

This article explores DeepSeek’s cutting‑edge distillation technology, detailing its definition, underlying principles, innovative data‑model fusion, architecture choices, training strategies, performance gains over large language models, and the remaining challenges in knowledge transfer and multimodal data processing.

DeepSeek · Knowledge Distillation · Large Language Models

0 likes · 16 min read

Model Perspective

Feb 22, 2025 · Artificial Intelligence

Why DeepSeek Is Gaining Traction Beyond ChatGPT: Insights from the Global Developers Conference

The article examines DeepSeek’s surge in popularity by analyzing its timely release, cost‑effective performance, open‑source approach, and broader AI ecosystem trends, while also sharing expert predictions and practical coding tool recommendations for developers.

AI predictions · AI trends · DeepSeek

0 likes · 5 min read

macrozheng

Feb 22, 2025 · Artificial Intelligence

Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained

This guide compares DeepSeek‑R1’s 1.5B/7B/8B, 14B/32B, and 70B/671B versions, detailing their characteristics, typical applications, and the specific CPU, memory, and GPU specifications required for local deployment, helping you select the optimal model for your resources.

AI model deployment · DeepSeek · Hardware Requirements

0 likes · 7 min read

Rare Earth Juejin Tech Community

Feb 22, 2025 · Artificial Intelligence

Deploying DeepSeek Locally with Ollama, Building Personal and Organizational Knowledge Bases, and Integrating with Spring AI

This guide explains how to locally deploy the DeepSeek large‑language model using Ollama on Windows, macOS, and Linux, configure model storage and CORS, build personal and enterprise RAG knowledge bases with AnythingLLM and Open WebUI, and integrate the model into a Spring AI application via Docker and Docker‑Compose.

Containerization · DeepSeek · Knowledge Base

0 likes · 16 min read

Architect

Feb 21, 2025 · Artificial Intelligence

DeepSeek Model Innovations: Architecture, Training Methods, and Performance Evaluation

This article reviews DeepSeek's recent breakthroughs, including the MLA attention redesign, GRPO alignment algorithm, MoE enhancements, multi‑stage training pipelines (SFT, RL, preference tuning, distillation), and comparative performance against GPT‑4o‑Mini and Llama 3.1, highlighting both strengths and remaining challenges.

DeepSeek · Mixture of Experts · architecture

0 likes · 16 min read

Alibaba Cloud Developer

Feb 21, 2025 · Artificial Intelligence

Build a Plain‑Explanation AI Agent with DeepSeek‑R1: Prompt Templates & SVG Tips

This article introduces the “Plain Explanation Expert” AI agent built on DeepSeek‑R1, explains its prompt framework—including role, skills, and output format—demonstrates usage through direct prompt copying and smart‑agent configuration in tools like Cherry Studio, and provides concrete examples, memory tricks, and SVG visualizations to help users quickly master complex concepts.

AI prompting · DeepSeek · SVG visualization

0 likes · 15 min read

Alibaba Cloud Infrastructure

Feb 21, 2025 · Artificial Intelligence

Deploying DeepSeek R1 Model Inference on ACK Edge with Virtual Nodes and Serverless GPU

This article explains how to use Alibaba Cloud ACK Edge to manage on‑premise GPU resources and seamlessly fall back to cloud‑based ACS Serverless GPU via virtual nodes for deploying DeepSeek R1 inference, covering environment preparation, model download, storage setup, custom scheduling, and scaling strategies.

ACK@Edge · DeepSeek · GPU

0 likes · 16 min read

ByteDance Cloud Native

Feb 21, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1‑Distill on Volcengine CPU Cloud for Low‑Cost AI Inference

This guide walks you through deploying the DeepSeek‑R1‑Distill model on Volcengine CPU ECS instances, covering use‑case scenarios, recommended server types, Docker setup, environment configuration, and verification steps to achieve cost‑effective, high‑compatibility AI inference.

AI model deployment · CPU inference · DeepSeek

0 likes · 6 min read

Top Architect

Feb 21, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑based framework that overcomes Spring AI’s limitations by fully preserving DeepSeek’s chain‑of‑thought and billing features, adding reactive streaming, providing Spring Boot starter integration, and offering quick‑start code samples and configuration guidance.

AI · DeepSeek · Java

0 likes · 8 min read

Selected Java Interview Questions

Feb 21, 2025 · Artificial Intelligence

Integrating DeepSeek Large Model with Spring AI: A Step-by-Step Guide

This article explains how to integrate DeepSeek's large language models into a Spring AI application, covering model selection, API key configuration, URL setup, dependency inclusion, and providing complete Java code examples for both synchronous and streaming chat interactions.

Backend Integration · DeepSeek · Java

0 likes · 5 min read

Data Thinking Notes

Feb 20, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 671B Model Locally with Ollama: A Step‑by‑Step Guide

This article provides a comprehensive tutorial on locally deploying the 671‑billion‑parameter DeepSeek R1 model using Ollama, covering model selection, hardware requirements, dynamic quantization, detailed installation steps, performance observations, and practical recommendations for consumer‑grade hardware.

AI model optimization · DeepSeek · Dynamic Quantization

0 likes · 14 min read

dbaplus Community

Feb 20, 2025 · Artificial Intelligence

Can DeepSeek AI Replace DBA Tasks? Real-World Database Scenarios Tested

This article examines DeepSeek, a Chinese AGI‑focused AI model, and demonstrates how prompt engineering can enable it to assist database architects, development DBAs, and operations DBAs across various real‑world scenarios, while also discussing its broader impact on individuals, vendors, and enterprises.

AI for DBAs · Database Architecture · DeepSeek

0 likes · 10 min read

Top Architect

Feb 20, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

This guide explains how to download, quantize, and run the full‑size 671‑billion‑parameter DeepSeek R1 model on local hardware using Ollama, covering model selection, hardware requirements, step‑by‑step deployment commands, optional web UI setup, performance observations, and practical recommendations.

AI · DeepSeek · Dynamic Quantization

0 likes · 16 min read

Alibaba Cloud Developer

Feb 20, 2025 · Artificial Intelligence

Build an Elasticsearch AI Assistant with DeepSeek‑R1 in 1 Minute

This guide shows how to quickly integrate Alibaba Cloud's AI Search platform and the DeepSeek‑R1 large model with Elasticsearch to create a smart AI Assistant that automates cluster diagnostics, query generation, and visual analytics for operations tasks.

AI Assistant · DeepSeek · Elasticsearch

0 likes · 7 min read

Alibaba Cloud Infrastructure

Feb 20, 2025 · Artificial Intelligence

Deploying DeepSeek‑R1 Large Language Model on Knative with GPU A10

This guide explains how to deploy the DeepSeek‑R1 large language model on a Knative platform using an A10 GPU, covering preparation, service creation with appropriate annotations, YAML configuration, verification via curl, custom domain setup, and optional personal AI assistant deployment.

AI · DeepSeek · Deployment

0 likes · 8 min read

Baobao Algorithm Notes

Feb 20, 2025 · Industry Insights

How DeepSeek R1 Is Redefining Large‑Model Engineer Roles and the AI Job Market

The article analyzes DeepSeek R1’s release, showing how rising base‑model thresholds, a shift toward infrastructure‑centric skills, and the rise of retrieval‑augmented generation are rapidly diminishing traditional large‑model algorithm engineer positions while reshaping the broader AI industry landscape.

AGI · AI Industry · DeepSeek

0 likes · 6 min read

Tencent Cloud Developer

Feb 20, 2025 · Artificial Intelligence

Build Your Own Private Knowledge Base with Cloud Studio DeepSeek R1 in Minutes

This guide explains what a knowledge base and Retrieval‑Augmented Generation (RAG) are, why personal knowledge bases are valuable, and provides step‑by‑step instructions for using Cloud Studio's DeepSeek‑R1 CPU template to set up and query a private knowledge base with Open‑WebUI or AnythingLLM.

AI Tutorial · Cloud Studio · DeepSeek

0 likes · 8 min read

Su San Talks Tech

Feb 20, 2025 · Artificial Intelligence

Generate AI‑Powered PPTs in WPS with DeepSeek – Step‑by‑Step Guide

This guide shows how to use the built‑in DeepSeek integration in the latest WPS version to create AI‑generated PowerPoint presentations, covering installation, activation, prompt design, template selection, custom template upload, and final editing, all without extra software or API keys.

AI PPT · DeepSeek · WPS

0 likes · 5 min read

Java Architect Essentials

Feb 19, 2025 · Artificial Intelligence

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys, and using the AI-powered code assistant within IntelliJ IDEA, including code completion, explanation, and question‑answer features, with usage statistics and sample code.

AI code assistant · CodeGPT · DeepSeek

0 likes · 9 min read

Data Thinking Notes

Feb 19, 2025 · Artificial Intelligence

DeepSeek Evolution: Key Technical Highlights from V1 to R1

This article examines DeepSeek’s various versions, detailing their core modules, underlying principles, architecture diagrams, and performance metrics, while illustrating the internal logic and advantages of each model to guide enthusiasts, professionals, and practitioners toward deeper AI innovation insights.

AI · DeepSeek · model architecture

0 likes · 4 min read

Architect's Alchemy Furnace

Feb 19, 2025 · Artificial Intelligence

How DeepSeek Beats GPT-4 with 10× Less Compute: Inside the AI Efficiency Revolution

This article examines DeepSeek's breakthrough AI techniques—including a revamped MoE architecture, aggressive data distillation, ultra‑low‑energy training, novel multi‑stage training strategies, and custom AI chips—that enable a 7B model to rival GPT‑4 while consuming a fraction of the resources.

AI efficiency · Data distillation · DeepSeek

0 likes · 9 min read

AIWalker

Feb 19, 2025 · Artificial Intelligence

DeepSeek’s NSA Attention Cuts Inference Time 11× – CEO Liang Co‑author

DeepSeek introduces the NSA sparse attention mechanism, combining dynamic hierarchical sparsity, coarse token compression and fine token selection to achieve up to 11.6× faster inference, lower pre‑training cost, and superior benchmark performance across general, long‑context, and chain‑of‑thought tasks.

DeepSeek · LLM optimization · NSA

0 likes · 9 min read

Architect's Alchemy Furnace

Feb 19, 2025 · Artificial Intelligence

DeepSeek’s Self‑Correction: Transforming AI Reliability and Safety

The article explores DeepSeek’s innovative self‑correction system—combining a Mixture‑of‑Experts architecture with reinforcement‑learning feedback—to achieve real‑time error detection, dynamic knowledge‑graph updates, and enhanced safety in high‑risk fields like autonomous driving and medical diagnostics.

AI safety · DeepSeek · Mixture of Experts

0 likes · 9 min read

Alibaba Cloud Native

Feb 19, 2025 · Cloud Native

Engineering Traffic Management for DeepSeek: Cloud‑Native Deployment Strategies

This article outlines practical cloud‑native deployment options for DeepSeek models, explains common engineering challenges such as traffic spikes, latency, security, and quota control, and provides detailed AI‑gateway solutions (fallback, content safety, API key management, gray‑release routing, caching, and observability) to ensure reliable large‑model applications.

DeepSeek · Model Deployment · traffic management

0 likes · 9 min read

Alibaba Cloud Developer

Feb 19, 2025 · Artificial Intelligence

How to Replicate DeepSeek‑R1’s Thought Process on Claude 3.5 Sonnet with Prompt Engineering

The article explains how to use prompt‑engineering techniques on Claude 3.5 Sonnet to mimic DeepSeek‑R1’s transparent reasoning, detailing background, prompt design, iterative optimization, and the broader impact on AI communication and user expression.

AI reasoning · Claude · DeepSeek

0 likes · 25 min read

Java Tech Enthusiast

Feb 19, 2025 · Artificial Intelligence

AI Agent Development Guide: Building Intelligent Agents with Coze Platform

The guide explains how to build AI agents—digital labor forces that follow instructions, plan tasks, and use tools—using ByteDance’s no‑code Coze platform, outlining a 3‑phase, 10‑step framework, emphasizing business‑first design, tool integration, and concise, scenario‑driven development with real‑world case studies.

AI Agent · Agent Development Framework · Coze Platform

0 likes · 7 min read

Code Mala Tang

Feb 19, 2025 · Artificial Intelligence

Compute Power’s Role in the AI Race: Insights from Grok 3, DeepSeek & the Post‑Training Era

The article analyzes how massive compute resources drive AI breakthroughs, highlighting Grok 3's top‑tier performance, DeepSeek's efficient engineering under constraints, and the emerging post‑training paradigm that reshapes competition among major AI players.

AI scaling · DeepSeek · Grok-3

0 likes · 7 min read

Architects' Tech Alliance

Feb 19, 2025 · Industry Insights

Why DeepSeek One‑Stop AI Machines Are Redefining Private Model Deployment

The surge in demand for private AI deployment has prompted multiple vendors to launch DeepSeek one‑stop machines—integrated hardware solutions that support the full DeepSeek model family, offering higher stability, easier setup, customization, cost savings, and data security across diverse industry scenarios.

AI hardware · AI infrastructure · DeepSeek

0 likes · 7 min read

Alibaba Cloud Big Data AI Platform

Feb 19, 2025 · Artificial Intelligence

Build a DeepSeek AI Assistant with PAI‑RAG: Internet Search & Enterprise Knowledge Base

This guide walks you through using Alibaba Cloud's PAI‑RAG platform to deploy a DeepSeek large‑language‑model assistant that combines real‑time web search with an enterprise knowledge‑base, covering deployment, network‑search configuration, testing, and advanced enterprise features.

AI Assistant · DeepSeek · Enterprise Knowledge Base

0 likes · 10 min read

Tencent Cloud Developer

Feb 19, 2025 · Industry Insights

Why Every Enterprise Needs a Knowledge‑Management System in the LLM Era

The article analyzes how the shift from data‑driven to knowledge‑driven operations, powered by large language models like DeepSeek, forces companies to build dynamic knowledge‑management platforms that integrate personal and corporate knowledge, improve efficiency, and create sustainable competitive advantage.

DeepSeek · Enterprise AI · Large Language Models

0 likes · 14 min read
