Tagged articles

584 articles

Page 3 of 6

May 12, 2025 · Artificial Intelligence

Introducing Spring AI: Building a Simple Chat Application with DeepSeek

This article introduces Spring AI, explains its core features for integrating various AI models, and walks through creating a Spring Boot chat application that connects to the DeepSeek model using both synchronous and streaming endpoints.

AI integrationChatbotDeepSeek

0 likes · 7 min read

Introducing Spring AI: Building a Simple Chat Application with DeepSeek

Linux Kernel Journey

May 8, 2025 · Artificial Intelligence

How Tencent’s TRMT Tech Delivered a Huge Speedup to DeepSeek’s Large‑Model Network

DeepSeek engineers highlighted Tencent’s open‑source TRMT and DeepEP contributions that boost GPU‑to‑GPU communication by up to 300%, double RoCE performance and add a further 30% gain on InfiniBand, while addressing lane‑utilization and CPU‑control bottlenecks through three targeted optimizations.

DeepEPDeepSeekGPU communication

0 likes · 6 min read

How Tencent’s TRMT Tech Delivered a Huge Speedup to DeepSeek’s Large‑Model Network

JD Tech

May 8, 2025 · Artificial Intelligence

The Emerging Boom of Large Model Applications and Why 2025 Will Be the Turning Point

Amid the AI wave, large language models like DeepSeek R1 are poised to explode by 2025, driven by open-source, low-cost access and superior reasoning, with successful deployment requiring four key factors—domain expertise, knowledge bases, robust search, and engineered agent architectures—to unlock value beyond simple chat.

2025AI applicationsAgent Architecture

0 likes · 10 min read

The Emerging Boom of Large Model Applications and Why 2025 Will Be the Turning Point

DevOps

May 5, 2025 · Artificial Intelligence

DeepSeek Releases Math‑Specialized Large Model V2 and ProverBench Evaluation Suite

DeepSeek has quietly open‑sourced a new mathematics‑focused large language model, DeepSeek‑Prover‑V2 (available in 671B and 7B variants), achieving 88.9% on MiniF2F and strong results on PutnamBench, alongside the high‑quality ProverBench dataset and a novel recursive theorem‑proving pipeline.

AIDeepSeekLarge Language Model

0 likes · 4 min read

DeepSeek Releases Math‑Specialized Large Model V2 and ProverBench Evaluation Suite

ITPUB

May 5, 2025 · Operations

Turn Zabbix Alerts into an AI‑Powered Personal Assistant

This guide shows how to integrate Zabbix with a locally deployed DeepSeek large language model via Webhook, enabling automatic analysis of alert causes and solutions, feeding results back to operators through dashboards or enterprise WeChat, and dramatically reducing MTTR and manual effort.

AIAlert AutomationDeepSeek

0 likes · 5 min read

Turn Zabbix Alerts into an AI‑Powered Personal Assistant

Architects' Tech Alliance

May 2, 2025 · Artificial Intelligence

DeepSeek‑Prover‑V2‑671B: A Massive AI Model for Formal Mathematical Theorem Proving

DeepSeek‑Prover‑V2‑671B, a 671 billion‑parameter AI model released on Hugging Face, dramatically advances formal mathematical theorem proving with MoE architecture, FP8 quantization, 163 k token context, superior performance over GPT‑4 Turbo and other models, and broad implications for research and industry.

AIDeepSeekFP8 quantization

0 likes · 11 min read

DeepSeek‑Prover‑V2‑671B: A Massive AI Model for Formal Mathematical Theorem Proving

Data Thinking Notes

Apr 27, 2025 · Artificial Intelligence

Step‑by‑Step MCP Demo: Build Server and Claude/DeepSeek Clients

This guide walks developers through creating a complete MCP application, covering the workflow, server setup with Python, debugging tools, and client implementation using both Claude and DeepSeek models, complete with code snippets, environment configuration, and testing procedures to demonstrate end‑to‑end LLM tool integration.

ClaudeDeepSeekLLM

0 likes · 10 min read

Step‑by‑Step MCP Demo: Build Server and Claude/DeepSeek Clients

Baobao Algorithm Notes

Apr 27, 2025 · Artificial Intelligence

How DeepSeek R1T‑Chimera Cuts Tokens by 40% Without Fine‑Tuning

The DeepSeek‑R1T‑Chimera model merges DeepSeek‑R1 reasoning with V3‑0324 architecture, reusing most V3 weights and swapping only the blue‑highlighted R1 routing experts, achieving the same intelligence as R1 while reducing output tokens by about 40% and running faster, all without any fine‑tuning or distillation.

Artificial IntelligenceDeepSeekLLM

0 likes · 5 min read

How DeepSeek R1T‑Chimera Cuts Tokens by 40% Without Fine‑Tuning

Baobao Algorithm Notes

Apr 27, 2025 · Artificial Intelligence

How Model Fusion Cut LLM Chain‑of‑Thought Length by 40% Without Fine‑Tuning

A small tech firm, tngtech, released an open‑source model fusion called DeepSeek‑R1T‑Chimera that merges R1 inference with V3‑0324 without fine‑tuning, distillation, or prompts, achieving the same intelligence as R1 while reducing token output by 40% and speeding up inference.

Artificial IntelligenceDeepSeekLLM

0 likes · 4 min read

How Model Fusion Cut LLM Chain‑of‑Thought Length by 40% Without Fine‑Tuning

Big Data Tech Team

Apr 23, 2025 · Industry Insights

10 Powerful Ways DeepSeek Transforms Data Warehousing

DeepSeek leverages AI to automate multi‑source integration, data cleaning, warehouse modeling, real‑time processing, governance, metadata management, reporting, cloud scaling, and decision support, offering twelve distinct use cases that boost efficiency, intelligence, and scalability of modern data warehouses.

AIData WarehouseDeepSeek

0 likes · 9 min read

10 Powerful Ways DeepSeek Transforms Data Warehousing

Data Thinking Notes

Apr 22, 2025 · Artificial Intelligence

How DeepSeek AI is Transforming Agriculture, Manufacturing, Finance, Healthcare, and Education

The Zhejiang University IT Center report highlights DeepSeek's AI technology across more than 40 real‑world cases in agriculture, manufacturing, finance, healthcare and education, demonstrating data‑driven, intelligent solutions that accelerate industry transformation and modernization.

AI applicationsData-driven transformationDeepSeek

0 likes · 3 min read

How DeepSeek AI is Transforming Agriculture, Manufacturing, Finance, Healthcare, and Education

Big Data Technology & Architecture

Apr 22, 2025 · Artificial Intelligence

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

This article explains the fundamentals of Retrieval‑Augmented Generation, demonstrates how to create and query vector indexes using StarRocks, shows how DeepSeek provides embeddings and answer generation, and walks through a complete end‑to‑end RAG pipeline with code examples and a web UI.

AIDeepSeekEmbedding

0 likes · 20 min read

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

Big Data Tech Team

Apr 21, 2025 · Industry Insights

8 Practical Ways DeepSeek Boosts Data Quality for Better Governance

This guide outlines eight concrete methods DeepSeek uses to improve data quality—including automated cleaning, validation, classification, monitoring, governance standards, anomaly detection, integration, and intelligent analysis—providing actionable steps for organizations to enhance data accuracy, completeness, consistency, and usability.

Data cleaningDeepSeekdata integration

0 likes · 5 min read

8 Practical Ways DeepSeek Boosts Data Quality for Better Governance

dbaplus Community

Apr 21, 2025 · Operations

Turn Zabbix Alerts into AI‑Powered Insights with DeepSeek

This guide shows how to integrate Zabbix with a locally deployed DeepSeek large language model via Webhook, enabling automatic analysis of alerts, generation of root‑cause explanations and remediation suggestions, and delivering results through WeChat bots, dashboards, or email to reduce MTTR and manual effort.

AI OpsAlert AutomationDeepSeek

0 likes · 4 min read

Turn Zabbix Alerts into AI‑Powered Insights with DeepSeek

AIWalker

Apr 17, 2025 · Artificial Intelligence

Unveiling DeepSeek’s Janus Series: Decoupled Visual Encoding for Unified Multimodal Understanding and Generation

This article provides an in‑depth analysis of DeepSeek’s Janus and Janus‑Pro models, explaining how decoupling visual encoding resolves the conflict between multimodal understanding and generation, detailing training stages, data scaling, architectural choices, and presenting extensive benchmark results that demonstrate significant performance gains.

DeepSeekJanusModel Scaling

0 likes · 23 min read

Unveiling DeepSeek’s Janus Series: Decoupled Visual Encoding for Unified Multimodal Understanding and Generation

Architects' Tech Alliance

Apr 15, 2025 · Industry Insights

Why DeepSeek’s Private Deployment Is Fueling the AI Model Appliance Market

The article analyzes DeepSeek’s private‑deployment solutions, detailing selection criteria, deployment forms, service models, hardware‑software cost breakdown, technical innovations that lower model and compute barriers, and their impact on government and enterprise AI adoption.

AIDeepSeekHardware Requirements

0 likes · 11 min read

Why DeepSeek’s Private Deployment Is Fueling the AI Model Appliance Market

Big Data Tech Team

Apr 14, 2025 · Industry Insights

How DeepSeek AI is Transforming Data Warehouses: From Automation to Real‑Time Insights

DeepSeek leverages large‑model AI to automate requirement analysis, intelligent modeling, performance tuning, and value extraction in data warehouses, addressing low development efficiency, high O&M cost, latency, and lack of intelligence while showcasing concrete use‑case results across finance, e‑commerce, and manufacturing.

AIData WarehouseDeepSeek

0 likes · 9 min read

How DeepSeek AI is Transforming Data Warehouses: From Automation to Real‑Time Insights

Open Source Linux

Apr 14, 2025 · Artificial Intelligence

How to Deploy DeepSeek Locally: Step‑by‑Step Guide for Offline AI

This guide compares DeepSeek’s local and online versions, outlines hardware and privacy advantages of offline deployment, and provides a detailed step‑by‑step tutorial—including Ollama installation, model selection, command execution, and UI plugin setup—to help users run DeepSeek on their own machines.

AI modelDeepSeekOllama

0 likes · 6 min read

How to Deploy DeepSeek Locally: Step‑by‑Step Guide for Offline AI

Architects' Tech Alliance

Apr 13, 2025 · Artificial Intelligence

Deploying DeepSeek LLMs On-Premises: Step‑by‑Step Guide and Hardware Sizing

This article provides a comprehensive technical guide for privately deploying DeepSeek large language models, covering model and runtime parameter selection, hardware sizing calculations, software stack preparation, inference service setup, performance tuning, and security monitoring considerations.

AI hardware sizingDeepSeekInference Optimization

0 likes · 14 min read

Deploying DeepSeek LLMs On-Premises: Step‑by‑Step Guide and Hardware Sizing

Data Thinking Notes

Apr 13, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

This guide walks you through the fundamentals of Retrieval‑Augmented Generation, introduces the open‑source RAGFlow framework, details installation steps, shows how to integrate DeepSeek LLMs, and explores practical application scenarios such as intelligent customer service and enterprise document QA.

AIDeepSeekLLM

0 likes · 11 min read

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

AI Algorithm Path

Apr 13, 2025 · Artificial Intelligence

Understanding GRPO: Group Relative Policy Optimization for LLM Training

The article explains GRPO, a reinforcement‑learning algorithm that extends PPO with group sampling, no value network, dual penalties and KL regularisation, showing how it improves efficiency and stability when fine‑tuning large language models such as DeepSeek‑Math and DeepSeek‑R1.

DeepSeekGRPOPPO

0 likes · 6 min read

Understanding GRPO: Group Relative Policy Optimization for LLM Training

Fun with Large Models

Apr 12, 2025 · Artificial Intelligence

Build a No‑Code Travel‑Planning AI Assistant with VS Code, Cline, and Gaode MCP Server

This guide walks through setting up VS Code, installing the Cline plugin, configuring a Gaode Map MCP Server API key, and using the DeepSeek model to generate a personalized park‑recommendation agent and a visual HTML page, while also explaining the stdio‑based communication between Cline and the MCP Server.

AI agentClineDeepSeek

0 likes · 15 min read

Build a No‑Code Travel‑Planning AI Assistant with VS Code, Cline, and Gaode MCP Server

Big Data Tech Team

Apr 9, 2025 · Artificial Intelligence

12 Powerful Ways DeepSeek Transforms Data Governance

This article outlines twelve practical DeepSeek AI applications for data governance, covering automated classification, dynamic privacy masking, compliance checks, quality monitoring, intelligent integration, lineage analysis, metadata management, smart retrieval, strategy formulation, security risk handling, lifecycle control, and performance evaluation.

AIDeepSeekUse Cases

0 likes · 7 min read

12 Powerful Ways DeepSeek Transforms Data Governance

Data Thinking Notes

Apr 8, 2025 · Artificial Intelligence

How DeepSeek‑R1 Redefines AI: Innovations, Core Mechanics, and Education Applications

This report outlines the innovative advantages of the DeepSeek‑R1 large language model, explains its underlying architecture and operation, details practical prompt engineering techniques, and explores concrete use‑cases that empower education and academic research through advanced AI capabilities.

AIDeepSeekEducation Technology

0 likes · 6 min read

How DeepSeek‑R1 Redefines AI: Innovations, Core Mechanics, and Education Applications

Volcano Engine Developer Services

Apr 8, 2025 · Artificial Intelligence

Which Cloud Platform Delivers the Fastest DeepSeek‑R1 API? A Comprehensive Benchmark

This article aggregates multiple independent evaluations of DeepSeek‑R1 across major cloud providers, comparing accuracy on AIME math problems, token‑per‑second throughput, first‑token latency, stability under high concurrency, and overall service reliability, ultimately highlighting Volcano Engine as the top performer.

AI inferenceAPI performanceDeepSeek

0 likes · 12 min read

Which Cloud Platform Delivers the Fastest DeepSeek‑R1 API? A Comprehensive Benchmark

Alibaba Cloud Developer

Apr 7, 2025 · Artificial Intelligence

Why Does GPU Memory Keep Growing in DeepSeek‑R1 Inference? Uncovering PyTorch’s Cache

After deploying the full‑precision DeepSeek‑R1 model on a 2×8‑GPU ACS cluster, repeated stress tests showed GPU memory usage continuously rising without release; this article details the investigation, reproduces the behavior, examines vLLM logs, Prometheus metrics, and reveals PyTorch’s caching allocator as the root cause, offering mitigation tips.

DeepSeekGPU MemoryMemory Cache

0 likes · 21 min read

Why Does GPU Memory Keep Growing in DeepSeek‑R1 Inference? Uncovering PyTorch’s Cache

Code Mala Tang

Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most eyes remain on familiar AI giants, China’s Alibaba and DeepSeek are unveiling open‑source video and inference models that run on consumer GPUs, sparking a regulatory scramble and threatening the dominance of closed‑source AI, heralding a rapid, disruptive shift across the industry.

AI VideoAI localizationAI regulation

0 likes · 10 min read

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

Alibaba Cloud Developer

Apr 3, 2025 · Artificial Intelligence

Create High‑Quality SVG Illustrations with DeepSeek‑V3 and Claude AI

This guide shows how to use DeepSeek‑V3‑0324 and Claude 3.5/3.7 to generate professional SVG graphics for articles and presentations, explains the impact of model capability and prompt quality, provides ready‑to‑use prompt templates, and demonstrates basic and advanced usage scenarios such as prototype drawing, image re‑drawing, and colorful newspaper‑style visuals.

AI image generationClaudeDeepSeek

0 likes · 15 min read

Create High‑Quality SVG Illustrations with DeepSeek‑V3 and Claude AI

AI Algorithm Path

Apr 2, 2025 · Artificial Intelligence

Vision‑Reasoning Model: Enabling LLMs to See and Think

The article analyzes the limitations of current visual language models and large reasoning models, proposes a combined Vision‑Reasoning Model (VRM), details its architecture using LLaVA, describes end‑to‑end fine‑tuning and reinforcement‑learning reward design, and argues that such models will become the next breakthrough in AI.

DeepSeekLLaVALarge Language Model

0 likes · 9 min read

Vision‑Reasoning Model: Enabling LLMs to See and Think

Java Architect Essentials

Apr 2, 2025 · Backend Development

Integrating DeepSeek Large Language Model with Spring Boot to Build an AI Chat Application

This guide demonstrates how to create a Spring Boot backend that integrates DeepSeek's large language model via the Spring AI OpenAI starter, covering project setup, dependency configuration, API key management, and a sample controller that provides AI-powered chat responses such as weather forecasts.

AI integrationChatbotDeepSeek

0 likes · 8 min read

Integrating DeepSeek Large Language Model with Spring Boot to Build an AI Chat Application

AI Algorithm Path

Apr 2, 2025 · Artificial Intelligence

Master the Three Essential LLM Training Stages for 2025

The article breaks down the three core stages of large‑language‑model training—pre‑training, supervised fine‑tuning, and RLHF—explaining their purpose, methods, and concrete examples while noting DeepSeek‑R1’s recent breakthrough and its implications for AI development.

AI trainingDeepSeekLLM

0 likes · 5 min read

Master the Three Essential LLM Training Stages for 2025

Architects' Tech Alliance

Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeekGPT-4oGemini

0 likes · 9 min read

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

Tencent Technical Engineering

Mar 31, 2025 · Artificial Intelligence

Step-by-Step Guide to Local Training of DeepSeek R1 on Multi‑GPU A100 Systems

This step‑by‑step tutorial shows how to set up CUDA 12.4, install required packages, prepare a JSON dataset and custom reward, troubleshoot out‑of‑memory errors, and launch DeepSeek R1 training on an 8‑GPU A100 cluster using Accelerate, Deepspeed zero‑3 and vLLM configurations.

A100CUDADeepSeek

0 likes · 9 min read

Step-by-Step Guide to Local Training of DeepSeek R1 on Multi‑GPU A100 Systems

Baobao Algorithm Notes

Mar 30, 2025 · Artificial Intelligence

Why Scaling, Data, and Infra Matter More Than Reward Design in R1 Replication

The article analyses two months of community attempts to reproduce DeepSeek R1, highlighting that model scaling, high‑quality data, robust training infrastructure, and careful hyper‑parameter tuning outweigh pure reward‑based tricks, and it outlines common pitfalls and future research directions.

DeepSeekInfrastructureLLM

0 likes · 13 min read

Why Scaling, Data, and Infra Matter More Than Reward Design in R1 Replication

Data Thinking Notes

Mar 30, 2025 · Artificial Intelligence

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

This comprehensive analysis by the Peking University AI Alignment team dissects the technical innovations behind DeepSeek‑R1, DeepSeek‑R1 Zero, and Kimi‑K1.5, covering reinforcement‑learning‑based post‑training, rule‑based rewards, GRPO optimization, scaling laws, multimodal extensions, safety challenges, and future research directions.

AI alignmentDeepSeekKimi

0 likes · 57 min read

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

dbaplus Community

Mar 30, 2025 · Databases

Supercharge Your SQL Workflows with DeepSeek Prompt Templates

This guide presents a comprehensive collection of DeepSeek prompt templates for MySQL, covering SQL generation, optimization, data analysis, database administration, debugging, and advanced features, enabling beginners and seasoned developers alike to craft accurate queries, improve performance, and resolve errors efficiently.

Database OptimizationDeepSeekmysql

0 likes · 11 min read

Supercharge Your SQL Workflows with DeepSeek Prompt Templates

Fun with Large Models

Mar 30, 2025 · Artificial Intelligence

DeepSeek‑V3‑0324 Review: Why This New Chinese LLM Beats the Competition for Agent Development

The article provides a comprehensive evaluation of DeepSeek‑V3‑0324, highlighting its superior inference, coding, and long‑text abilities, benchmark rankings that place it near GPT‑4.5, extensive code‑generation tests, and advanced Function Calling features that make it the preferred model for building AI agents.

DeepSeekagent developmentcode generation

0 likes · 8 min read

DeepSeek‑V3‑0324 Review: Why This New Chinese LLM Beats the Competition for Agent Development

Java Tech Enthusiast

Mar 29, 2025 · Frontend Development

Building a Twitter Image Card Browser Extension with Claude Sonnet and DeepSeek

In a side‑by‑side test on the Trae platform, the author used Claude Sonnet 3.5 to create a functional Twitter‑to‑image‑card browser extension in roughly twenty minutes, while DeepSeek‑R1 required multiple prompt iterations, manual bug fixes, and still produced visual glitches, demonstrating Claude’s superior reliability for frontend plugin generation.

AI Code GenerationClaude SonnetDeepSeek

0 likes · 5 min read

Building a Twitter Image Card Browser Extension with Claude Sonnet and DeepSeek

MaGe Linux Operations

Mar 28, 2025 · Artificial Intelligence

How to Create AI-Generated Videos with Tongyi Wanxiang and DeepSeek: A Step‑by‑Step Guide

This article explains the fundamentals of AI video technology, details the features of Alibaba Cloud's Tongyi Wanxiang platform, demonstrates how to use DeepSeek for script generation, and provides a complete workflow—including code examples—for producing high‑quality AI‑generated videos.

AI video generationDeepSeekJava SDK

0 likes · 24 min read

How to Create AI-Generated Videos with Tongyi Wanxiang and DeepSeek: A Step‑by‑Step Guide

Architects' Tech Alliance

Mar 28, 2025 · Artificial Intelligence

How DeepSeek Leverages Huawei Ascend to Boost AI Inference Efficiency

The report analyzes DeepSeek's latest V3 and R1 models, highlights their scaling‑law‑driven cost reductions, explains how Huawei Ascend optimizes inference by cutting KV‑Cache storage and improving compute efficiency, and surveys the model’s deployments across finance, government, manufacturing, and healthcare sectors.

AI efficiencyAI inferenceDeepSeek

0 likes · 4 min read

How DeepSeek Leverages Huawei Ascend to Boost AI Inference Efficiency

Qborfy AI

Mar 27, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide

This article walks through the entire process of deploying the DeepSeek‑R1 large language model on a personal machine, covering hardware requirements, Ollama installation, model download, service startup, remote access configuration, and visual UI integration with Dify, complete with concrete commands and screenshots.

AIDeepSeekDocker

0 likes · 9 min read

How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide

AI Algorithm Path

Mar 26, 2025 · Artificial Intelligence

DeepSeek V3-0324 Upgrade Delivers Smarter Coding and Higher Code Quality

The DeepSeek V3-0324 model, released on March 24, 2025 with 6.85 trillion parameters and a Mixture‑of‑Experts architecture, is fully open‑source on Hugging Face and brings notable upgrades in coding ability, structured responses, stability, generation length, and speed, while offering performance comparable to leading closed‑source models such as Claude 3.7.

AI Code GenerationCoding AIDeepSeek

0 likes · 10 min read

DeepSeek V3-0324 Upgrade Delivers Smarter Coding and Higher Code Quality

Java Architecture Diary

Mar 26, 2025 · Artificial Intelligence

How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling

The article introduces DeepSeek's new V3-0324 model, highlights its performance gains and new features like function calling and standardized JSON output, demonstrates Chinese and frontend coding tests, provides Java code examples for AI integration, and concludes with a summary of its business impact.

AIChat2BIDeepSeek

0 likes · 6 min read

How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling

Alibaba Cloud Developer

Mar 26, 2025 · Artificial Intelligence

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

DeepSeek, a Chinese AI startup, offers open‑source large language models—DeepSeek‑V3 for general tasks and DeepSeek‑R1 for intensive reasoning—featuring MoE, MLA, low‑cost training, and competitive performance against OpenAI’s GPT‑4o, while providing detailed usage guidance and cost analysis.

AI inferenceDeepSeekcost efficiency

0 likes · 21 min read

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

Aikesheng Open Source Community

Mar 25, 2025 · Databases

ChatDBA vs DeepSeek: AI‑Driven Diagnosis of OceanBase Backup Cluster Tenant Sync Issue (Case Study)

This case study demonstrates how the AI assistant ChatDBA identifies and resolves a tenant data‑synchronization failure in an OceanBase primary‑backup cluster, detailing four interactive troubleshooting rounds, the final SQL fix, and a comparative analysis with the DeepSeek‑R1 model.

AI assistantChatDBADeepSeek

0 likes · 5 min read

ChatDBA vs DeepSeek: AI‑Driven Diagnosis of OceanBase Backup Cluster Tenant Sync Issue (Case Study)

Java Architect Essentials

Mar 25, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Framework for Seamless DeepSeek AI Integration with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑centric framework that overcomes Spring AI’s limitations by preserving DeepSeek’s chain‑of‑thought, supporting streaming output, and offering a simple Spring Boot starter with reactive, configurable, and ready‑to‑use APIs for AI developers.

AIDeepSeekJava

0 likes · 5 min read

DeepSeek4j 1.4: A Java Framework for Seamless DeepSeek AI Integration with Full Chain‑of‑Thought and Streaming Support

Architecture & Thinking

Mar 25, 2025 · Fundamentals

Boost Diagram Creation with DeepSeek: Generate Mermaid Flowcharts, Sequence & Class Diagrams

This guide shows how programmers can leverage the DeepSeek large language model to automatically generate Mermaid code for flowcharts, sequence diagrams, class diagrams, and other visualizations, dramatically reducing manual diagramming effort and improving documentation efficiency.

AIDeepSeekMermaid

0 likes · 10 min read

Boost Diagram Creation with DeepSeek: Generate Mermaid Flowcharts, Sequence & Class Diagrams

DataFunTalk

Mar 24, 2025 · Artificial Intelligence

DeepSeek R1: Open‑Source Reasoning Model and Multi‑Stage Training Insights

The interview explores DeepSeek R1's open‑source weights, its multi‑stage training pipeline—including pre‑training, supervised fine‑tuning, and RLHF—alongside innovations such as self‑consistency, chain‑of‑thought prompting, distillation, MoE architectures, and cost considerations, highlighting its impact on the future of large language models.

AI trainingDeepSeekOpen Source

0 likes · 20 min read

DeepSeek R1: Open‑Source Reasoning Model and Multi‑Stage Training Insights

Fun with Large Models

Mar 24, 2025 · Artificial Intelligence

How to Build a Multi‑Turn Chatbot with DeepSeek’s API in Python

This guide walks you through using DeepSeek’s large‑model API via the OpenAI request format, covering API advantages, key parameters, Python setup, code examples for single and multi‑turn conversations, and a full reference table of request options.

APIChatbotDeepSeek

0 likes · 12 min read

How to Build a Multi‑Turn Chatbot with DeepSeek’s API in Python

Data Thinking Notes

Mar 23, 2025 · Artificial Intelligence

Mastering AI Communication: Prompt Strategies, Deep Thinking, and Productivity Hacks

This guide explains how effective communication with AI—through clear prompts, step‑by‑step task breakdowns, and critical‑thinking techniques—can unlock deeper reasoning, avoid hallucinations, and boost personal productivity across various work scenarios.

AI promptingDeepSeekcritical thinking

0 likes · 18 min read

Mastering AI Communication: Prompt Strategies, Deep Thinking, and Productivity Hacks

Architects' Tech Alliance

Mar 22, 2025 · Industry Insights

What Does DeepSeek’s 2025 AI Report Reveal About the Future of Large Models?

The 2025 DeepSeek Insight report analyzes DeepSeek’s new large‑model releases, compares US and Chinese AI ecosystems, outlines diverse application scenarios such as government, healthcare and aerospace, and provides practical guidance for safely leveraging these models despite their current limitations.

AI industryAI safetyDeepSeek

0 likes · 5 min read

What Does DeepSeek’s 2025 AI Report Reveal About the Future of Large Models?

AsiaInfo Technology: New Tech Exploration

Mar 21, 2025 · Industry Insights

How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations

This article analyzes how DeepSeek’s multimodal large model overcomes traditional digital‑twin simulation bottlenecks through dynamic modeling, generative data augmentation, and low‑cost open‑source architecture, enabling smarter city traffic, industrial design, and water‑resource management while reshaping the industry’s AI‑driven simulation landscape.

AIDeepSeekDigital Twin

0 likes · 22 min read

How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations

Fun with Large Models

Mar 20, 2025 · Artificial Intelligence

Fine‑Tune DeepSeek‑R1 with Just a Few Lines of Code Using Unsloth

This guide walks through setting up an Anaconda environment, installing Unsloth, downloading the DeepSeek‑R1‑Distill‑Llama‑8B model, preparing a medical CoT dataset, configuring LoRA parameters, running a short fine‑tuning job, and evaluating the customized model with structured prompts.

DeepSeekLoRAPython

0 likes · 18 min read

Fine‑Tune DeepSeek‑R1 with Just a Few Lines of Code Using Unsloth

Practical DevOps Architecture

Mar 20, 2025 · Artificial Intelligence

DeepSeek Model Integration Tutorial Series

This collection provides a step‑by‑step tutorial series of sixteen short videos demonstrating how to access, configure, and use the DeepSeek large language model across various office applications such as Word, Excel, PowerPoint, as well as platforms like WPS and online APIs.

AIAPIDeepSeek

0 likes · 5 min read

DeepSeek Model Integration Tutorial Series

Data Thinking Notes

Mar 18, 2025 · Artificial Intelligence

Unlocking DeepSeek‑R1: A Practical Guide to AIGC Tools and Large‑Model Technology

This manual introduces the fundamental concepts of the DeepSeek‑R1 model, explains large‑model and AIGC technologies, and provides practical guidance for selecting and efficiently using AI tools, helping readers grasp the deeper value of DeepSeek and related applications.

AI toolsAIGCDeepSeek

0 likes · 2 min read

Unlocking DeepSeek‑R1: A Practical Guide to AIGC Tools and Large‑Model Technology

Java Tech Enthusiast

Mar 18, 2025 · Artificial Intelligence

Can Apple’s M3 Ultra Mac Studio Run Full‑Scale DeepSeek R1 at 11 Tokens/s?

Early adopters benchmarked the M3 Ultra‑powered Mac Studio running the 671‑billion‑parameter DeepSeek R1 model, achieving around 11 tokens per second in practice (up to 20 tokens/s theoretically), and compared its performance and cost against GPU‑based solutions and the newer M4 Max hardware.

AI inferenceDeepSeekLLM Benchmark

0 likes · 5 min read

Can Apple’s M3 Ultra Mac Studio Run Full‑Scale DeepSeek R1 at 11 Tokens/s?

Architects' Tech Alliance

Mar 17, 2025 · Industry Insights

DeepSeek Integrated Machines: 52 Models, Specs, Prices & Use Cases

This article compiles a market overview of 52 DeepSeek integrated machines, detailing GPU chips, price ranges from tens of thousands to millions, major Chinese cloud vendors, and diverse application scenarios such as intelligent customer service, data processing, and smart governance.

AI hardwareDeepSeekGPU

0 likes · 3 min read

DeepSeek Integrated Machines: 52 Models, Specs, Prices & Use Cases

dbaplus Community

Mar 17, 2025 · Operations

Designing an AI‑Powered Ops Platform with DeepSeek: Architecture, Modules, and Implementation

This article outlines a comprehensive AI‑Ops solution built on DeepSeek, covering its technical architecture, data collection stack, AI engine deployment, key functional modules, implementation roadmap, model training, security design, cost estimates, and risk mitigation strategies for modern operations teams.

AI OpsDeepSeekInfrastructure Automation

0 likes · 7 min read

Designing an AI‑Powered Ops Platform with DeepSeek: Architecture, Modules, and Implementation

Infra Learning Club

Mar 17, 2025 · Artificial Intelligence

Testing OpenManus with DeepSeek: A Hands‑On Evaluation

The author walks through installing OpenManus, configuring it to use DeepSeek (and an Ollama‑based vision model), runs a sample financial data query, and reports that the system is slow, sometimes inaccurate, and still requires further optimization.

AI agentsDeepSeekLLM

0 likes · 5 min read

Testing OpenManus with DeepSeek: A Hands‑On Evaluation

Fun with Large Models

Mar 17, 2025 · Industry Insights

How Chinese Scientists Are Driving the Global AI Race—from DeepSeek to Grok‑3

The article analyzes how Chinese researchers dominate AI research worldwide, detailing their roles in US tech giants, Chinese model teams, talent‑attraction policies in both countries, and the strategic implications of this "internal" competition for the future of artificial intelligence.

AI modelsArtificial IntelligenceChinese Scientists

0 likes · 12 min read

How Chinese Scientists Are Driving the Global AI Race—from DeepSeek to Grok‑3

Selected Java Interview Questions

Mar 15, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Spring Boot Integration for DeepSeek with Full Chain‑of‑Thought and Streaming Support

DeepSeek4j 1.4 introduces a Java‑centric, Spring Boot‑compatible framework that fully preserves DeepSeek's chain‑of‑thought capabilities, adds reactive streaming, and provides simple one‑line API integration, addressing previous limitations in mainstream frameworks and offering ready‑to‑use configuration and code examples.

AI integrationDeepSeekSpring Boot

0 likes · 5 min read

DeepSeek4j 1.4: Java Spring Boot Integration for DeepSeek with Full Chain‑of‑Thought and Streaming Support

Rare Earth Juejin Tech Community

Mar 15, 2025 · Artificial Intelligence

Integrating DeepSeek Large Model with SpringAI in Java Applications

This article provides a concise guide on using SpringAI to connect Java applications with the domestic large‑language model DeepSeek, covering design philosophy, configuration, code examples for chat, streaming, structured output, security hardening, performance tuning, and production best practices.

AI integrationBackend DevelopmentChatClient

0 likes · 9 min read

Integrating DeepSeek Large Model with SpringAI in Java Applications

Open Source Tech Hub

Mar 13, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama

This guide walks you through setting up a private AI knowledge base using Webman AI 5.4.0, deploying Redis‑Stack, installing the illuminate/redis component, adding Ollama with DeepSeek and other embedding models, configuring Redis, importing training data, running the training process, and configuring role prompts for accurate AI responses.

AIDeepSeekOllama

0 likes · 6 min read

Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama

Alibaba Cloud Developer

Mar 13, 2025 · Artificial Intelligence

How Alibaba’s Tongyi Lingma AI Programmer Supercharges Java Development with QwQ‑32B

This article reviews Alibaba Cloud's Tongyi Lingma AI programmer, highlighting its new model selection feature—including DeepSeek V3, R1, Qwen2.5‑Max and the open‑source QwQ‑32B—its impressive benchmark performance, step‑by‑step code generation for a CMS notice module, cross‑language integration with DeepSeek‑R1, and practical developer experiences comparing version 1.0 and 2.0.

AI Code GenerationCross-language programmingDeepSeek

0 likes · 23 min read

How Alibaba’s Tongyi Lingma AI Programmer Supercharges Java Development with QwQ‑32B

Baidu Intelligent Cloud Tech Hub

Mar 12, 2025 · Artificial Intelligence

Build a Business‑Specific DeepSeek Knowledge Base with VectorDB (Code & No‑Code)

This guide explains how to create a proprietary knowledge base for DeepSeek by leveraging Baidu Cloud's VectorDB, offering both code‑first and no‑code approaches to integrate internal data securely and enable accurate AI‑driven Q&A for business applications.

AIDeepSeekKnowledge Base

0 likes · 3 min read

Build a Business‑Specific DeepSeek Knowledge Base with VectorDB (Code & No‑Code)

Java Tech Enthusiast

Mar 12, 2025 · Artificial Intelligence

Open-Source Enterprise Knowledge Base and Intelligent Dialogue Platform Based on Spring Boot and DeepSeek

The open‑source DeepSeek‑Flow‑AI platform combines Spring Boot 3.4 back‑end APIs with a Vue 3 front‑end to deliver an enterprise‑grade knowledge base and intelligent multi‑turn dialogue system, supporting private deployment, role‑based access, analytics, CRM/ERP integration, and easy installation via Maven and Yarn.

AIDeepSeekKnowledge Base

0 likes · 5 min read

Open-Source Enterprise Knowledge Base and Intelligent Dialogue Platform Based on Spring Boot and DeepSeek

Architects' Tech Alliance

Mar 12, 2025 · Artificial Intelligence

How DeepSeek Can Transform Family Education: A Practical Guide

This guide from Tsinghua University’s New Media Research Center systematically explores DeepSeek’s entry methods, interaction strategies, subject‑specific tutoring, emotional support, ethical risks, age‑specific parenting solutions, tool integration, and future educational outlook, offering parents actionable AI‑powered techniques for digital home learning.

AI in EducationDeepSeekEducational Technology

0 likes · 5 min read

How DeepSeek Can Transform Family Education: A Practical Guide

Top Architecture Tech Stack

Mar 12, 2025 · Big Data

DeepSeek: Comprehensive Installation, Configuration, and Usage Guide

This article provides a detailed, step‑by‑step guide to installing, configuring, and using DeepSeek—a powerful command‑line data processing tool—covering basic operations, advanced features, scripting tips, and troubleshooting to help users efficiently import, clean, analyze, and visualize data.

Big DataCLIData Analysis

0 likes · 8 min read

DeepSeek: Comprehensive Installation, Configuration, and Usage Guide

Architects' Tech Alliance

Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

An in‑depth analysis reveals how DeepSeek’s V3 and R1 large‑language models achieve superior inference performance and cost efficiency on Huawei’s Ascend AI platform, detailing architectural optimizations, KV‑Cache reductions, multimodal support, real‑world deployments across finance, government, manufacturing, and the projected impact on the AI industry.

DeepSeekHuawei Ascendai-optimization

0 likes · 4 min read

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

Cognitive Technology Team

Mar 11, 2025 · Artificial Intelligence

Deploying DeepSeek R1:7b Model Locally with Ollama and Building AI Applications Using Dify

This tutorial explains how to set up Ollama for CPU or GPU environments, run the DeepSeek R1:7b large language model, and use the open‑source Dify platform to create and deploy a custom AI application, providing step‑by‑step commands and configuration details.

AIDeepSeekDify

0 likes · 8 min read

Deploying DeepSeek R1:7b Model Locally with Ollama and Building AI Applications Using Dify

NewBeeNLP

Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s New Architecture Redefines LLM Efficiency and Performance

This article analyzes DeepSeek’s recent breakthroughs—including the Multi‑Head Latent Attention (MLA), Group Relative Policy Optimization (GRPO), and a refined Mixture‑of‑Experts design—along with its three‑stage training pipeline, RL‑only R1‑Zero variant, and benchmark comparisons against GPT‑4o‑Mini and Llama 3.1, highlighting both gains and remaining challenges.

DeepSeekLLMMixture of Experts

0 likes · 18 min read

Architect

Mar 10, 2025 · Artificial Intelligence

What Makes DeepSeek’s New Architecture a Game‑Changer? Inside MLA, GRPO, and MoE Innovations

This article analyzes DeepSeek’s latest large‑model breakthroughs, covering the MLA attention compression, GRPO alignment algorithm, MoE load‑balancing redesign, multi‑stage training pipelines, reinforcement‑learning tricks, and performance comparisons with GPT‑4o‑Mini and Llama 3.1, highlighting both strengths and remaining challenges.

AI trainingDeepSeekGRPO

0 likes · 19 min read

What Makes DeepSeek’s New Architecture a Game‑Changer? Inside MLA, GRPO, and MoE Innovations

Baidu Geek Talk

Mar 10, 2025 · Artificial Intelligence

How Baidu Cloud’s AI+ Strategy Powers State‑Owned Enterprises with DeepSeek One‑Box Solutions

The article examines Baidu Cloud’s integration of DeepSeek large‑model hardware, detailing the Baige and Qianfan one‑box systems, their technical specs, deployment speed, and how they enable state‑owned enterprises across energy, manufacturing, and logistics to accelerate AI‑driven digital transformation.

AIBaidu CloudCloud Computing

0 likes · 6 min read

How Baidu Cloud’s AI+ Strategy Powers State‑Owned Enterprises with DeepSeek One‑Box Solutions

AI Frontier Lectures

Mar 10, 2025 · Industry Insights

Why DeepSeek’s Rise Is Shaking China’s AGI Landscape

The article analyzes how DeepSeek’s unexpected success has triggered a strategic rethink across Chinese AI firms, prompting shifts from product‑centric growth to foundational model research, reshaping talent structures at Tencent and ByteDance, and questioning where the true barriers to AGI lie.

AGIChina AIDeepSeek

0 likes · 13 min read

Why DeepSeek’s Rise Is Shaking China’s AGI Landscape

Alibaba Cloud Developer

Mar 10, 2025 · Artificial Intelligence

Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway

Learn how to deploy the new QwQ‑32B inference model alongside DeepSeek‑R1 using the Higress AI gateway, covering environment setup, model configuration, routing, token‑level rate limiting, content safety, semantic caching, and advanced features like automatic fallback and internet‑search integration.

DeepSeekHigressLLM integration

0 likes · 16 min read

Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway

Top Architect

Mar 10, 2025 · Artificial Intelligence

Using DeepSeek to Generate Mermaid Diagrams: Flowcharts, Gantt Charts, and Sequence Diagrams

This guide demonstrates how to leverage the DeepSeek AI model to automatically create Mermaid diagram code for flowcharts, Gantt charts, and sequence diagrams, walk through the required prompts, show the generated code, and compare Mermaid with traditional mind‑mapping tools.

AIDeepSeekFlowchart

0 likes · 6 min read

Using DeepSeek to Generate Mermaid Diagrams: Flowcharts, Gantt Charts, and Sequence Diagrams

Baobao Algorithm Notes

Mar 10, 2025 · Artificial Intelligence

Why DeepSeek V3’s FP8 Training Beats Traditional Schemes: A Deep Dive

This article provides a detailed technical analysis of FP8 training, comparing Nvidia’s TransformerEngine approach with DeepSeek V3’s novel scheme, and examines how block‑wise scaling, high‑precision accumulation, and vector length and correlation affect quantization error and signal‑to‑noise ratio in large‑language‑model training.

DeepSeekFP8LLM

0 likes · 20 min read

Why DeepSeek V3’s FP8 Training Beats Traditional Schemes: A Deep Dive

Spring Full-Stack Practical Cases

Mar 10, 2025 · Artificial Intelligence

Integrate DeepSeek AI with Spring Boot 3: Complete Hands‑On Guide

This article introduces deepseek4j, a Java SDK for DeepSeek AI models, walks through Maven setup, Spring Boot configuration, basic and advanced usage examples—including streaming, synchronous, SSE debugging, and web‑search integration—while detailing key configuration options and best‑practice notes.

AIDeepSeekJava SDK

0 likes · 8 min read

Integrate DeepSeek AI with Spring Boot 3: Complete Hands‑On Guide

CSS Magic

Mar 10, 2025 · Artificial Intelligence

Three Advanced Ways to Harness DeepSeek for Everyone

The article outlines three practical approaches to get the most out of DeepSeek—using it as a conversational assistant, integrating its API to power AI tools such as the Chrome immersive‑translation plugin, and leveraging it for AI‑assisted programming—while comparing the V3 and R1 models and offering concrete configuration steps.

AI programmingAI translationChrome Extension

0 likes · 8 min read

Three Advanced Ways to Harness DeepSeek for Everyone

Java Architect Essentials

Mar 9, 2025 · Backend Development

Building an AI-Powered Chatbot with Spring Boot and DeepSeek

This tutorial demonstrates how to create an AI-driven Spring Boot application by integrating DeepSeek's large language model, covering project setup, dependency configuration, API key management, and implementing a REST controller that provides weather forecasts via a conversational interface.

AIChatbotDeepSeek

0 likes · 8 min read

Building an AI-Powered Chatbot with Spring Boot and DeepSeek

Architects' Tech Alliance

Mar 9, 2025 · Industry Insights

How DeepSeek’s LLMs Slash Training Costs and Reshape China’s Compute Landscape

DeepSeek’s three‑model LLM lineup—V3, R1‑Zero and R1—delivers high performance while cutting training expenses to under $600 k, a fraction of the $0.6‑1 B typical for comparable models, signaling a major shift in China’s AI compute demand and supply chain dynamics.

AI computeChinaDeepSeek

0 likes · 3 min read

How DeepSeek’s LLMs Slash Training Costs and Reshape China’s Compute Landscape

Data Thinking Notes

Mar 9, 2025 · Artificial Intelligence

How DeepSeek R1 Uses Large‑Scale Reinforcement Learning to Rival OpenAI o1

DeepSeek R1, an open‑source large language model, leverages rule‑based, large‑scale reinforcement learning and mixed supervised‑fine‑tuning data to achieve deep reasoning comparable to OpenAI o1, illustrating China’s rapid AI progress, the importance of efficiency, and the democratizing impact of open AI research.

DeepSeekmodel efficiencyopen-source AI

0 likes · 11 min read

How DeepSeek R1 Uses Large‑Scale Reinforcement Learning to Rival OpenAI o1

Architects' Tech Alliance

Mar 9, 2025 · Industry Insights

DeepSeek’s AI Ecosystem: From Core Tech to Market Impact

This article provides a comprehensive analysis of DeepSeek, covering its foundational AI research, technology stack, product offerings, and the broader upstream, midstream, and downstream AI industry landscape, including hardware, server, cloud, and market trends.

AI infrastructureArtificial IntelligenceDeepSeek

0 likes · 13 min read

DeepSeek’s AI Ecosystem: From Core Tech to Market Impact

AI Product Manager Community

Mar 8, 2025 · Artificial Intelligence

Deploy OpenManus Locally and Let It Generate a Complete WeChat Mini‑Program

This article walks through installing OpenManus locally using Python 3.12, cloning its GitHub repository, configuring DeepSeek LLM credentials, launching the service, and prompting the agent to generate a full WeChat mini‑program, while sharing observations on performance, token cost, and limitations.

AI agentDeepSeekLLM

0 likes · 5 min read

Deploy OpenManus Locally and Let It Generate a Complete WeChat Mini‑Program

DataFunTalk

Mar 8, 2025 · Artificial Intelligence

DeepSeek Reflection Wave and the Shifting Landscape of AGI Development in China

The article analyzes how DeepSeek's rapid rise has triggered a strategic rethink across Chinese AI startups and tech giants, prompting a shift from product‑centric growth to deep‑model research, while examining the real barriers to AGI and the importance of time‑advantage in the large‑model race.

AGIAIChinese tech

0 likes · 12 min read

DeepSeek Reflection Wave and the Shifting Landscape of AGI Development in China

Fun with Large Models

Mar 8, 2025 · Artificial Intelligence

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

This tutorial explains how to enhance large language models' logical reasoning by using DeepSeek‑R1's deep‑thinking mode, few‑shot prompting, chain‑of‑thought, and zero‑shot chain‑of‑thought techniques, providing concrete examples, comparisons, and a step‑by‑step template for effective prompt design.

AI reasoningDeepSeekchain-of-thought

0 likes · 10 min read

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

Java Architect Essentials

Mar 7, 2025 · Artificial Intelligence

Introducing DeepSeek4j 1.4: A Java Spring Boot Integration for DeepSeek AI with Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java Spring Boot library that overcomes existing framework limitations by preserving DeepSeek's chain‑of‑thought capabilities, adding full reactive streaming, and providing a simple one‑line API along with quick‑start instructions and code examples.

AI integrationDeepSeekJava

0 likes · 5 min read

Introducing DeepSeek4j 1.4: A Java Spring Boot Integration for DeepSeek AI with Chain‑of‑Thought and Streaming Support

Baidu Intelligent Cloud Tech Hub

Mar 7, 2025 · Artificial Intelligence

Deploy DeepSeek R1 with Prefill‑Decode Separation on Baidu Baige

This guide explains how to set up Baidu Baige's PD‑separated deployment for the DeepSeek R1 large‑language model, covering resource preparation, data acquisition, Prefill and Decode service configuration, and API invocation to achieve lower latency and higher throughput.

Baidu BaigeDeepSeekGPU deployment

0 likes · 7 min read

Deploy DeepSeek R1 with Prefill‑Decode Separation on Baidu Baige

DataFunTalk

Mar 7, 2025 · Artificial Intelligence

DeepSeek R1 Technical Report: Insights into Reasoning Models and Their Impact

This presentation reviews the development, technical details, and societal impact of DeepSeek's R1 model, explaining its reasoning capabilities, training pipeline, comparisons with other models, and future directions for AI research and product applications.

AI researchDeepSeekR1

0 likes · 53 min read

DeepSeek R1 Technical Report: Insights into Reasoning Models and Their Impact

Architects' Tech Alliance

Mar 7, 2025 · Industry Insights

How DeepSeek’s V3 and R1 Are Redefining the Global AI Landscape

The 2025 DeepSeek analysis report examines the V3 and R1 models' novel Transformer‑based technologies, their performance gains, and how they are reshaping global AI competition, boosting domestic AI valuations, and ushering in an open‑source AI breakthrough that could spark the next killer applications.

AI modelsDeepSeekmodel technology

0 likes · 5 min read

How DeepSeek’s V3 and R1 Are Redefining the Global AI Landscape

DevOps

Mar 6, 2025 · Artificial Intelligence

Building Multi-Model Chat Agents with Dify: Integrating DeepSeek‑R1 and Gemini

This article explains how to create a high‑performance multi‑model chat agent on the Dify platform by combining DeepSeek‑R1 for reasoning and Gemini for answer generation, covering the underlying principles, configuration steps, API integration, performance benchmarks, and practical deployment guidance.

ChatbotDeepSeekDify

0 likes · 12 min read

Building Multi-Model Chat Agents with Dify: Integrating DeepSeek‑R1 and Gemini

Data Thinking Notes

Mar 6, 2025 · Artificial Intelligence

How China’s State‑Owned Giants Are Accelerating AI with DeepSeek

Amid a global digital surge, 45% of China’s central state‑owned enterprises have deployed the DeepSeek large‑model platform, rapidly integrating AI across energy, power, telecom, construction and other sectors to boost intelligent transformation and operational efficiency.

AI adoptionChinaDeepSeek

0 likes · 7 min read

How China’s State‑Owned Giants Are Accelerating AI with DeepSeek

Model Perspective

Mar 6, 2025 · Artificial Intelligence

Can AI Boost High School Math Problem Solving? A DeepSeek Case Study

This article explores how the AI model DeepSeek can assist high‑school students in tackling challenging sequence problems from the 2024 Chinese college entrance exam, detailing its reasoning process, strengths, pitfalls, and practical tips for using AI to train mathematical thinking rather than just obtain answers.

AIDeepSeekhigh school

0 likes · 9 min read

Can AI Boost High School Math Problem Solving? A DeepSeek Case Study

Fun with Large Models

Mar 6, 2025 · Artificial Intelligence

Master Prompt Engineering: Make AI Follow Your Commands with Simple, Effective Prompts

Prompt engineering transforms vague queries into precise, reliable AI responses by structuring prompts with clear instructions, context, input, and output specifications, and by using role‑playing and formatting tricks, enabling models like DeepSeek and OpenAI to deliver accurate, consistent results across tasks.

AI Prompt DesignDeepSeekOpenAI

0 likes · 15 min read

Master Prompt Engineering: Make AI Follow Your Commands with Simple, Effective Prompts

Architects' Tech Alliance

Mar 5, 2025 · Industry Insights

How DeepSeek’s Open‑Source Tools Are Supercharging AI Model Performance

DeepSeek’s Open‑Source Week unveiled five high‑performance projects—FlashMLA, DeepEP, DeepGEMM, DualPipe/EPLB, and 3FS—each delivering novel GPU optimizations, communication kernels, matrix‑multiplication libraries, parallelism strategies, and a distributed file system that together dramatically accelerate large‑scale AI training and inference workloads.

AI accelerationDeepSeekGPU optimization

0 likes · 9 min read

How DeepSeek’s Open‑Source Tools Are Supercharging AI Model Performance

Java Architect Essentials

Mar 5, 2025 · Artificial Intelligence

Step-by-Step Guide to Integrate DeepSeek AI with a WeChat Public Account Using a Cloud Server

This tutorial walks beginners through obtaining a DeepSeek API key, setting up an Alibaba Cloud ECS instance, configuring the WeChat public‑account interface, cloning and configuring the open‑source COW project, and finally deploying a Python service that connects the WeChat bot to the DeepSeek large‑language model.

DeepSeekPython Tutorialcloud server

0 likes · 13 min read

Step-by-Step Guide to Integrate DeepSeek AI with a WeChat Public Account Using a Cloud Server

Architect's Alchemy Furnace

Mar 5, 2025 · Artificial Intelligence

Boost Your PPT Creation 80% Faster with DeepSeek + Kimi: A Step‑by‑Step Guide

This article demonstrates how to combine DeepSeek's AI‑driven content generation with Kimi's visual design engine to create professional, data‑rich PPTs up to 80% faster, offering reusable prompts, a four‑step workflow, a real‑world case study, and advanced collaboration tips.

AIDeepSeekKimi

0 likes · 10 min read

Boost Your PPT Creation 80% Faster with DeepSeek + Kimi: A Step‑by‑Step Guide

Alibaba Cloud Native

Mar 5, 2025 · Cloud Native

How to Integrate DeepSeek with Alibaba Cloud Native API Gateway: A Step‑by‑Step Guide

This article explains the concepts, evolution, and core capabilities of API gateways, then provides a detailed, cloud‑native tutorial on configuring Alibaba Cloud's API Gateway to connect with DeepSeek, covering prerequisites, service setup, AI API creation, multi‑model routing, and debugging procedures.

Alibaba CloudDeepSeekTutorial

0 likes · 17 min read

How to Integrate DeepSeek with Alibaba Cloud Native API Gateway: A Step‑by‑Step Guide

Architects' Tech Alliance

Mar 5, 2025 · Industry Insights

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

The article analyzes DeepSeek's recent releases—V3 dialogue model and R1 inference model—detailing their launch dates, rapid popularity surge, R1's reinforcement‑learning‑based design for code and math tasks, and provides links to related Beijing University technical reports while stripping promotional sales content.

AIDeepSeekModel Development

0 likes · 3 min read

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

Java Architect Essentials

Mar 5, 2025 · Artificial Intelligence

How to Seamlessly Integrate DeepSeek AI into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys and models, and using the AI assistant for code generation, completion, explanation, and troubleshooting directly within IntelliJ IDEA, while also showing usage statistics.

AI code assistantCodeGPTDeepSeek

0 likes · 8 min read

How to Seamlessly Integrate DeepSeek AI into IntelliJ IDEA for Java Development