Tag

DeepSeek

Sohu Tech Products
Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AI · Database · DeepSeek
0 likes · 15 min read
DataFunTalk
Jun 11, 2025 · Backend Development

Master Modern Web Scraping: From Classic Tools to DeepSeek AI Integration

This article provides a comprehensive overview of web‑scraping technologies, compares popular tools such as requests, BeautifulSoup and Selenium, introduces AI‑assisted crawling with DeepSeek, and walks through practical steps for using BrightData’s platform to collect industry data, complete with ready‑to‑run Python code.

BeautifulSoup · BrightData · DeepSeek
0 likes · 13 min read
Full-Stack Internet Architecture
Jun 6, 2025 · Artificial Intelligence

How to Build a Spring AI Hello World with Ollama and DeepSeek Locally

This step‑by‑step tutorial shows how to install Ollama, pull the DeepSeek‑R1 model, create a Spring Boot project with the Spring AI Ollama starter, code a ChatController, and test a local AI "Hello World" integration, illustrating AI‑enhanced backend development.

AI integration · DeepSeek · Java
0 likes · 7 min read
Java Architecture Diary
Jun 5, 2025 · Artificial Intelligence

Unlock AI Reasoning: How Ollama’s New ‘Thinking’ Feature Works

Version 0.9.0 of Ollama introduces a ‘thinking’ control that lets users view and manage the AI model’s reasoning process, with detailed CLI commands, REST API usage, model support list, scripting options, and advanced Modelfile configurations for models like DeepSeek R1 and Qwen 3.

AI reasoning · CLI · DeepSeek
0 likes · 6 min read
Full-Stack Internet Architecture
Jun 3, 2025 · Artificial Intelligence

Spring AI 1.0 General Availability: Overview and Dependency Setup

The article announces the official release of Spring AI 1.0, explains the meaning of GA, provides the Maven dependency configuration, describes how Spring AI simplifies AI integration for Java applications, and outlines its key features and supported model types.

AI integration · Artificial Intelligence · DeepSeek
0 likes · 3 min read
Efficient Ops
May 29, 2025 · Artificial Intelligence

DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3

DeepSeek quietly launched the R1 0528 model, which early testers report matches OpenAI’s o3 in benchmarks and style while adding deeper chain‑of‑thought reasoning, better writing output, and extended thinking windows. The announcement is followed by a promotion for the GOPS Global Ops Conference.

AI performance · Chain-of-Thought · DeepSeek
0 likes · 3 min read
IT Services Circle
May 25, 2025 · Artificial Intelligence

DeepSeek Core Technologies and Model Innovations: DeepSeek‑V3 and DeepSeek‑R1 Technical Overview

The article provides a detailed technical overview of DeepSeek's flagship large language models, DeepSeek‑V3 and DeepSeek‑R1, describing their MoE architecture, training frameworks, reinforcement‑learning based fine‑tuning, inference optimizations, and the broader impact of these innovations on the AI landscape while also promoting related books and resources.

AI · DeepSeek · Mixture of Experts
0 likes · 10 min read
Java Architecture Diary
May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, an enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers such as OpenAI, Anthropic, and DeepSeek.

AI Framework · DeepSeek · Java AI
0 likes · 13 min read
Architect's Guide
May 18, 2025 · Backend Development

Integrating DeepSeek AI with a WeChat Public Account: A Step‑by‑Step Backend Tutorial

This tutorial walks beginners through obtaining a DeepSeek API key, setting up an Alibaba Cloud ECS instance, configuring the server and WeChat public platform, installing required Python dependencies, editing configuration files, and finally running the chatbot so the public account can interact with the DeepSeek large‑language model.

API · Cloud · DeepSeek
0 likes · 12 min read
DataFunSummit
May 17, 2025 · Artificial Intelligence

Integrating Knowledge Graphs with DeepSeek AI for Enterprise Knowledge Management

This presentation explores how combining knowledge graphs with DeepSeek large‑model agents can revolutionize enterprise knowledge management, detailing DeepSeek’s technical strengths, the graph‑model complementarity paradigm, various knowledge types, practical frameworks, case studies, and future outlooks for AI‑enhanced intelligent systems.

Artificial Intelligence · DeepSeek · Enterprise Knowledge Management
0 likes · 23 min read
Architect's Guide
May 13, 2025 · Artificial Intelligence

DeepSeek Model Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

This article provides a comprehensive overview of DeepSeek's model distillation technology, detailing its definition, key innovations, architecture, training methods, performance gains, and the remaining challenges such as the implicit performance ceiling and multimodal data distillation.

AI optimization · DeepSeek · Large Language Models
0 likes · 14 min read
iKang Technology Team
May 12, 2025 · Artificial Intelligence

Introducing Spring AI: Building a Simple Chat Application with DeepSeek

This article introduces Spring AI, explains its core features for integrating various AI models, and walks through creating a Spring Boot chat application that connects to the DeepSeek model using both synchronous and streaming endpoints.

AI integration · Chatbot · DeepSeek
0 likes · 7 min read
JD Tech
May 8, 2025 · Artificial Intelligence

The Emerging Boom of Large Model Applications and Why 2025 Will Be the Turning Point

Amid the AI wave, large language models like DeepSeek R1 are poised for explosive growth in 2025, driven by open-source availability, low-cost access, and superior reasoning. Successful deployment requires four key factors (domain expertise, knowledge bases, robust search, and engineered agent architectures) to unlock value beyond simple chat.

2025 · AI applications · DeepSeek
0 likes · 10 min read
DevOps
May 5, 2025 · Artificial Intelligence

DeepSeek Releases Math‑Specialized Large Model V2 and ProverBench Evaluation Suite

DeepSeek has quietly open‑sourced a new mathematics‑focused large language model, DeepSeek‑Prover‑V2 (available in 671B and 7B variants), achieving 88.9% on MiniF2F and strong results on PutnamBench, alongside the high‑quality ProverBench dataset and a novel recursive theorem‑proving pipeline.

AI · DeepSeek · ProverBench
0 likes · 4 min read
Architects' Tech Alliance
May 2, 2025 · Artificial Intelligence

DeepSeek‑Prover‑V2‑671B: A Massive AI Model for Formal Mathematical Theorem Proving

DeepSeek‑Prover‑V2‑671B, a 671‑billion‑parameter AI model released on Hugging Face, advances formal mathematical theorem proving with an MoE architecture, FP8 quantization, and a 163K‑token context. The article reports superior performance over GPT‑4 Turbo and other models and discusses broad implications for research and industry.

AI · DeepSeek · FP8 Quantization
0 likes · 11 min read
Code Mala Tang
Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most eyes remain on familiar AI giants, China’s Alibaba and DeepSeek are unveiling open‑source video and inference models that run on consumer GPUs, sparking a regulatory scramble and threatening the dominance of closed‑source AI, heralding a rapid, disruptive shift across the industry.

AI localization · AI regulation · AI video
0 likes · 10 min read
Java Architect Essentials
Apr 2, 2025 · Backend Development

Integrating DeepSeek Large Language Model with Spring Boot to Build an AI Chat Application

This guide demonstrates how to create a Spring Boot backend that integrates DeepSeek's large language model via the Spring AI OpenAI starter, covering project setup, dependency configuration, API key management, and a sample controller that provides AI-powered chat responses such as weather forecasts.

AI integration · Chatbot · DeepSeek
0 likes · 8 min read
Tencent Technical Engineering
Mar 31, 2025 · Artificial Intelligence

Step-by-Step Guide to Local Training of DeepSeek R1 on Multi‑GPU A100 Systems

This step‑by‑step tutorial shows how to set up CUDA 12.4, install required packages, prepare a JSON dataset and custom reward, troubleshoot out‑of‑memory errors, and launch DeepSeek R1 training on an 8‑GPU A100 cluster using Accelerate, DeepSpeed ZeRO‑3, and vLLM configurations.

A100 · CUDA · DeepSeek
0 likes · 9 min read
Java Tech Enthusiast
Mar 29, 2025 · Frontend Development

Building a Twitter Image Card Browser Extension with Claude Sonnet and DeepSeek

In a side‑by‑side test on the Trae platform, the author used Claude 3.5 Sonnet to create a functional Twitter‑to‑image‑card browser extension in roughly twenty minutes, while DeepSeek‑R1 required multiple prompt iterations and manual bug fixes and still produced visual glitches. The author concludes that Claude was the more reliable choice for frontend plugin generation.

AI code generation · Claude Sonnet · DeepSeek
0 likes · 5 min read
Java Architecture Diary
Mar 26, 2025 · Artificial Intelligence

How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling

The article introduces DeepSeek's new V3-0324 model, highlights its performance gains and new features such as function calling and standardized JSON output, walks through Chinese‑language and frontend coding tests, provides Java code examples for AI integration, and closes with a summary of its business impact.

AI · Chat2BI · DeepSeek
0 likes · 6 min read