Tagged articles

1070 articles

Page 7 of 11

Apr 9, 2025 · Artificial Intelligence

Unlocking LLM Reasoning: A Deep Dive into Prompt Engineering Techniques

This article surveys classic prompt‑engineering methods such as Chain‑of‑Thought, Self‑Consistency, Least‑to‑Most, Boosting of Thoughts, Tree of Thoughts, and AutoGPT, summarizing their core ideas, advantages, limitations, and experimental results to help readers understand how to enhance large language model reasoning without model fine‑tuning.

AI reasoningLarge Language ModelsSelf-Consistency

0 likes · 22 min read

Unlocking LLM Reasoning: A Deep Dive into Prompt Engineering Techniques

Big Data Technology & Architecture

Apr 9, 2025 · Artificial Intelligence

Overview of Data Agents: Definitions, Applications, and Recent Developments by Google, Alibaba Cloud, and ByteDance

This article introduces the concept of AI-powered Data Agents, outlines their key features and use cases across enterprise analytics, data governance, and intelligent customer service, and reviews recent implementations from Google, Alibaba Cloud, and ByteDance, highlighting their impact on modern data-driven workflows.

Artificial IntelligenceData AgentData Analysis

0 likes · 8 min read

Overview of Data Agents: Definitions, Applications, and Recent Developments by Google, Alibaba Cloud, and ByteDance

AIWalker

Apr 8, 2025 · Artificial Intelligence

AgenticIR: An Agentic System for Restoring Images with Complex Degradations

AgenticIR combines visual language models and large language models in a multi‑stage reasoning workflow—perception, planning, execution, reflection, and adjustment—to evaluate, plan, and iteratively apply specialized restoration tools, achieving superior results on complexly degraded images compared to baseline methods.

Agentic SystemsICLR 2025Large Language Models

0 likes · 15 min read

AgenticIR: An Agentic System for Restoring Images with Complex Degradations

Model Perspective

Apr 8, 2025 · Artificial Intelligence

Why Learning Machine Learning Still Matters in the Age of Giant AI Models

The article argues that despite the rapid rise of powerful large language models, mastering machine learning remains essential because it underpins these models, offers customized solutions for specialized tasks, and cultivates the mathematical, programming, and analytical skills needed to effectively use and extend AI technologies.

AILarge Language ModelsMachine Learning

0 likes · 10 min read

Why Learning Machine Learning Still Matters in the Age of Giant AI Models

macrozheng

Apr 8, 2025 · Artificial Intelligence

Boost AI Prompt Quality with Prompt Optimizer: Features, Docker Setup & Real‑World Demo

This guide introduces Prompt Optimizer, a client‑side AI prompt‑enhancement tool with over 2k GitHub stars, outlines its key features, provides step‑by‑step Docker installation commands, showcases a real‑world SpringBoot‑Vue e‑commerce project, and demonstrates how to generate and compare optimized prompts for better LLM responses.

AI Prompt OptimizationDockerLarge Language Models

0 likes · 6 min read

Boost AI Prompt Quality with Prompt Optimizer: Features, Docker Setup & Real‑World Demo

Alibaba Cloud Developer

Apr 8, 2025 · Artificial Intelligence

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

This article introduces the fundamental paradigms of large language models, explaining how simple prompts, messages, and tools like RAG and ReAct enable powerful applications, while providing practical code examples, translation strategies, and insights on prompt engineering, tool usage, and model fine‑tuning.

AILLM applicationsLarge Language Models

0 likes · 23 min read

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

DataFunSummit

Apr 7, 2025 · Artificial Intelligence

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

This article examines how Retrieval‑Augmented Generation (RAG) and multi‑agent technologies narrow the gap between large language models and practical deployment, highlighting their roles in operations automation, financial risk control, intelligent data governance, database localization, edge inference, and future AI‑driven solutions.

Data GovernanceLarge Language ModelsOperations Automation

0 likes · 8 min read

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

Architecture and Beyond

Apr 5, 2025 · Artificial Intelligence

Why Defining Problem Boundaries Is Crucial for Effective AI Agents

The article discusses how defining clear problem boundaries is essential for AI agents, explains the challenges of vague tasks for large language models, and proposes multi‑stage decomposition, self‑reflection, and human‑in‑the‑loop strategies to improve AI performance on complex, dynamic tasks.

AIAgent ArchitectureLarge Language Models

0 likes · 13 min read

Why Defining Problem Boundaries Is Crucial for Effective AI Agents

Ops Development & AI Practice

Apr 4, 2025 · Industry Insights

Are Open‑Source LLMs Closing the Gap with Closed‑Source Giants?

A recent leaderboard analysis of top LLMs reveals that while closed‑source models like Gemini‑2.5‑Pro and ChatGPT‑4o still lead overall, open‑source models such as DeepSeek‑V3 and Llama are rapidly narrowing the performance gap, especially in specialized tasks like coding, driven by faster tech diffusion, public datasets, community collaboration, and reduced compute costs.

AI competitionLarge Language ModelsOpen Source

0 likes · 8 min read

Are Open‑Source LLMs Closing the Gap with Closed‑Source Giants?

Code Mala Tang

Apr 3, 2025 · Artificial Intelligence

Intel Core Ultra 5 vs Apple M1: Which Wins for Large Language Model Inference?

This article compares the inference performance of a high‑end Intel Core Ultra 5 AI workstation with an Apple M1 MacBook Air using the IPEX‑LLM library, detailing installation steps, minimal code changes, resource usage, and benchmark results for small and large language models.

AI inferenceApple M1IPEX-LLM

0 likes · 9 min read

Intel Core Ultra 5 vs Apple M1: Which Wins for Large Language Model Inference?

JD Retail Technology

Apr 2, 2025 · Artificial Intelligence

One4All: A Scalable Multi‑Task Generative Recommendation Framework for CPS Advertising

The paper introduces One4All, a scalable multi‑task generative recommendation framework for CPS advertising that combines few‑shot intent prompting, a Rewards‑in‑Context multi‑objective optimization, and an online model‑selection strategy, delivering 2‑3× offline HitRate/NDCG gains and notable online CTR, CVR, and commission improvements.

AdvertisingLLMLarge Language Models

0 likes · 14 min read

One4All: A Scalable Multi‑Task Generative Recommendation Framework for CPS Advertising

Architects' Tech Alliance

Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeekGPT-4oGemini

0 likes · 9 min read

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

Architect

Apr 1, 2025 · Artificial Intelligence

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

The article explains why most projects should start with prompt engineering or simple agent workflows, outlines the scenarios where model fine‑tuning adds real value, compares fine‑tuning with Retrieval‑Augmented Generation, and offers practical criteria for deciding which approach to adopt.

AI deploymentLarge Language ModelsLoRA

0 likes · 9 min read

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

AI Frontier Lectures

Mar 31, 2025 · Industry Insights

Why GPT‑4o’s Image Generation Is Overwhelming Users—and What It Means for AI

OpenAI’s GPT‑4o image generation, launched only for paid users, quickly hit performance bottlenecks and sparked a flood of viral content, prompting technical analysis of its multimodal capabilities, speed issues, copyright concerns, and the broader impact on the AI industry.

AI IndustryAI multimodalGPT-4o

0 likes · 5 min read

Why GPT‑4o’s Image Generation Is Overwhelming Users—and What It Means for AI

AntTech

Mar 31, 2025 · Artificial Intelligence

Ant Group Papers Accepted at ICLR 2025: Summaries and Links

The article presents the abstracts, publication types, links, and research areas of seventeen Ant Group papers accepted at ICLR 2025, covering topics such as embodied robot co‑design, efficient distributed training for large language models, optimization via LLMs, character animation, interactive frame interpolation, KV‑cache management, and privacy‑preserving Transformers.

AI researchAnt GroupICLR2025

0 likes · 23 min read

Ant Group Papers Accepted at ICLR 2025: Summaries and Links

Architects' Tech Alliance

Mar 31, 2025 · Artificial Intelligence

A Comprehensive History of Large Language Models from the Transformer Era (2017) to DeepSeek‑R1 (2025)

This article reviews the evolution of large language models from the 2017 Transformer breakthrough through BERT, GPT series, alignment techniques, multimodal extensions, open‑weight releases, and the cost‑efficient DeepSeek‑R1 in 2025, highlighting key technical advances, scaling trends, and their societal impact.

AI alignmentLLM evolutionLarge Language Models

0 likes · 26 min read

A Comprehensive History of Large Language Models from the Transformer Era (2017) to DeepSeek‑R1 (2025)

Data Thinking Notes

Mar 30, 2025 · Artificial Intelligence

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

This comprehensive analysis by the Peking University AI Alignment team dissects the technical innovations behind DeepSeek‑R1, DeepSeek‑R1 Zero, and Kimi‑K1.5, covering reinforcement‑learning‑based post‑training, rule‑based rewards, GRPO optimization, scaling laws, multimodal extensions, safety challenges, and future research directions.

AI alignmentDeepSeekKimi

0 likes · 57 min read

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

Architect

Mar 30, 2025 · Artificial Intelligence

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

This article provides a comprehensive survey of Retrieval‑Augmented Generation (RAG), covering its basic principles, key components, seven technical variants, challenges, evaluation methods, and future research directions across multimodal, graph‑based, and agentic extensions.

AI SurveyKnowledge retrievalLarge Language Models

0 likes · 9 min read

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

AI Frontier Lectures

Mar 30, 2025 · Artificial Intelligence

Do Large Language Models Mirror Human Brain Language Processing? Google’s Groundbreaking Findings

Google researchers discovered a linear relationship between brain activity recorded during natural conversation and the internal embeddings of a speech‑to‑text large language model, revealing that acoustic and lexical representations from the model can accurately predict neural responses in both language comprehension and production.

AI researchGoogleLarge Language Models

0 likes · 8 min read

Do Large Language Models Mirror Human Brain Language Processing? Google’s Groundbreaking Findings

Cognitive Technology Team

Mar 30, 2025 · Artificial Intelligence

Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning

Prompt engineering uses in‑context learning to turn large language models into precise, task‑aware assistants by providing well‑crafted prompts that guide the model’s probability distribution, reduce hallucinations, and unlock hidden knowledge without any parameter tuning.

Artificial IntelligenceIn-Context LearningLarge Language Models

0 likes · 6 min read

Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning

Alibaba Cloud Big Data AI Platform

Mar 29, 2025 · Artificial Intelligence

How DistilQwen2.5‑R1 Boosts Small‑Model Reasoning with Innovative Knowledge Distillation

The article introduces the DistilQwen2.5‑R1 series, which leverages a novel knowledge‑distillation pipeline—including CoT data evaluation, improvement, and validation—to transfer deep reasoning abilities from large models like DeepSeek‑R1 to compact models, achieving superior performance across math, code, and scientific benchmarks and providing open‑source checkpoints and deployment guides for practical use.

AI inferenceKnowledge DistillationLarge Language Models

0 likes · 17 min read

How DistilQwen2.5‑R1 Boosts Small‑Model Reasoning with Innovative Knowledge Distillation

Alimama Tech

Mar 28, 2025 · Artificial Intelligence

How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding

Alibaba’s Taobao Group unveiled its AIGX technology suite, including the RecGPT recommendation model, the AIGB generative bidding system, and a new AI‑generated video engine, detailing open‑source benchmarks, NeurIPS workshop participation, and measurable ROI improvements for e‑commerce advertising.

AIGenerative BiddingLarge Language Models

0 likes · 5 min read

How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding

Qborfy AI

Mar 28, 2025 · Artificial Intelligence

Master Prompt Engineering: From Basics to Advanced SQL Generation

This article walks readers through the fundamentals of prompt engineering—covering role, context, instruction, examples, and output formatting—then demonstrates a step‑by‑step construction of a sophisticated SQL‑generation prompt, complete with concrete code snippets, best‑practice tips, and reference resources.

AI Prompt DesignInstruction TuningLarge Language Models

0 likes · 21 min read

Master Prompt Engineering: From Basics to Advanced SQL Generation

Alibaba Cloud Developer

Mar 26, 2025 · Artificial Intelligence

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

DeepSeek, a Chinese AI startup, offers open‑source large language models—DeepSeek‑V3 for general tasks and DeepSeek‑R1 for intensive reasoning—featuring MoE, MLA, low‑cost training, and competitive performance against OpenAI’s GPT‑4o, while providing detailed usage guidance and cost analysis.

AI inferenceDeepSeekLarge Language Models

0 likes · 21 min read

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

Architects' Tech Alliance

Mar 25, 2025 · Industry Insights

How Near‑Memory Computing Can Power Edge LLMs: A 2025 Storage Framework

The article analyzes the challenges of deploying large language models on cloud servers—such as latency, security, and constant connectivity—and explains how near‑memory computing architectures (PNM, PIM, CIM) can integrate storage and processing to enable efficient, high‑performance edge AI deployments, outlining the trade‑offs of each approach.

Artificial IntelligenceEdge AIIndustry Insights

0 likes · 5 min read

How Near‑Memory Computing Can Power Edge LLMs: A 2025 Storage Framework

Alibaba Cloud Developer

Mar 25, 2025 · Artificial Intelligence

Boost Your AI Search Skills: Advanced Prompt & Query Tricks

This guide explains how to leverage AI tools with deep web‑search capabilities, covering site‑specific queries, wildcard operators, date ranges, Boolean logic, and effective prompt engineering techniques—including Socratic questioning and CRISPE framework—to improve information retrieval accuracy and efficiency across various domains.

AILarge Language ModelsSearch Operators

0 likes · 8 min read

Boost Your AI Search Skills: Advanced Prompt & Query Tricks

AI Frontier Lectures

Mar 24, 2025 · Artificial Intelligence

What Can AI Agents Learn from the Latest AIR 2025 Research?

The article compiles insights from the AIR 2025 conference and related talks, covering the evolution of agents from reinforcement‑learning to LLM‑driven systems, novel agent architectures like AIDE, GUI agents, natural‑language reinforcement learning, and scaling advances in large language models such as Qwen, while highlighting key algorithms, benchmarks, and open research questions.

AI agentsAgent ArchitectureGUI agents

0 likes · 27 min read

What Can AI Agents Learn from the Latest AIR 2025 Research?

Architects' Tech Alliance

Mar 22, 2025 · Industry Insights

What Does DeepSeek’s 2025 AI Report Reveal About the Future of Large Models?

The 2025 DeepSeek Insight report analyzes DeepSeek’s new large‑model releases, compares US and Chinese AI ecosystems, outlines diverse application scenarios such as government, healthcare and aerospace, and provides practical guidance for safely leveraging these models despite their current limitations.

AI IndustryAI safetyDeepSeek

0 likes · 5 min read

What Does DeepSeek’s 2025 AI Report Reveal About the Future of Large Models?

Model Perspective

Mar 21, 2025 · Artificial Intelligence

How DeepSeek’s Tree‑Based Reasoning Transforms AI Interaction

DeepSeek’s R1 inference mode replaces linear chain‑of‑thought with a transparent, multi‑path tree reasoning system, offering layered analysis, intent understanding, memory management, emotion detection, and hallucination mitigation, illustrated through a practical example of buying authentic cigarettes and detailed technical breakdowns.

Artificial IntelligenceLarge Language ModelsMemory

0 likes · 16 min read

How DeepSeek’s Tree‑Based Reasoning Transforms AI Interaction

Continuous Delivery 2.0

Mar 21, 2025 · Artificial Intelligence

AI-Driven Automated Unit Test Generation Framework: Architecture, Workflow, and Evaluation

This article presents an AI‑powered framework that automatically scans codebases, generates comprehensive unit tests using large language models, and includes self‑repair agents, detailing its workflow, core components, strategies for accuracy, practical benefits, and current limitations.

AI testingLarge Language Modelsself-repair

0 likes · 9 min read

AI-Driven Automated Unit Test Generation Framework: Architecture, Workflow, and Evaluation

AI Algorithm Path

Mar 20, 2025 · Artificial Intelligence

Understanding Multimodal Large Language Models: Recent Advances and Comparative Analysis

This article surveys the latest multimodal large language model research, dissecting the design, training strategies, and performance trade‑offs of models such as Llama 3.2, Molmo, NVLM, Qwen2‑VL, Pixtral, MM1.5, Emu3, and Janus, and highlights the challenges of fair cross‑model evaluation.

AI researchCross-AttentionLarge Language Models

0 likes · 16 min read

Understanding Multimodal Large Language Models: Recent Advances and Comparative Analysis

AI Frontier Lectures

Mar 20, 2025 · Artificial Intelligence

Why Multimodal LLMs Still Struggle with Multi-Image Math Reasoning: Insights from MV‑MATH

This article introduces the MV‑MATH dataset, a large‑scale multi‑image math benchmark, and evaluates 24 open‑source and closed‑source multimodal large language models, revealing significant performance gaps, especially on complex visual dependencies and higher difficulty levels.

Large Language Modelsdatasetmath reasoning

0 likes · 8 min read

Why Multimodal LLMs Still Struggle with Multi-Image Math Reasoning: Insights from MV‑MATH

JavaScript

Mar 20, 2025 · Artificial Intelligence

How MiniMax’s Linear‑Attention Architecture Is Redefining Long‑Context AI Models

MiniMax’s rapid 2025 releases—including a video model, open‑source LLM, and high‑fidelity voice model—showcase its multimodal linear‑attention architecture that handles up to 4 million tokens, earns a16z recognition, and signals China’s growing influence in open‑source AI innovation.

Artificial IntelligenceLarge Language ModelsLinear Attention

0 likes · 8 min read

How MiniMax’s Linear‑Attention Architecture Is Redefining Long‑Context AI Models

AI Frontier Lectures

Mar 17, 2025 · Artificial Intelligence

Can Diffusion Models Outrun Traditional LLMs? Mercury Coder’s Speed & Architecture

The article analyzes Mercury Coder, a diffusion‑based language model that generates text and code in parallel, compares its speed and quality against traditional autoregressive LLMs like GPT‑4o‑mini using a ball‑collision benchmark, and discusses the underlying score‑entropy training, current limitations, and future multimodal potential.

AI PerformanceBenchmarkLarge Language Models

0 likes · 8 min read

Can Diffusion Models Outrun Traditional LLMs? Mercury Coder’s Speed & Architecture

ZhongAn Tech Team

Mar 17, 2025 · Artificial Intelligence

Weekly Tech Digest: AI Model Advancements, Strategic Infrastructure Deals, and Industry Insights on AI Agents

This weekly technology digest highlights significant advancements in artificial intelligence, including OpenAI's Python-enabled o1 model, Google's open-source Gemma 3, and Alibaba's AI-driven Quark application, alongside major industry partnerships, expert forecasts on AI agent proliferation, and emerging developments in robotics and wearable technology.

AI agentsArtificial IntelligenceLarge Language Models

0 likes · 7 min read

Weekly Tech Digest: AI Model Advancements, Strategic Infrastructure Deals, and Industry Insights on AI Agents

Alibaba Cloud Developer

Mar 17, 2025 · Artificial Intelligence

23 Proven Prompt Engineering Techniques to Make AI Understand You Instantly

As large language models become increasingly adept at natural language, mastering prompt engineering remains essential; this article compiles 23 practical strategies—from concise commands and role‑playing to structured formatting and output guidance—that empower users to communicate clearly with AI and obtain high‑quality, targeted results.

AI interactionAI productivityLarge Language Models

0 likes · 18 min read

23 Proven Prompt Engineering Techniques to Make AI Understand You Instantly

Fighter's World

Mar 14, 2025 · Industry Insights

Will the 10× Growth Promise of Vertical AI Crumble as Generalist LLMs Like Manus Dominate the Market?

The article examines whether the surge of general‑purpose large language models such as Manus, Claude Sonet, and Qwen undermines the Bessemer Venture Partners claim that Vertical AI will grow tenfold, by analysing market size, use‑case demand, technical challenges, emerging business models, and competitive moats.

AI MarketAI agentsBusiness Models

0 likes · 19 min read

Will the 10× Growth Promise of Vertical AI Crumble as Generalist LLMs Like Manus Dominate the Market?

Zhihu Tech Column

Mar 14, 2025 · Artificial Intelligence

Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations

The article summarizes Zhihu’s technical talk on the ZhiLight large‑model inference framework, detailing model execution mechanisms, GPU load analysis, multi‑GPU parallel strategies, open‑source engine comparisons, compute‑communication overlap, quantization techniques, benchmark results, and future directions for scalable LLM deployment.

GPU parallelismLarge Language ModelsSGLang

0 likes · 11 min read

Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations

Alimama Tech

Mar 14, 2025 · Artificial Intelligence

Advances in Search Advertising Models with Large Language Models (2024)

In 2024 Alibaba Mama outlines how large‑language models transform search advertising through a three‑line scaling roadmap—explicit inductive‑bias design, implicit compute growth, and auxiliary CV/NLP advances—implemented via a pre‑train/post‑train/CTR paradigm and the LUM user‑behavior model, promising gains in relevance, recall, and real‑time serving while highlighting inference efficiency challenges.

CTR predictionLarge Language ModelsScaling Law

0 likes · 25 min read

Advances in Search Advertising Models with Large Language Models (2024)

Baidu Tech Salon

Mar 13, 2025 · Artificial Intelligence

How PaddlePaddle 3.0 Boosts Large‑Model Inference with 4‑Bit Quantization and MLA Optimizations

PaddlePaddle 3.0 introduces a full‑stack inference engine that supports FP8, INT8, and 4‑bit quantization for popular LLMs such as DeepSeek V3/R1, delivers up to 2× token throughput on a single H800 GPU, and provides detailed deployment scripts for single‑node and multi‑node setups, including MTP speculative decoding and SageAttention for long‑sequence acceleration.

DockerInference OptimizationLarge Language Models

0 likes · 13 min read

How PaddlePaddle 3.0 Boosts Large‑Model Inference with 4‑Bit Quantization and MLA Optimizations

Alibaba Cloud Big Data AI Platform

Mar 13, 2025 · Artificial Intelligence

From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from AI Agent Engineering

This article traces the evolution of large‑model agents from a simple chain‑of‑thought design through tool and agent instantiation, structured PEER patterns, and self‑evolving architectures, highlighting practical challenges, middleware solutions, and open‑source resources for building robust AI agents.

AI agentsAgent ArchitectureLarge Language Models

0 likes · 16 min read

From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from AI Agent Engineering

Architects' Tech Alliance

Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

An in‑depth analysis reveals how DeepSeek’s V3 and R1 large‑language models achieve superior inference performance and cost efficiency on Huawei’s Ascend AI platform, detailing architectural optimizations, KV‑Cache reductions, multimodal support, real‑world deployments across finance, government, manufacturing, and the projected impact on the AI industry.

DeepSeekHuawei AscendLarge Language Models

0 likes · 4 min read

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

AI Algorithm Path

Mar 11, 2025 · Artificial Intelligence

AI Agents Overview: Foundations, Core Components, and When to Use Them

This article provides a comprehensive overview of AI Agents, tracing their evolution from traditional chatbots to LLM‑driven agents, explaining core components such as perception, reasoning, action, knowledge bases, learning and communication interfaces, and discussing practical use cases, interaction cycles, and future prospects.

AI agentsAutonomous SystemsLarge Language Models

0 likes · 15 min read

AI Agents Overview: Foundations, Core Components, and When to Use Them

58 Tech

Mar 11, 2025 · Artificial Intelligence

Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques

This article presents a comprehensive case study on how large language models are integrated into 58.com’s real‑estate recommendation platform, detailing challenges, data adaptation, prompt and parameter optimizations, embedding generation, conversational recommendation, and future directions for multimodal and generative recommendation systems.

EmbeddingLarge Language ModelsPrompt Engineering

0 likes · 14 min read

Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques

Efficient Ops

Mar 9, 2025 · Artificial Intelligence

Essential LLMOps Tools: Build, Deploy, Monitor, and Manage Large Language Models

LLMOps, the end-to-end methodology for managing large language models, encompasses a curated set of development, deployment, monitoring, and local management tools—such as LangChain, vLLM, LangSmith, and Ollama—enabling practitioners to efficiently build, scale, and maintain AI applications.

AI developmentLLMOpsLarge Language Models

0 likes · 6 min read

Essential LLMOps Tools: Build, Deploy, Monitor, and Manage Large Language Models

Architects' Tech Alliance

Mar 9, 2025 · Industry Insights

DeepSeek’s AI Ecosystem: From Core Tech to Market Impact

This article provides a comprehensive analysis of DeepSeek, covering its foundational AI research, technology stack, product offerings, and the broader upstream, midstream, and downstream AI industry landscape, including hardware, server, cloud, and market trends.

AI infrastructureArtificial IntelligenceDeepSeek

0 likes · 13 min read

DeepSeek’s AI Ecosystem: From Core Tech to Market Impact

Fun with Large Models

Mar 8, 2025 · Artificial Intelligence

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

This tutorial explains how to enhance large language models' logical reasoning by using DeepSeek‑R1's deep‑thinking mode, few‑shot prompting, chain‑of‑thought, and zero‑shot chain‑of‑thought techniques, providing concrete examples, comparisons, and a step‑by‑step template for effective prompt design.

AI reasoningDeepSeekLarge Language Models

0 likes · 10 min read

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

Code Mala Tang

Mar 8, 2025 · Artificial Intelligence

14 Powerful Prompt Engineering Techniques to Unlock AI’s Full Potential

This article introduces the fundamentals of prompt engineering and presents fourteen practical techniques—ranging from role‑playing and step‑by‑step reasoning to chain‑of‑thought and ReAct—that help users craft precise, high‑quality prompts for any large language model, dramatically improving AI output.

AIAI productivityLLM techniques

0 likes · 16 min read

14 Powerful Prompt Engineering Techniques to Unlock AI’s Full Potential

Cognitive Technology Team

Mar 7, 2025 · Artificial Intelligence

From Word Embeddings to Large Language Models: A Comprehensive Overview of AI Model Evolution

This article traces the development of AI models—from early word embeddings like Word2Vec and ELMo, through transformer‑based encoders such as BERT and decoder‑only models like GPT‑1/2/3, to recent multimodal systems and scaling laws—explaining their architectures, training methods, and impact on modern AI applications.

AIEmbeddingLarge Language Models

0 likes · 22 min read

From Word Embeddings to Large Language Models: A Comprehensive Overview of AI Model Evolution

Alibaba Cloud Big Data AI Platform

Mar 7, 2025 · Artificial Intelligence

How Pai‑Megatron‑Patch Boosts Qwen2‑VL Multimodal Training Efficiency

This article explains how the Pai‑Megatron‑Patch toolkit enhances the usability and training performance of the Qwen2‑VL multimodal large model by introducing model‑parallel weight conversion, user‑friendly data loading, visual feature processing optimizations, optimizer offloading, and pipeline parallelism techniques, supported by extensive experimental analysis.

Large Language ModelsMegatronPipeline Parallelism

0 likes · 25 min read

How Pai‑Megatron‑Patch Boosts Qwen2‑VL Multimodal Training Efficiency

dbaplus Community

Mar 7, 2025 · Artificial Intelligence

Master Prompt Engineering: Frameworks, Strategies, and Real‑World Examples for Large Language Models

This comprehensive guide explains what prompts are, outlines essential prompt components and multiple engineering frameworks, presents practical strategies for crafting clear and structured prompts, addresses model limitations such as hallucinations, and showcases a wide range of advanced prompting techniques with code examples.

AILLMLarge Language Models

0 likes · 29 min read

Master Prompt Engineering: Frameworks, Strategies, and Real‑World Examples for Large Language Models

Data Thinking Notes

Mar 6, 2025 · Artificial Intelligence

How China’s State‑Owned Giants Are Accelerating AI with DeepSeek

Amid a global digital surge, 45% of China’s central state‑owned enterprises have deployed the DeepSeek large‑model platform, rapidly integrating AI across energy, power, telecom, construction and other sectors to boost intelligent transformation and operational efficiency.

AI adoptionChinaDeepSeek

0 likes · 7 min read

How China’s State‑Owned Giants Are Accelerating AI with DeepSeek

MaGe Linux Operations

Mar 6, 2025 · Operations

How Large Language Models Are Revolutionizing SRE from Firefighting to Proactive Ops

This article explores how open‑source large language models like DeepSeek empower SRE teams to shift from reactive firefighting to proactive, predictive operations, detailing technical principles, real‑world case studies, essential skill sets, and future trends that reshape the operations landscape.

AI OpsAutomationLarge Language Models

0 likes · 8 min read

How Large Language Models Are Revolutionizing SRE from Firefighting to Proactive Ops

JD Retail Technology

Mar 6, 2025 · Artificial Intelligence

Dynamic Margin Selection for Efficient Deep Learning and Low-Resource Large Model Training

Jia Xing’s research introduces Dynamic Margin Selection, a technique that repeatedly refreshes a core set of boundary‑close samples to train large language models efficiently on limited resources, achieving comparable loss to full‑data training, enabling six‑fold model compression, faster inference, and a proposed exponential scaling law for data‑efficient AI.

ICLRLarge Language ModelsLow-Resource Training

0 likes · 10 min read

Dynamic Margin Selection for Efficient Deep Learning and Low-Resource Large Model Training

Tencent Technical Engineering

Mar 5, 2025 · Information Security

Detecting Critical AI Infrastructure Vulnerabilities with AI-Infra-Guard

As open‑source large language model tools like Ollama, OpenWebUI and ComfyUI gain popularity, numerous security flaws such as unauthenticated APIs, CVE‑exploits, model theft and remote code execution emerge, prompting the development of AI‑Infra‑Guard—a lightweight, cross‑platform scanner that identifies over 30 component vulnerabilities and offers both web UI and CLI modes for rapid risk assessment.

AI securityAI-Infra-GuardCVE

0 likes · 13 min read

Detecting Critical AI Infrastructure Vulnerabilities with AI-Infra-Guard

Architects' Tech Alliance

Mar 5, 2025 · Industry Insights

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

The article analyzes DeepSeek's recent releases—V3 dialogue model and R1 inference model—detailing their launch dates, rapid popularity surge, R1's reinforcement‑learning‑based design for code and math tasks, and provides links to related Beijing University technical reports while stripping promotional sales content.

AIDeepSeekLarge Language Models

0 likes · 3 min read

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

AntTech

Mar 4, 2025 · Artificial Intelligence

GraphCLIP and 2D‑TPE: Enhancing Transferability of Graph Models and Table Understanding for Large Language Models

This article introduces GraphCLIP, a self‑supervised graph‑summary pre‑training framework that boosts zero‑ and few‑shot transferability of graph foundation models for text‑attributed graphs, and 2D‑TPE, a two‑dimensional positional encoding method that preserves table structure to markedly improve large language model performance on table‑understanding tasks, while also announcing a live paper session at WWW 2025 featuring the authors.

Large Language ModelsPositional EncodingSelf‑Supervised Learning

0 likes · 6 min read

GraphCLIP and 2D‑TPE: Enhancing Transferability of Graph Models and Table Understanding for Large Language Models

JD Retail Technology

Feb 28, 2025 · Artificial Intelligence

Generative Recommendation with DPO Alignment for JD Alliance Advertising: Multi‑Objective Optimization and Online Results

The paper presents a generative recommendation framework for JD Alliance advertising that combines semantic‑ID modeling, large‑model pre‑training and fine‑tuning, and Direct Preference Optimization (including Softmax‑DPO and β‑DPO) to jointly boost click‑through and conversion rates, achieving +0.6% UCTR and +8% UCVR in online tests while outlining future multi‑objective extensions.

AdvertisingDPOGenerative Recommendation

0 likes · 12 min read

Generative Recommendation with DPO Alignment for JD Alliance Advertising: Multi‑Objective Optimization and Online Results

Architect

Feb 27, 2025 · Artificial Intelligence

Understanding Inference Large Language Models: DeepSeek‑R1 and the Rise of Test‑Time Computation

This article explains how inference‑oriented large language models such as DeepSeek‑R1 and OpenAI o1‑mini shift AI research from training‑time scaling to test‑time computation, detailing the underlying principles, new scaling laws, verification techniques, reinforcement‑learning pipelines, and practical methods for distilling reasoning capabilities into smaller models.

DeepSeek-R1Large Language Modelsinference

0 likes · 18 min read

Understanding Inference Large Language Models: DeepSeek‑R1 and the Rise of Test‑Time Computation

Code Mala Tang

Feb 27, 2025 · Artificial Intelligence

Do New AI Reasoning Models Really Think? Unpacking the Debate

The article examines whether the latest AI models that claim to perform true reasoning—by breaking problems into steps and using chain‑of‑thought—actually reason like humans, presenting skeptical and supportive expert viewpoints, and offering practical guidance on how to use such models responsibly.

AI reasoningAI safetyLarge Language Models

0 likes · 14 min read

Do New AI Reasoning Models Really Think? Unpacking the Debate

DataFunSummit

Feb 26, 2025 · Artificial Intelligence

Applying Multimodal Large Models to Music Recommendation at NetEase Cloud Music

This article details how NetEase Cloud Music leverages multimodal large language models to improve music recommendation across daily, personalized, and playlist scenarios by extracting rich audio, text, and visual features, addressing data skew, cold‑start challenges, and achieving measurable gains in user engagement and distribution efficiency.

Large Language ModelsNetEase Cloud Musicfeature extraction

0 likes · 12 min read

Applying Multimodal Large Models to Music Recommendation at NetEase Cloud Music

AntTech

Feb 26, 2025 · Artificial Intelligence

Ant Group’s 18 Accepted Papers at AAAI 2025: Summaries and Highlights

This article presents concise English summaries of the 18 Ant Group papers accepted at AAAI 2025, covering topics such as privacy‑preserving large‑model tuning, knowledge‑graph integration, AI‑generated image detection, multi‑task learning, generative retrieval, role‑playing evaluation, and video hallucination mitigation.

AAAI 2025AI evaluationGenerative Retrieval

0 likes · 29 min read

Ant Group’s 18 Accepted Papers at AAAI 2025: Summaries and Highlights

Ops Development & AI Practice

Feb 25, 2025 · Artificial Intelligence

What Is Hybrid Reasoning in Claude 3.7 Sonnet and Why It Matters

Hybrid reasoning lets Claude 3.7 Sonnet dynamically switch between fast, intuition‑like answers and step‑by‑step, deep analysis, improving both speed and accuracy for tasks ranging from simple code snippets to complex algorithm design, and signals a broader shift in large language model capabilities.

AI reasoningClaude 3.7Hybrid Reasoning

0 likes · 9 min read

What Is Hybrid Reasoning in Claude 3.7 Sonnet and Why It Matters

21CTO

Feb 25, 2025 · Artificial Intelligence

How Alibaba’s Qwen 2.5‑Max Challenges GPT‑4o and Redefines China’s AI Race

Chinese tech giants Huawei and Alibaba respond to President Xi’s call for stronger innovation, with Huawei showcasing its HarmonyOS and server‑grade Arm processor while Alibaba unveils the Qwen 2.5‑Max large language model that outperforms leading Western AI systems on multiple benchmarks, highlighting China’s accelerating AI ambitions.

AIAlibabaChina

0 likes · 5 min read

How Alibaba’s Qwen 2.5‑Max Challenges GPT‑4o and Redefines China’s AI Race

Architecture Digest

Feb 25, 2025 · Artificial Intelligence

DeepSeek Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

DeepSeek’s distillation technology combines data and model distillation to transfer knowledge from large teacher models to compact student models, detailing its definitions, principles, key innovations, architecture, training methods, performance gains, and challenges, especially in multimodal contexts.

AI researchDeepSeekKnowledge Distillation

0 likes · 16 min read

DeepSeek Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

21CTO

Feb 24, 2025 · Artificial Intelligence

From Transformers to DeepSeek-R1: Evolution of Large Language Models

Since the 2017 introduction of the Transformer architecture, this article chronicles the rapid development of large language models—including BERT, GPT series, multimodal systems, and the cost‑effective DeepSeek‑R1—highlighting key innovations, scaling trends, alignment techniques, and their transformative impact across AI research and industry.

AI evolutionDeepSeekLLM History

0 likes · 23 min read

From Transformers to DeepSeek-R1: Evolution of Large Language Models

Architects' Tech Alliance

Feb 24, 2025 · Artificial Intelligence

NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington

The NSA mechanism introduces a three‑branch hardware‑optimized sparse attention architecture—token compression, token selection, and sliding window—combined with learnable gating to balance global and local context, dramatically improving inference speed and efficiency for long‑context large language models.

AI ArchitectureDeepSeekLarge Language Models

0 likes · 5 min read

NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington

Su San Talks Tech

Feb 23, 2025 · Artificial Intelligence

How DeepSeek’s Distillation Breaks AI Model Limits: Core Principles & Performance

This article explores DeepSeek’s cutting‑edge distillation technology, detailing its definition, underlying principles, innovative data‑model fusion, architecture choices, training strategies, performance gains over large language models, and the remaining challenges in knowledge transfer and multimodal data processing.

DeepSeekKnowledge DistillationLarge Language Models

0 likes · 16 min read

How DeepSeek’s Distillation Breaks AI Model Limits: Core Principles & Performance

Model Perspective

Feb 22, 2025 · Artificial Intelligence

Why DeepSeek Is Gaining Traction Beyond ChatGPT: Insights from the Global Developers Conference

The article examines DeepSeek’s surge in popularity by analyzing its timely release, cost‑effective performance, open‑source approach, and broader AI ecosystem trends, while also sharing expert predictions and practical coding tool recommendations for developers.

AI predictionsAI trendsDeepSeek

0 likes · 5 min read

Why DeepSeek Is Gaining Traction Beyond ChatGPT: Insights from the Global Developers Conference

Architect

Feb 19, 2025 · Artificial Intelligence

Does Scaling Law Still Hold for Grok 3? A Deep Dive into LLM Training Economics

The article critically examines whether the pre‑training Scaling Law still applies to Grok 3, compares its compute usage and model size with DeepSeek and OpenAI models, evaluates the cost‑effectiveness of pre‑training, RL and test‑time scaling, and explores how these insights shape future large‑language‑model development strategies.

Grok-3Large Language ModelsPre‑training

0 likes · 11 min read

Does Scaling Law Still Hold for Grok 3? A Deep Dive into LLM Training Economics

AI Algorithm Path

Feb 19, 2025 · Artificial Intelligence

How Temperature Shapes Output in Large Language Models

The article explains the Temperature hyper‑parameter in large language models, shows how it modifies the softmax distribution, provides a Python visualisation script, and demonstrates through experiments that higher values increase creativity while lower values make outputs more deterministic.

Large Language ModelsPythonSampling

0 likes · 5 min read

How Temperature Shapes Output in Large Language Models

Alibaba Cloud Developer

Feb 19, 2025 · Artificial Intelligence

How to Replicate DeepSeek‑R1’s Thought Process on Claude 3.5 Sonnet with Prompt Engineering

The article explains how to use prompt‑engineering techniques on Claude 3.5 Sonnet to mimic DeepSeek‑R1’s transparent reasoning, detailing background, prompt design, iterative optimization, and the broader impact on AI communication and user expression.

AI reasoningClaudeDeepSeek

0 likes · 25 min read

How to Replicate DeepSeek‑R1’s Thought Process on Claude 3.5 Sonnet with Prompt Engineering

Code Mala Tang

Feb 19, 2025 · Artificial Intelligence

Compute Power’s Role in the AI Race: Insights from Grok 3, DeepSeek & the Post‑Training Era

The article analyzes how massive compute resources drive AI breakthroughs, highlighting Grok 3's top‑tier performance, DeepSeek's efficient engineering under constraints, and the emerging post‑training paradigm that reshapes competition among major AI players.

AI scalingDeepSeekGrok-3

0 likes · 7 min read

Compute Power’s Role in the AI Race: Insights from Grok 3, DeepSeek & the Post‑Training Era

DataFunTalk

Feb 19, 2025 · Artificial Intelligence

Large Models: Concepts, Principles, Classifications and Applications

This report provides a comprehensive overview of large-scale AI models, explaining their definition, massive parameter and data requirements, underlying transformer architecture, classification into language, vision and multimodal models, notable examples such as DeepSeek, and a survey of popular AIGC tools and practical use cases.

AIGC toolsLarge Language ModelsLarge Models

0 likes · 9 min read

Large Models: Concepts, Principles, Classifications and Applications

Architects' Tech Alliance

Feb 19, 2025 · Industry Insights

Why DeepSeek One‑Stop AI Machines Are Redefining Private Model Deployment

The surge in demand for private AI deployment has prompted multiple vendors to launch DeepSeek one‑stop machines—integrated hardware solutions that support the full DeepSeek model family, offering higher stability, easier setup, customization, cost savings, and data security across diverse industry scenarios.

AI hardwareAI infrastructureDeepSeek

0 likes · 7 min read

Why DeepSeek One‑Stop AI Machines Are Redefining Private Model Deployment

Tencent Cloud Developer

Feb 19, 2025 · Industry Insights

Why Every Enterprise Needs a Knowledge‑Management System in the LLM Era

The article analyzes how the shift from data‑driven to knowledge‑driven operations, powered by large language models like DeepSeek, forces companies to build dynamic knowledge‑management platforms that integrate personal and corporate knowledge, improve efficiency, and create sustainable competitive advantage.

DeepSeekEnterprise AILarge Language Models

0 likes · 14 min read

Why Every Enterprise Needs a Knowledge‑Management System in the LLM Era

Architects' Tech Alliance

Feb 18, 2025 · Artificial Intelligence

How DeepSeek’s Latest Models Redefine AI Performance and Industry Adoption

The DeepSeek report details rapid model releases from 2024 onward, highlighting innovations such as model distillation, a 671 B MoE architecture, FP8 mixed‑precision, and the Janus‑Pro multimodal framework, while also documenting major cloud and chip providers' integration of these models into their services.

AI industry adoptionDeepSeekLarge Language Models

0 likes · 10 min read

How DeepSeek’s Latest Models Redefine AI Performance and Industry Adoption

Mingyi World Elasticsearch

Feb 18, 2025 · Artificial Intelligence

Master Prompt Engineering for DeepSeek and ChatGPT‑4o: Essential Techniques

This guide explains the fundamentals of prompt engineering for large language models such as DeepSeek and ChatGPT‑4o, illustrating clear‑prompt design, giving models time to think, chaining prompts, iterative refinement, and advanced tricks with concrete good and bad examples.

AIChatGPT-4oDeepSeek

0 likes · 12 min read

Master Prompt Engineering for DeepSeek and ChatGPT‑4o: Essential Techniques

DataFunTalk

Feb 18, 2025 · Artificial Intelligence

CODEI/O: Leveraging Code to Train Large Language Models for Enhanced Reasoning

The DeepSeek team introduced CODEI/O, a massive dataset that converts code into natural‑language reasoning chains, and demonstrated that training large language models on this data markedly improves their performance on diverse inference tasks, including non‑code domains, through a two‑stage training strategy.

CODEI/OLarge Language Modelscode reasoning

0 likes · 8 min read

CODEI/O: Leveraging Code to Train Large Language Models for Enhanced Reasoning

Cognitive Technology Team

Feb 18, 2025 · Artificial Intelligence

Two Major Bottlenecks in Deploying Large Language Models: Machine Deception and Hallucination

Deploying large language models faces two critical challenges—machine deception, where AI generates plausible yet false content, and machine hallucination, where outputs are logically coherent but factually inaccurate—both undermining trust, and the article outlines their causes, impacts, and technical, ethical, and regulatory mitigation strategies.

Artificial IntelligenceLarge Language ModelsMachine Deception

0 likes · 6 min read

Two Major Bottlenecks in Deploying Large Language Models: Machine Deception and Hallucination

Architect's Alchemy Furnace

Feb 17, 2025 · Artificial Intelligence

24 Proven Prompt Formulas to Unlock DeepSeek’s Full Potential

Discover a comprehensive collection of 24 structured prompting techniques—from basic role‑play formulas to advanced cross‑disciplinary and managerial frameworks—designed to help users of DeepSeek and other large language models craft precise, high‑impact queries that dramatically improve response quality and efficiency.

AI promptingDeepSeekLarge Language Models

0 likes · 12 min read

24 Proven Prompt Formulas to Unlock DeepSeek’s Full Potential

Baobao Algorithm Notes

Feb 17, 2025 · Artificial Intelligence

Can TransMLA Turn GQA into a More Powerful MLA? A Deep Dive into DeepSeek Models

This article presents a theoretical and experimental analysis of converting Group Query Attention (GQA) models to Multi‑Head Linear Attention (MLA) using the TransMLA method, demonstrating superior expressiveness and performance on DeepSeek‑based large language models while keeping KV‑Cache costs unchanged.

DeepSeekLarge Language ModelsMLA

0 likes · 11 min read

Can TransMLA Turn GQA into a More Powerful MLA? A Deep Dive into DeepSeek Models

Java Architecture Diary

Feb 17, 2025 · Artificial Intelligence

What Is LLMs.txt? The New AI‑Friendly Web Standard Explained

LLMs.txt is a lightweight, AI‑optimized web standard that provides concise Markdown navigation files for large language models, addressing context limits, redundant content, and lack of structure, and is already adopted by companies like Mintlify, Anthropic, and Cursor.

AI StandardsAI indexingLarge Language Models

0 likes · 6 min read

What Is LLMs.txt? The New AI‑Friendly Web Standard Explained

Fun with Large Models

Feb 16, 2025 · Artificial Intelligence

Can You Claim to Know Large Models? Guide to Distillation, Quantization & Fine‑Tuning

This article explains why the massive DeepSeek V3/R1 model (671 B parameters) is hard to deploy and introduces three key techniques—model distillation, quantization, and fine‑tuning—that can shrink, accelerate, or specialize large models, while outlining their trade‑offs and practical steps.

AI model compressionDeepSeekLarge Language Models

0 likes · 10 min read

Can You Claim to Know Large Models? Guide to Distillation, Quantization & Fine‑Tuning

Architects' Tech Alliance

Feb 16, 2025 · Artificial Intelligence

How DeepSeek’s Distillation Breaks Bottlenecks and Boosts Multimodal AI Performance

This article provides an in‑depth technical analysis of DeepSeek’s model distillation technology, covering its core principles, innovative data‑model fusion strategies, architecture design, training optimizations, performance benchmarks, and the remaining challenges of scaling distillation to multimodal tasks.

DeepSeekLarge Language ModelsMultimodal

0 likes · 16 min read

How DeepSeek’s Distillation Breaks Bottlenecks and Boosts Multimodal AI Performance

Lao Guo's Learning Space

Feb 15, 2025 · Artificial Intelligence

What Is deepseek-MoE? Understanding the Mixture‑of‑Experts Architecture

The article explains deepseek-MoE (Mixture of Experts), describing its full English name, Chinese translation, how a gating network selects and weights multiple expert models for each input, and uses an analogy to illustrate load‑balancing and the divide‑and‑conquer design in large AI models.

AI ArchitectureLarge Language ModelsMixture of Experts

0 likes · 2 min read

What Is deepseek-MoE? Understanding the Mixture‑of‑Experts Architecture

Ops Development & AI Practice

Feb 14, 2025 · Artificial Intelligence

Large Model Format Showdown: Hugging Face, TensorFlow, ONNX, TorchScript, GGUF

This comprehensive guide examines the leading large‑model storage formats—including Hugging Face Transformers, TensorFlow SavedModel, ONNX, TorchScript, and GGUF—detailing their file structures, serialization methods, strengths, weaknesses, and typical use‑cases, helping developers and researchers select the optimal format for their specific AI workloads.

AI deploymentGGUFLarge Language Models

0 likes · 21 min read

Large Model Format Showdown: Hugging Face, TensorFlow, ONNX, TorchScript, GGUF

DataFunSummit

Feb 14, 2025 · Artificial Intelligence

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

This presentation details how Alibaba Cloud's AI platform integrates big‑data pipelines, feature‑store services, and large language model capabilities to construct high‑performance search‑recommendation architectures, covering system design, training and inference optimizations, LLM‑driven use cases, and open‑source RAG tooling.

AI PlatformBig DataFeature Store

0 likes · 17 min read

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

Top Architect

Feb 14, 2025 · Artificial Intelligence

DeepSeek Model Distillation: Principles, Innovations, Architecture, and Performance

This article provides an in‑depth overview of DeepSeek’s model distillation technology, covering its definition, core principles, innovative data‑model distillation integration, architecture design, training strategies, performance gains, and the challenges of scaling to multimodal data.

DeepSeekKnowledge TransferLarge Language Models

0 likes · 16 min read

DeepSeek Model Distillation: Principles, Innovations, Architecture, and Performance

Code Mala Tang

Feb 13, 2025 · Artificial Intelligence

Why Apple Chose Alibaba: Inside the AI Partnership Reshaping China’s Smartphone Market

Apple’s steep sales decline in China has driven it to partner with Alibaba’s Qwen AI platform, a move that blends cutting‑edge large‑model technology, cloud scalability, and local compliance to revive iPhone market share and showcase China’s rising AI prowess.

AI partnershipAlibabaApple

0 likes · 11 min read

Why Apple Chose Alibaba: Inside the AI Partnership Reshaping China’s Smartphone Market

Ma Wei Says

Feb 13, 2025 · Artificial Intelligence

Master AI Prompting: 5 Proven Techniques to Unlock Accurate Outputs

This guide presents five practical prompting techniques—including structured output, role‑playing, visual conversion, multi‑turn refinement, and multilingual handling—plus industry‑specific examples and common pitfalls, helping users craft precise commands for AI models like DeepSeek.

AI promptingLarge Language ModelsPrompt Engineering

0 likes · 8 min read

Master AI Prompting: 5 Proven Techniques to Unlock Accurate Outputs

Architect

Feb 12, 2025 · Artificial Intelligence

Can S‑Curve Theory Explain the Limits of Large‑Model Scaling Laws?

The article analyses how S‑shaped growth curves can model the apparent scaling laws of large language models, discusses the three phases of model development, proposes an ability‑density hypothesis, and explores future scenarios where scaling laws may plateau or shift.

AI growthAbility DensityLarge Language Models

0 likes · 16 min read

Can S‑Curve Theory Explain the Limits of Large‑Model Scaling Laws?

Architect

Feb 12, 2025 · Artificial Intelligence

Master Prompt Engineering: A Universal Framework for LLMs

This article presents a comprehensive, step‑by‑step Prompt engineering framework—including role definition, problem description, goal setting, and requirement specification—augmented with techniques such as RAG, few‑shot examples, memory handling, and parameter tuning, enabling users to craft effective prompts for large language models across domains.

AI Prompt OptimizationFew-shotLarge Language Models

0 likes · 27 min read

Master Prompt Engineering: A Universal Framework for LLMs

AIWalker

Feb 11, 2025 · Artificial Intelligence

LLMDet: LLM‑Powered Open‑Vocabulary Detector Beats Grounding DINO

LLMDet introduces a novel training pipeline that leverages large language models to generate detailed image‑level captions and region‑level phrases, fine‑tunes an open‑vocabulary detector with the GroundingCap‑1M dataset, and achieves state‑of‑the‑art zero‑shot performance surpassing Grounding DINO across multiple benchmarks.

GroundingCapLLMDetLarge Language Models

0 likes · 20 min read

LLMDet: LLM‑Powered Open‑Vocabulary Detector Beats Grounding DINO

DataFunTalk

Feb 11, 2025 · Artificial Intelligence

Roundtable on Enhancing Large Model Effectiveness: RAG, Tool Use, and Knowledge Engineering

Experts from Dipu, Ant Financial, iKang, and Zhihu discuss practical strategies for improving large model performance, covering RAG, tool‑using, offline knowledge engineering, multimodal training, evaluation metrics, and future trends, while sharing case studies from manufacturing, healthcare, retail, and C‑end applications.

Knowledge EngineeringLarge Language ModelsRAG

0 likes · 9 min read

Roundtable on Enhancing Large Model Effectiveness: RAG, Tool Use, and Knowledge Engineering

Cognitive Technology Team

Feb 10, 2025 · Artificial Intelligence

Survey of Major Chinese AI Large Language Models: Technologies, Innovations, and Comparative Evaluation

This report systematically reviews the key technologies, innovations, and performance of leading Chinese AI large language models—including DeepSeek, Kimi, and Qwen2.5—detailing their architectures, training methods, multimodal capabilities, and comparative evaluations against each other and foreign models.

AIChinaLarge Language Models

0 likes · 20 min read

Survey of Major Chinese AI Large Language Models: Technologies, Innovations, and Comparative Evaluation

AI Algorithm Path

Feb 10, 2025 · Artificial Intelligence

Understanding DualPipe: DeepDive into DeepSeek‑R1 Architecture (Part 5)

This article explains how the DualPipe scheduling mechanism in DeepSeek‑R1 improves GPU cluster compute‑communication efficiency by using fine‑grained pipeline stages and bidirectional data flow, comparing it with Zero Bubble pipeline parallelism and discussing the challenges of large‑scale distributed training.

DeepSeekDualPipeLarge Language Models

0 likes · 10 min read

Understanding DualPipe: DeepDive into DeepSeek‑R1 Architecture (Part 5)

IT Architects Alliance

Feb 10, 2025 · Artificial Intelligence

DeepSeek Distillation Technology: Principles, Innovations, Performance, and Future Outlook

The article explains DeepSeek's model distillation technique, covering its fundamental knowledge‑transfer principles, unique innovations such as data‑model fusion and task‑specific strategies, impressive benchmark results, practical applications in edge and online inference, existing challenges, and future research directions.

Knowledge TransferLarge Language Modelsai-optimization

0 likes · 15 min read

DeepSeek Distillation Technology: Principles, Innovations, Performance, and Future Outlook

Baidu Geek Talk

Feb 10, 2025 · Artificial Intelligence

How Baidu Cloud Slashes Inference Costs: DeepSeek Model Optimizations Unveiled

Baidu Cloud's Qianfan platform launched DeepSeek‑R1 and DeepSeek‑V3 with ultra‑low inference pricing, leveraging advanced engine performance tweaks, a split Prefill/Decode architecture, and comprehensive security measures that together boost throughput, cut costs, and ensure enterprise‑grade reliability.

AI inferenceBaidu CloudLarge Language Models

0 likes · 5 min read

How Baidu Cloud Slashes Inference Costs: DeepSeek Model Optimizations Unveiled

Architects' Tech Alliance

Feb 10, 2025 · Artificial Intelligence

Why DeepSeek Is Disrupting the Global AI Landscape: Tech, Cost, and Open‑Source Edge

DeepSeek, a Chinese AI startup, has rapidly risen to global prominence by releasing high‑performance large language models such as V2, V3, and R1, which combine innovative architectures, dramatically lower training costs, and an open‑source strategy that challenges established AI giants and reshapes industry dynamics.

Artificial IntelligenceChina AIDeepSeek

0 likes · 14 min read

Why DeepSeek Is Disrupting the Global AI Landscape: Tech, Cost, and Open‑Source Edge