Tagged articles
5000 articles
Page 23 of 50
AntData
AntData
Apr 3, 2025 · Artificial Intelligence

Ray Flow Insight: Visualizing and Debugging Distributed AI Applications

Ray Flow Insight is an Ant Group open‑source tool that visualizes Ray's distributed programming primitives—Actors, Tasks, and Objects—to turn complex reinforcement‑learning systems from opaque "black boxes" into transparent, debuggable workflows, providing logical, physical, distributed stack, and flame‑graph views for performance analysis and optimization.

AIDebuggingDistributed Systems
0 likes · 32 min read
Ray Flow Insight: Visualizing and Debugging Distributed AI Applications
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 3, 2025 · Artificial Intelligence

Build a Text‑and‑Image Article with Alibaba Cloud AI Custom Plugin in 5 Steps

This tutorial shows how to use Alibaba Cloud's Baileian platform to create a workflow that generates a Xiaohongshu‑style article together with matching images by leveraging a custom large‑model plugin, Python script nodes, and image‑generation tools, complete with step‑by‑step configuration and code examples.

AIPythonimage generation
0 likes · 14 min read
Build a Text‑and‑Image Article with Alibaba Cloud AI Custom Plugin in 5 Steps
Architects' Tech Alliance
Architects' Tech Alliance
Apr 3, 2025 · Artificial Intelligence

Why NVLink and NVSwitch Are Essential for Training Massive AI Models

Training today's massive AI foundation models demands extensive GPU resources and sophisticated multi‑GPU communication, making technologies like NVLink and NVSwitch crucial for efficient distributed training, while data‑parallel and model‑parallel strategies together optimize performance across large‑scale hardware clusters.

AIGPUNVLink
0 likes · 8 min read
Why NVLink and NVSwitch Are Essential for Training Massive AI Models
21CTO
21CTO
Apr 2, 2025 · Artificial Intelligence

Can AI Shrink the Workweek to Two Days? Bill Gates' Bold Prediction

Bill Gates predicts that rapid AI advances could automate most tasks, potentially reducing the standard workweek to just two days within a decade, sparking debate about productivity, burnout, and which professions—like software developers, biologists, and energy experts—might survive the AI revolution.

AIBill GatesFuture of Work
0 likes · 6 min read
Can AI Shrink the Workweek to Two Days? Bill Gates' Bold Prediction
DataFunTalk
DataFunTalk
Apr 2, 2025 · Artificial Intelligence

Trends, Applications, and Future Directions of Large Models and Inference Acceleration

This article examines the current state and future prospects of large AI models and inference acceleration, covering technology trends, diverse application scenarios from research to industry, and the challenges and opportunities that lie ahead for intelligent data governance, multimodal agents, and AGI.

AGIAIInference Acceleration
0 likes · 11 min read
Trends, Applications, and Future Directions of Large Models and Inference Acceleration
Programmer Xu Shu
Programmer Xu Shu
Apr 2, 2025 · Backend Development

Boost Java Productivity 40% with Cursor IDE: A Complete Setup Guide

This article walks Java developers through configuring the Cursor IDE, essential plugins, advanced JDK and Maven settings, AI-powered coding assistance, shortcut tips, and best‑practice workflows that together can increase daily development efficiency by roughly forty percent.

AICursorDevelopment
0 likes · 13 min read
Boost Java Productivity 40% with Cursor IDE: A Complete Setup Guide
Tencent Cloud Developer
Tencent Cloud Developer
Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIHallucination MitigationKnowledge retrieval
0 likes · 9 min read
Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development
Java Architecture Diary
Java Architecture Diary
Apr 2, 2025 · Artificial Intelligence

Run AI Models Locally with Docker Model Runner and Java Integration

This article explains how Docker Model Runner enables effortless local execution of AI models, details platform support, provides a full command reference, shows how to use the REST endpoint, and demonstrates integration with Java via LangChain4j, including code examples and a feature comparison with Ollama.

AIDockerLangChain4j
0 likes · 9 min read
Run AI Models Locally with Docker Model Runner and Java Integration
Huolala Tech
Huolala Tech
Apr 1, 2025 · Frontend Development

How Frontend Teams Can Leverage LLMs for Real‑Time Compliance Checks

This article explains how frontend developers can use large language models to detect and prevent marketing content violations in WeChat mini‑programs, covering pain‑point discovery, LLM‑driven compliance architecture, prompt optimization, model selection, testing methods, and seamless frontend integration with Feishu notifications.

AILLMintegration
0 likes · 10 min read
How Frontend Teams Can Leverage LLMs for Real‑Time Compliance Checks
AntTech
AntTech
Apr 1, 2025 · Artificial Intelligence

AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance

The Ant Research Institute and Tsinghua University's Wu Yi team released AReaL‑boba 0.2, an open‑source reinforcement‑learning training framework that dramatically speeds up large‑scale model training, achieves state‑of‑the‑art mathematical reasoning results, and provides all code, data, and scripts for reproducible research.

AITraining Frameworklarge models
0 likes · 5 min read
AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance
Architect
Architect
Mar 31, 2025 · Artificial Intelligence

A Comprehensive Study of Failure Modes in Large‑Language‑Model Based Multi‑Agent Systems

This paper presents a systematic investigation of failure patterns in LLM‑driven multi‑agent systems, introducing a 14‑type taxonomy (MASFT) derived from over 150 annotated dialogues, evaluating it with an LLM‑as‑a‑judge pipeline, and exploring modest intervention strategies while releasing all data and tools for future research.

AILLMagentic
0 likes · 29 min read
A Comprehensive Study of Failure Modes in Large‑Language‑Model Based Multi‑Agent Systems
Cognitive Technology Team
Cognitive Technology Team
Mar 31, 2025 · Artificial Intelligence

Understanding Douyin's Recommendation Algorithm: From Behavior Prediction to Value Modeling

The article explains how Douyin's recommendation system uses machine‑learning and deep‑learning models to predict user actions, assign value weights, and dynamically adjust scores, highlighting both its efficiency in large‑scale content distribution and its inherent limitations compared to human understanding.

AIDeep Learningrecommendation system
0 likes · 7 min read
Understanding Douyin's Recommendation Algorithm: From Behavior Prediction to Value Modeling
Architects' Tech Alliance
Architects' Tech Alliance
Mar 29, 2025 · Industry Insights

Why Network Becomes the New Bottleneck for AI Training and How InfiniBand vs RoCE Compare

AI large‑model training relies on GPU clusters, generating massive inter‑node traffic that turns network performance into the primary bottleneck, prompting a detailed comparison of InfiniBand and RoCE protocols, their histories, strengths, limitations, and the need for next‑generation network chip architectures.

AIData CenterHPC
0 likes · 5 min read
Why Network Becomes the New Bottleneck for AI Training and How InfiniBand vs RoCE Compare
Architects' Tech Alliance
Architects' Tech Alliance
Mar 29, 2025 · Artificial Intelligence

What GTC2025 Reveals About AI's Next Leap: Agentic AI, Blackwell GPUs, and Emerging Technologies

At GTC2025 NVIDIA showcased the transition from Generative to Agentic AI, introduced the Blackwell GPU family with massive compute gains, unveiled the open‑source Dynamo inference OS, announced the upcoming Blackwell Ultra chip, VeraRubin roadmap, and highlighted advances in silicon‑photonic switches, robotics and quantum computing.

AIBlackwell GPUDynamo
0 likes · 7 min read
What GTC2025 Reveals About AI's Next Leap: Agentic AI, Blackwell GPUs, and Emerging Technologies
Alimama Tech
Alimama Tech
Mar 28, 2025 · Artificial Intelligence

How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding

Alibaba’s Taobao Group unveiled its AIGX technology suite, including the RecGPT recommendation model, the AIGB generative bidding system, and a new AI‑generated video engine, detailing open‑source benchmarks, NeurIPS workshop participation, and measurable ROI improvements for e‑commerce advertising.

AIGenerative BiddingLarge Language Models
0 likes · 5 min read
How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding
Data Thinking Notes
Data Thinking Notes
Mar 27, 2025 · Artificial Intelligence

How DeepSeek AI Is Revolutionizing Government Services and Operations

DeepSeek's large language model is reshaping government work by enabling intelligent public services, streamlining office processes, enhancing city governance, and offering proactive policy push, smart hall assistants, knowledge management, automated workflows, and advanced risk‑warning systems, all backed by real‑world case studies and measurable impact metrics.

AIDigital Transformationpublic services
0 likes · 21 min read
How DeepSeek AI Is Revolutionizing Government Services and Operations
Liangxu Linux
Liangxu Linux
Mar 27, 2025 · Databases

How Chat2DB Uses AI to Simplify Database Management and SQL Generation

Chat2DB is an open‑source AI‑enhanced database client that turns natural language into SQL, auto‑generates table schemas and test data, offers a smart editor, visual chart creation, Excel analysis, and supports multi‑platform installation for dozens of databases.

AIChat2DBDatabase Management
0 likes · 7 min read
How Chat2DB Uses AI to Simplify Database Management and SQL Generation
AIWalker
AIWalker
Mar 27, 2025 · Artificial Intelligence

MagicColor: First Multi‑Instance AI Sketch‑Coloring System for Professional‑Grade Comics

MagicColor introduces a novel multi‑instance sketch‑coloring framework that uses a two‑stage self‑play training strategy, instance guidance, and edge‑aware pixel‑level color matching to automatically produce high‑quality, consistent colors for multiple line‑art instances, outperforming prior GAN and diffusion‑based methods.

AIMulti-InstanceSketch Colorization
0 likes · 16 min read
MagicColor: First Multi‑Instance AI Sketch‑Coloring System for Professional‑Grade Comics
ByteDance Cloud Native
ByteDance Cloud Native
Mar 27, 2025 · Operations

Taming High Cardinality in AI & Autonomous Driving with Prometheus

This article shares practical experience from Volcengine's managed Prometheus service and its deep integration with large‑model and autonomous‑driving platforms, explaining what high cardinality is, its impact on monitoring systems, root causes, and a range of design, collection, and analysis techniques to mitigate it.

AIAutonomous DrivingObservability
0 likes · 12 min read
Taming High Cardinality in AI & Autonomous Driving with Prometheus
Qborfy AI
Qborfy AI
Mar 27, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide

This article walks through the entire process of deploying the DeepSeek‑R1 large language model on a personal machine, covering hardware requirements, Ollama installation, model download, service startup, remote access configuration, and visual UI integration with Dify, complete with concrete commands and screenshots.

AIDeepSeekDocker
0 likes · 9 min read
How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide
AntTech
AntTech
Mar 27, 2025 · Artificial Intelligence

LMM‑R1: A Two‑Stage Reinforcement Learning Framework for Enhancing Multimodal Model Reasoning

Researchers from Ant Group, Southeast University and others introduced the open‑source LMM‑R1 framework, a two‑stage reinforcement‑learning approach that first strengthens textual reasoning and then generalizes it to multimodal tasks, achieving significant performance gains on benchmarks such as football, Sokoban, and geometry reasoning with modest GPU costs.

AI
0 likes · 8 min read
LMM‑R1: A Two‑Stage Reinforcement Learning Framework for Enhancing Multimodal Model Reasoning
37 Interactive Technology Team
37 Interactive Technology Team
Mar 26, 2025 · Artificial Intelligence

LUI vs GUI: Choosing the Right Interface for AI Product Design

When designing AI products, choosing between a Language User Interface—leveraging speech recognition, NLP, and conversational flexibility—and a Graphical User Interface—relying on visual icons, layouts, and intuitive interaction—depends on technology maturity, response speed, and user learning cost, while emerging multimodal designs increasingly blend both for richer, context‑aware experiences.

AIGUIInteraction
0 likes · 11 min read
LUI vs GUI: Choosing the Right Interface for AI Product Design
Java Architecture Diary
Java Architecture Diary
Mar 26, 2025 · Artificial Intelligence

How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling

The article introduces DeepSeek's new V3-0324 model, highlights its performance gains and new features like function calling and standardized JSON output, demonstrates Chinese and frontend coding tests, provides Java code examples for AI integration, and concludes with a summary of its business impact.

AIChat2BIDeepSeek
0 likes · 6 min read
How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling
AI Frontier Lectures
AI Frontier Lectures
Mar 25, 2025 · Artificial Intelligence

Can Mixed‑Modality Graphs Unlock Precise 3D Indoor Scene Generation?

MMGDreamer introduces a mixed‑modality graph and a dual‑branch diffusion model that jointly enhance geometric control and realism in 3D indoor scene synthesis, outperforming state‑of‑the‑art methods across multiple quantitative and qualitative benchmarks.

3D scene generationAIcomputer vision
0 likes · 12 min read
Can Mixed‑Modality Graphs Unlock Precise 3D Indoor Scene Generation?
21CTO
21CTO
Mar 25, 2025 · Artificial Intelligence

Which LLM Is Best for Coding? Speed, Hallucination, and Context Compared

This article breaks down major large language models, defining key comparison metrics such as speed, hallucination rate, and context window, then evaluates each model with benchmarks like HumanEval+, ChatBot Arena, and Aider to help you choose the most suitable LLM for your coding tasks.

AIBenchmarkLLM
0 likes · 10 min read
Which LLM Is Best for Coding? Speed, Hallucination, and Context Compared
Java Architect Essentials
Java Architect Essentials
Mar 25, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Framework for Seamless DeepSeek AI Integration with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑centric framework that overcomes Spring AI’s limitations by preserving DeepSeek’s chain‑of‑thought, supporting streaming output, and offering a simple Spring Boot starter with reactive, configurable, and ready‑to‑use APIs for AI developers.

AIDeepSeekJava
0 likes · 5 min read
DeepSeek4j 1.4: A Java Framework for Seamless DeepSeek AI Integration with Full Chain‑of‑Thought and Streaming Support
JD Retail Technology
JD Retail Technology
Mar 25, 2025 · Artificial Intelligence

2024 Advances in Advertising Creative Generation and Selection

In 2024 the advertising team deployed an end‑to‑end AIGC pipeline that automatically creates high‑quality ad images, uses the multimodal Reliable Feedback Network and the million‑size RF1M dataset to filter outputs, builds rich offline and online multimodal representations with contrastive and list‑wise learning, and optimizes ranking architecture to deliver scalable, personalized creative selection.

AIAIGCAdvertising
0 likes · 10 min read
2024 Advances in Advertising Creative Generation and Selection
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 25, 2025 · Artificial Intelligence

Boost Your AI Search Skills: Advanced Prompt & Query Tricks

This guide explains how to leverage AI tools with deep web‑search capabilities, covering site‑specific queries, wildcard operators, date ranges, Boolean logic, and effective prompt engineering techniques—including Socratic questioning and CRISPE framework—to improve information retrieval accuracy and efficiency across various domains.

AILarge Language ModelsSearch Operators
0 likes · 8 min read
Boost Your AI Search Skills: Advanced Prompt & Query Tricks
Alibaba Cloud Observability
Alibaba Cloud Observability
Mar 24, 2025 · Information Security

DeepSeek ClickHouse Leak: AI Data Risks & Cloud Native Log Service Safeguards

An exposed ClickHouse database at DeepSeek revealed over a million sensitive logs—including chats, API keys, and backend details—highlighting AI data security gaps, while Alibaba Cloud’s Log Service (SLS) offers comprehensive protection through access control, data masking, fine-grained query limits, and real‑time monitoring.

AILog ServiceObservability
0 likes · 11 min read
DeepSeek ClickHouse Leak: AI Data Risks & Cloud Native Log Service Safeguards
58UXD
58UXD
Mar 24, 2025 · Fundamentals

How Design‑Driven Strategies Boost Your Impact in the AI Era

This article explores how designers can shift from passive execution to proactive, design‑driven initiatives—leveraging AI, aligning with business goals, starting with small wins, building ally networks, and overcoming resource and verification challenges—to increase their professional value and influence within product teams.

AIcareer developmentdesign
0 likes · 8 min read
How Design‑Driven Strategies Boost Your Impact in the AI Era
JD Cloud Developers
JD Cloud Developers
Mar 24, 2025 · Artificial Intelligence

How Multi-Agent Reinforcement Learning Boosts Ad Computation Allocation

This article presents MaRCA, a multi‑agent reinforcement‑learning framework that allocates computation resources across the full ad‑serving chain, modeling user value, compute cost, and action rewards to maximize ad revenue while keeping system load stable under fluctuating traffic.

AIad optimizationcomputation allocation
0 likes · 16 min read
How Multi-Agent Reinforcement Learning Boosts Ad Computation Allocation
Java Architecture Diary
Java Architecture Diary
Mar 24, 2025 · Backend Development

Build a Java MCP Server with Spring AI in Minutes

This tutorial shows how to use Spring AI MCP to create a Java MCP server, covering environment setup, business logic implementation, service registration, and client configuration, enabling seamless AI service integration with minimal effort.

AIMCPintegration
0 likes · 6 min read
Build a Java MCP Server with Spring AI in Minutes
Model Perspective
Model Perspective
Mar 24, 2025 · Artificial Intelligence

Can AI Play Devil’s Advocate to Sharpen Decision‑Making?

The article explores how assigning AI the role of a contrarian can expose hidden risks, challenge assumptions, and improve strategic decisions across education, business, and personal contexts, illustrating the approach with a simulated debate on launching a math‑modeling training venture.

AIDevil's Advocatebusiness strategy
0 likes · 9 min read
Can AI Play Devil’s Advocate to Sharpen Decision‑Making?
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 24, 2025 · Artificial Intelligence

Why LLM Internet Search Fails and How to Fix It: A Deep Dive into Qwen, Doubao, and DeepSeek

This article analyses the shortcomings of large‑model internet search—such as unverifiable sources, fabricated content, and poor instruction compliance—by comparing Qwen‑max, Doubao‑1.5‑pro‑256k, and DeepSeek‑v3, and proposes prompt engineering, post‑processing, and custom tool improvements to boost reliability.

AIEvaluationLLM
0 likes · 22 min read
Why LLM Internet Search Fails and How to Fix It: A Deep Dive into Qwen, Doubao, and DeepSeek
Continuous Delivery 2.0
Continuous Delivery 2.0
Mar 24, 2025 · Artificial Intelligence

Traditional vs AI: Can R&D Efficiency Increase Tenfold?

The live session examines how AI tools impact software development productivity, detailing personal, team, and organizational effects, practical use cases, limitations, industry implications, and a comparison between domestic and foreign AI solutions, concluding that AI boosts individual output but offers limited gains at scale.

AIR&D efficiencycode generation
0 likes · 7 min read
Traditional vs AI: Can R&D Efficiency Increase Tenfold?
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
Mar 24, 2025 · Artificial Intelligence

How to Model Architecture for a High‑Performance Recommendation System

This article walks through business, conceptual, logical, and physical modeling steps to design a recommendation system architecture, detailing value propositions, workflow decomposition, component breakdown, and technology choices to meet reliability, low‑latency, and scalability requirements.

AISystem Designarchitecture modeling
0 likes · 10 min read
How to Model Architecture for a High‑Performance Recommendation System
Fighter's World
Fighter's World
Mar 23, 2025 · Industry Insights

What a 1999 Book Got Right About AI, Crypto, and the Rise of the Sovereign Individual

A reflective analysis shows how the 1999 predictions in *The Sovereign Individual* about AI, decentralized finance and the diminishing relevance of geography closely match today’s rapid advances in large‑language models, cryptocurrency adoption, and the empowerment of individuals over traditional institutions.

AIDecentralizationDigital Economy
0 likes · 10 min read
What a 1999 Book Got Right About AI, Crypto, and the Rise of the Sovereign Individual
Cognitive Technology Team
Cognitive Technology Team
Mar 22, 2025 · Artificial Intelligence

Principle-Based Thinking in the AI Era: A 15‑Minute Guide

The article explains principle‑based thinking as a deep cognitive approach grounded in fundamental laws, outlines its core practices, and shows how to apply it for transparent, collaborative, and risk‑aware AI usage while offering actionable steps for individuals and organizations.

AIcognitive strategyhuman-AI collaboration
0 likes · 5 min read
Principle-Based Thinking in the AI Era: A 15‑Minute Guide
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Mar 21, 2025 · Industry Insights

How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations

This article analyzes how DeepSeek’s multimodal large model overcomes traditional digital‑twin simulation bottlenecks through dynamic modeling, generative data augmentation, and low‑cost open‑source architecture, enabling smarter city traffic, industrial design, and water‑resource management while reshaping the industry’s AI‑driven simulation landscape.

AIDeepSeekDigital Twin
0 likes · 22 min read
How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations
ByteDance Web Infra
ByteDance Web Infra
Mar 21, 2025 · Artificial Intelligence

Midscene.js: An AI‑Driven UI Automation Framework from ByteDance

Midscene.js is an open‑source UI automation framework that leverages multimodal AI to simplify web UI testing and interaction, offering three core interfaces—Action, Query, and Assert—along with a JavaScript SDK, support for multiple AI models, YAML scripting, and future‑focused features for stable, scalable automation.

AIJavaScriptMidscene.js
0 likes · 21 min read
Midscene.js: An AI‑Driven UI Automation Framework from ByteDance
Architect
Architect
Mar 20, 2025 · Artificial Intelligence

Building a Gitee AI Repository Assistant with MCP and LangChain4j

This article explains the Model Context Protocol (MCP) introduced by Gitee, shows how Java developers can integrate it using LangChain4j, compares stdio and SSE transport modes, provides full code samples, installation steps, and demonstrates a practical AI‑powered repository assistant.

AICode AutomationGitee
0 likes · 9 min read
Building a Gitee AI Repository Assistant with MCP and LangChain4j
Didi Tech
Didi Tech
Mar 20, 2025 · Big Data

Key Questions and Value Assessment in Data Warehouse Modeling and Development

The article explores nine fundamental questions about data‑warehouse modeling—why and when to model, how to evaluate and compare models, the warehouse’s unique role versus business systems, modern architectural shifts, a quantitative value‑proof scoring framework, industry‑standard versus custom approaches, demonstrating business impact, and career insights—concluding that true value lies in enabling informed decisions rather than technology hype.

AIBig DataData Value
0 likes · 12 min read
Key Questions and Value Assessment in Data Warehouse Modeling and Development
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 20, 2025 · Backend Development

Build an AI-Powered Financial Data Analyzer with XXL‑JOB and Deepseek

This guide explains how to create a scheduled financial data analysis system by integrating the XXL‑JOB distributed task scheduler with the Deepseek large‑model AI, covering model selection, local and cloud deployment options, job configuration, and a complete code example for automated news processing.

AIData AnalysisFinancial AI
0 likes · 10 min read
Build an AI-Powered Financial Data Analyzer with XXL‑JOB and Deepseek
Meituan Technology Team
Meituan Technology Team
Mar 20, 2025 · Artificial Intelligence

Meituan Tech Team's Selected Papers on Large Language Models and AI (2024-2025)

The article compiles Meituan’s recent 2024‑2025 research on large language models, presenting a diverse set of papers that explore transformer enhancements, scaling laws, safety optimization, instruction fine‑tuning, temporal decay learning, code generation, agent refinement, cost‑efficient MoE inference, quantization, fast parallel inference, speculative decoding, multilingual speech, vision‑language models, evaluation benchmarks, and jailbreak robustness.

AILLMMeituan
0 likes · 4 min read
Meituan Tech Team's Selected Papers on Large Language Models and AI (2024-2025)
Practical DevOps Architecture
Practical DevOps Architecture
Mar 20, 2025 · Artificial Intelligence

DeepSeek Model Integration Tutorial Series

This collection provides a step‑by‑step tutorial series of sixteen short videos demonstrating how to access, configure, and use the DeepSeek large language model across various office applications such as Word, Excel, PowerPoint, as well as platforms like WPS and online APIs.

AIAPIDeepSeek
0 likes · 5 min read
DeepSeek Model Integration Tutorial Series
Sohu Tech Products
Sohu Tech Products
Mar 19, 2025 · Artificial Intelligence

How to Recreate a Translation Agent with LangGraph and LLMs

This guide demonstrates building a steerable LLM‑based translation workflow using LangGraph, covering the initial translation, model‑generated reflection suggestions, and final improvement steps with full Python code examples and a complete execution result.

AILLMLangGraph
0 likes · 34 min read
How to Recreate a Translation Agent with LangGraph and LLMs
Sohu Tech Products
Sohu Tech Products
Mar 19, 2025 · Artificial Intelligence

Easy DataSet: An Open‑Source Tool for Building Domain‑Specific Datasets and Fine‑Tuning Large Language Models

The article introduces Easy DataSet, an open‑source tool that streamlines the creation of domain‑specific datasets by aggregating public data sources, chunking Markdown documents, generating and managing QA pairs with configurable LLM endpoints, and exporting them in common formats, while outlining its architecture and future roadmap.

AIData ManagementLLM Fine‑Tuning
0 likes · 30 min read
Easy DataSet: An Open‑Source Tool for Building Domain‑Specific Datasets and Fine‑Tuning Large Language Models
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Mar 19, 2025 · Artificial Intelligence

Choosing the Right Deployment Strategy for Large Language Models: QwQ‑32B vs DeepSeek‑R1

This article compares QwQ‑32B and DeepSeek‑R1 large language models across performance, technical breakthroughs, deployment costs, and open‑source ecosystems, then evaluates pure‑local, hybrid, and pure‑cloud deployment options, and finally provides practical guidelines for preparing knowledge‑base documents and indexing methods.

AIKnowledge BaseLarge Language Model
0 likes · 10 min read
Choosing the Right Deployment Strategy for Large Language Models: QwQ‑32B vs DeepSeek‑R1
21CTO
21CTO
Mar 19, 2025 · Fundamentals

What’s New in Java 24? AI‑Ready Features, Post‑Quantum Security, and More

Oracle’s Java 24 release introduces AI‑focused enhancements such as the Vector API, new post‑quantum cryptography support with ML‑KEM and ML‑DSA, language improvements like pattern‑matching for primitives, and beginner‑friendly features, positioning Java for modern development and long‑term support.

AIJEPJava
0 likes · 7 min read
What’s New in Java 24? AI‑Ready Features, Post‑Quantum Security, and More
Architect
Architect
Mar 19, 2025 · Artificial Intelligence

Choosing the Best Embedding Model for RAG: A Practical Guide Using MTEB Rankings

This guide explains how to leverage the Massive Text Embedding Benchmark (MTEB) to identify high‑performing embedding models for Retrieval‑Augmented Generation (RAG) and outlines key factors such as model size, dimension, language support, resource requirements, inference speed, domain suitability, long‑text handling, scalability, and cost.

AIEmbeddingMTEB
0 likes · 12 min read
Choosing the Best Embedding Model for RAG: A Practical Guide Using MTEB Rankings
Alibaba Cloud Native
Alibaba Cloud Native
Mar 19, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

This article explains the Retrieval‑Augmented Generation (RAG) paradigm, walks through its four core steps, and provides a detailed Spring AI implementation—including configuration, vector storage, REST controller, multi‑query expansion, query rewriting, document joining, and error handling—plus best‑practice recommendations for production deployments.

AIJavaRAG
0 likes · 23 min read
Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide
Yuantong Information Technology
Yuantong Information Technology
Mar 19, 2025 · Artificial Intelligence

AI Boosts Express Delivery: Smart Sorting, Route Planning & Collaboration

The article explains how AI-driven technologies such as intelligent sorting systems, route‑optimization platforms, and voice‑activated assistants are reshaping China’s parcel delivery sector, boosting efficiency by up to 50%, cutting costs, and fostering human‑machine collaboration while still emphasizing the irreplaceable role of workers.

AISmart Sortinghuman‑machine collaboration
0 likes · 5 min read
AI Boosts Express Delivery: Smart Sorting, Route Planning & Collaboration
Amap Tech
Amap Tech
Mar 19, 2025 · Artificial Intelligence

Driving by the Rules: Integrating Lane-Level Traffic Regulations into Online HD Maps

Gaode Map and Xi'an Jiaotong University introduce the “Driving by the Rules” task, releasing the MapDR benchmark that integrates lane‑level traffic‑sign regulations into online‑constructed HD maps, and provide modular (VLE‑MEE) and end‑to‑end (RuleVLM) baselines to evaluate rule extraction and lane association.

AIAutonomous DrivingBenchmark
0 likes · 8 min read
Driving by the Rules: Integrating Lane-Level Traffic Regulations into Online HD Maps
DataFunSummit
DataFunSummit
Mar 18, 2025 · Artificial Intelligence

Application and Implementation of Multimodal Relational Networks in Financial Risk Control

This article presents the background, key technologies, system architecture, data processing pipeline, and practical use cases of multimodal relational networks for enhancing financial risk control, highlighting how integrating image, voice, text, and device data improves fraud detection, modeling, and operational efficiency.

AIfinancial technologyfraud detection
0 likes · 15 min read
Application and Implementation of Multimodal Relational Networks in Financial Risk Control
AntTech
AntTech
Mar 18, 2025 · Artificial Intelligence

MoLE: Decoding by Mixture of Layer Experts Alleviates Hallucination in Large Vision-Language Models

Researchers from Ant Insurance and Zhejiang University propose MoLE, a Mixture of Layer Experts decoding method that reduces hallucinations in large vision‑language models, demonstrating state‑of‑the‑art performance on LVLM benchmarks and enabling reliable end‑to‑end medical‑record‑to‑claim automation.

AIHallucination MitigationMixture of Experts
0 likes · 7 min read
MoLE: Decoding by Mixture of Layer Experts Alleviates Hallucination in Large Vision-Language Models
NewBeeNLP
NewBeeNLP
Mar 18, 2025 · Interview Experience

How to Ace Multimodal Model Interviews at Taobao's Search AI Division

This article recounts a three‑stage interview for a multimodal large‑model position at Taobao's Search AI division, detailing typical questions on CLIP, LoRA, BLIP, Qwen‑VL, Transformer fundamentals, RLHF, and coding challenges, and offers insights on what interviewers focus on.

AICLIPInterview
0 likes · 5 min read
How to Ace Multimodal Model Interviews at Taobao's Search AI Division
58UXD
58UXD
Mar 18, 2025 · Artificial Intelligence

How AI is Revolutionizing Design: 8 Powerful Tools Every Designer Must Know

This article explores how rapidly advancing AI technologies are transforming designers into creative partners, detailing eight practical AI tools that boost idea generation, streamline workflows, enhance detail work, add motion, provide feedback, generate copy, auto‑create UI, and enable cross‑domain innovations.

AIMidjourneyStable Diffusion
0 likes · 8 min read
How AI is Revolutionizing Design: 8 Powerful Tools Every Designer Must Know
DevOps
DevOps
Mar 17, 2025 · Artificial Intelligence

Building an MCP Server and Client in Python: From 0 to 1 with Stdio and SSE Transports

This tutorial explains how to create a Model Context Protocol (MCP) server and client in Python, covering environment setup, unified tool integration, Stdio and SSE transport implementations, and step‑by‑step code examples for building, configuring, and running both local and cloud‑based MCP services.

AIMCPPython
0 likes · 18 min read
Building an MCP Server and Client in Python: From 0 to 1 with Stdio and SSE Transports
Ops Development & AI Practice
Ops Development & AI Practice
Mar 17, 2025 · Artificial Intelligence

Unlocking LLM Power: A Hands‑On Guide to Open WebUI

Open WebUI offers a user‑friendly, open‑source web interface that simplifies interaction with large language models, supporting multiple back‑ends, offline operation, and extensible plugins, making AI experimentation accessible for developers, researchers, and enthusiasts alike.

AILLMModel Management
0 likes · 4 min read
Unlocking LLM Power: A Hands‑On Guide to Open WebUI
DaTaobao Tech
DaTaobao Tech
Mar 17, 2025 · Artificial Intelligence

AI-Powered Content Generation and Template Automation for E‑commerce

Taobao’s AI‑driven content system now generates product guides, videos and multimodal assets across the shopping journey by automatically converting designer mock‑ups into HTML templates, extracting key information, filling slots with product data, and refining layouts via natural‑language feedback, dramatically cutting manual effort and enabling rapid, personalized e‑commerce experiences.

AIcontent generationtemplate automation
0 likes · 8 min read
AI-Powered Content Generation and Template Automation for E‑commerce
Eric Tech Circle
Eric Tech Circle
Mar 17, 2025 · Artificial Intelligence

Mastering Cursor Rules: Tame AI Code Generation for Better Development

This article explains how Cursor Rules let developers constrain AI behavior in the Cursor editor, covering global and project rule types, practical file configurations, best‑practice examples for general, Python, documentation, and Git rules, and tips for effective rule management.

AIBest PracticesCursor
0 likes · 6 min read
Mastering Cursor Rules: Tame AI Code Generation for Better Development
Efficient Ops
Efficient Ops
Mar 16, 2025 · Artificial Intelligence

How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases

This article explains how AI-powered digital humans can modernize banking by offering modular, multi‑modal interaction, personalized multilingual service, 24‑hour availability, and risk‑aware automation, while detailing the underlying AI foundation, decision engine, visual rendering, and deployment strategies.

AICustomer ServiceFinTech
0 likes · 7 min read
How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases
21CTO
21CTO
Mar 16, 2025 · Artificial Intelligence

How Google’s AI Co‑Scientist Is Accelerating Biomedical Discoveries

Google unveiled an AI co‑scientist built on Gemini 2.0 that uses multiple specialized agents to generate, evaluate, and refine research hypotheses, demonstrating promising results in drug repurposing, liver fibrosis target discovery, and antibiotic resistance, while also highlighting current limitations and community reactions.

AIGoogle AIScientific Discovery
0 likes · 6 min read
How Google’s AI Co‑Scientist Is Accelerating Biomedical Discoveries
Ops Development & AI Practice
Ops Development & AI Practice
Mar 16, 2025 · Artificial Intelligence

How Function Calling Helps LLMs Overcome Hallucinations

This article explains how LLM function calling works, from defining external functions to processing API responses, and demonstrates a Python example using OpenAI's ChatGPT‑4o to fetch real‑time weather, showing how the technique mitigates hallucinations and expands practical AI applications.

AIFunction CallingHallucination Mitigation
0 likes · 8 min read
How Function Calling Helps LLMs Overcome Hallucinations
Architect
Architect
Mar 15, 2025 · Artificial Intelligence

Why Building Your Own RAG System Is a Costly Mistake

The article explains that developing a custom Retrieval‑Augmented Generation (RAG) solution incurs hidden infrastructure, personnel, and security costs, leads to operational overload and budget overruns, and is rarely justified compared to purchasing a proven vendor solution.

AILLMRAG
0 likes · 11 min read
Why Building Your Own RAG System Is a Costly Mistake
Nightwalker Tech
Nightwalker Tech
Mar 15, 2025 · Artificial Intelligence

Guide to Accessing International AI Large Models via Aggregation Tools, APIs, and Python Code

This article introduces major international and domestic AI large models, recommends desktop aggregation tools and APIs such as POE, Monica, and OpenRouter, and provides complete Python code examples for synchronous and streaming text and multimodal conversations, along with additional API and compute‑rental resources.

AIAPIOpenRouter
0 likes · 11 min read
Guide to Accessing International AI Large Models via Aggregation Tools, APIs, and Python Code
AI Frontier Lectures
AI Frontier Lectures
Mar 14, 2025 · Artificial Intelligence

Open-Sora 2.0: How an 11B Open-Source Model Beats Closed-Source Video AI at 720p

Open‑Sora 2.0, an open‑source 11‑billion‑parameter video generation model, delivers 720p 24 fps videos with visual quality and text‑image alignment comparable to proprietary systems like HunyuanVideo and Step‑Video, while cutting training costs to $200 k using only 224 GPUs, and the release includes full code, weights, and a Gradio demo.

3D autoencoderAIMMDiT
0 likes · 7 min read
Open-Sora 2.0: How an 11B Open-Source Model Beats Closed-Source Video AI at 720p
DaTaobao Tech
DaTaobao Tech
Mar 14, 2025 · Artificial Intelligence

AI-Driven Engineering Efficiency: Practices and Insights from a Live-Streaming Team

The article recounts a live‑streaming team’s six‑month experiment using large‑language‑model AI to boost backend, frontend, testing, data‑science and data‑engineering productivity, detailing goals, LLM strengths and limits, and practical tactics such as task splitting, input refinement, human‑AI guidance, retrieval‑augmented generation and fine‑tuning, while emphasizing disciplined task design, prompt iteration, and future vertical integrations.

AIRAGfine-tuning
0 likes · 17 min read
AI-Driven Engineering Efficiency: Practices and Insights from a Live-Streaming Team
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Mar 14, 2025 · Artificial Intelligence

How Automatic Creation of Digital Cousins Revolutionizes Digital Twin Simulations

The article analyzes the Automatic Creation of Digital Cousins (ACDC) technology, detailing its pipeline—from object recognition and semantic segmentation to depth estimation, camera calibration, and 3D scene reconstruction—while discussing challenges, industry applications, and future research directions.

3D scene generationAIdigital twins
0 likes · 20 min read
How Automatic Creation of Digital Cousins Revolutionizes Digital Twin Simulations
JD Retail Technology
JD Retail Technology
Mar 14, 2025 · Artificial Intelligence

CTR-Driven Advertising Image Generation Using Multimodal Large Language Models

The paper presents CAIG, a CTR‑driven advertising image generation pipeline that pre‑trains a multimodal LLM on e‑commerce data, trains a reward model on CTR‑labeled image pairs, and fine‑tunes generation via product‑centric preference optimization, achieving state‑of‑the‑art online and offline performance.

AICTRad image generation
0 likes · 11 min read
CTR-Driven Advertising Image Generation Using Multimodal Large Language Models
Nightwalker Tech
Nightwalker Tech
Mar 14, 2025 · Backend Development

Overview and Installation Guide for Various MCP Services and Their Use with Sequential Thinking for Manus‑like Effects

This article introduces several Model Context Protocol (MCP) services—including Sequential Thinking, Firecrawl, Fetch, Hot News, Playwright, Magic, and Brave Search—provides their GitHub links, detailed Mac and Windows installation commands, and explains how to combine them with a Sequential Thinking prompt to achieve a Manus‑style AI agent workflow.

AIInstallationMCP
0 likes · 9 min read
Overview and Installation Guide for Various MCP Services and Their Use with Sequential Thinking for Manus‑like Effects
Continuous Delivery 2.0
Continuous Delivery 2.0
Mar 14, 2025 · Operations

The Birth of DevOps: Breaking the Collaboration Wall

This article traces the evolution of DevOps from its 2009 origin, through automation, security, FinOps, platform engineering, and the rise of AI-driven intelligent automation, highlighting future trends such as AI-native toolchains, cognitive collaboration, and sustainable practices that reshape how development and operations work together.

AIDevOpsFinOps
0 likes · 7 min read
The Birth of DevOps: Breaking the Collaboration Wall
DevOps
DevOps
Mar 13, 2025 · Artificial Intelligence

Large Model Commercialization Reshapes Cloud AI Competition: Capital Spending, Strategic Paths, and Multi‑Model Ecosystems

The article analyzes how the commercialization of large AI models is redefining cloud providers' competitive dynamics, highlighting Amazon Bedrock's DeepSeek‑R1 launch, IDC forecasts on model usage, major vendors' capital expenditures, and the shift toward flexible, cost‑effective multi‑model ecosystems for enterprise AI.

AICloud ComputingEnterprise AI
0 likes · 14 min read
Large Model Commercialization Reshapes Cloud AI Competition: Capital Spending, Strategic Paths, and Multi‑Model Ecosystems
Open Source Tech Hub
Open Source Tech Hub
Mar 13, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama

This guide walks you through setting up a private AI knowledge base using Webman AI 5.4.0, deploying Redis‑Stack, installing the illuminate/redis component, adding Ollama with DeepSeek and other embedding models, configuring Redis, importing training data, running the training process, and configuring role prompts for accurate AI responses.

AIDeepSeekOllama
0 likes · 6 min read
Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama
Java Tech Enthusiast
Java Tech Enthusiast
Mar 12, 2025 · Artificial Intelligence

Open-Source Enterprise Knowledge Base and Intelligent Dialogue Platform Based on Spring Boot and DeepSeek

The open‑source DeepSeek‑Flow‑AI platform combines Spring Boot 3.4 back‑end APIs with a Vue 3 front‑end to deliver an enterprise‑grade knowledge base and intelligent multi‑turn dialogue system, supporting private deployment, role‑based access, analytics, CRM/ERP integration, and easy installation via Maven and Yarn.

AIDeepSeekKnowledge Base
0 likes · 5 min read
Open-Source Enterprise Knowledge Base and Intelligent Dialogue Platform Based on Spring Boot and DeepSeek
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Mar 11, 2025 · Artificial Intelligence

How AI Is Revolutionizing Traditional Chinese Medicine with Huawei’s Pangu Model

This article explores how Huawei Cloud's Pangu large‑model platform powers the TianShiLi Digital Herbal AI system, merging centuries‑old Chinese medicine knowledge with cutting‑edge AI to accelerate drug discovery, enable intelligent diagnosis, and transform the entire TCM research and development workflow.

AIHealthcare AIHuawei Cloud
0 likes · 9 min read
How AI Is Revolutionizing Traditional Chinese Medicine with Huawei’s Pangu Model
Tencent Cloud Developer
Tencent Cloud Developer
Mar 11, 2025 · Artificial Intelligence

Fine‑Tuning Local LLaMA‑Factory Models and Building Networked AI Applications

The article walks through preparing a GPU‑enabled environment, downloading and LoRA‑fine‑tuning a DeepSeek model with LLaMA‑Factory, merging the adapter, then wrapping the model in a web UI that queries a ChromaDB vector store via crawled web data, illustrating security‑focused use cases and forecasting domain‑specific LLM adoption.

AILLMLLaMA-Factory
0 likes · 17 min read
Fine‑Tuning Local LLaMA‑Factory Models and Building Networked AI Applications
Baidu Geek Talk
Baidu Geek Talk
Mar 10, 2025 · Artificial Intelligence

How Baidu Cloud’s AI+ Strategy Powers State‑Owned Enterprises with DeepSeek One‑Box Solutions

The article examines Baidu Cloud’s integration of DeepSeek large‑model hardware, detailing the Baige and Qianfan one‑box systems, their technical specs, deployment speed, and how they enable state‑owned enterprises across energy, manufacturing, and logistics to accelerate AI‑driven digital transformation.

AIBaidu CloudCloud Computing
0 likes · 6 min read
How Baidu Cloud’s AI+ Strategy Powers State‑Owned Enterprises with DeepSeek One‑Box Solutions
Cognitive Technology Team
Cognitive Technology Team
Mar 10, 2025 · Artificial Intelligence

Understanding Transformers: From NLP Challenges to Architecture and Core Mechanisms

This article explains the evolution of natural language processing, the limitations of rule‑based, statistical, and recurrent neural network models, and then introduces the Transformer architecture—covering word and position embeddings, self‑attention, multi‑head attention, Add & Norm, feed‑forward layers, and encoder‑decoder design—to help beginners grasp why Transformers solve key NLP problems.

AINLPSelf-Attention
0 likes · 15 min read
Understanding Transformers: From NLP Challenges to Architecture and Core Mechanisms
phodal
phodal
Mar 10, 2025 · Artificial Intelligence

How AutoDev Bridge Uses LLMs to Accelerate Legacy System Migration

AutoDev Bridge combines large‑model reasoning, C4 architecture analysis, AST‑based business logic extraction, and IDE‑integrated tooling to automate the migration of legacy systems, reducing manual effort and migration risk while highlighting the unique advantages of modern AI agents.

AICode TranslationLLM
0 likes · 7 min read
How AutoDev Bridge Uses LLMs to Accelerate Legacy System Migration
Java Architect Essentials
Java Architect Essentials
Mar 9, 2025 · Backend Development

Building an AI-Powered Chatbot with Spring Boot and DeepSeek

This tutorial demonstrates how to create an AI-driven Spring Boot application by integrating DeepSeek's large language model, covering project setup, dependency configuration, API key management, and implementing a REST controller that provides weather forecasts via a conversational interface.

AIChatbotDeepSeek
0 likes · 8 min read
Building an AI-Powered Chatbot with Spring Boot and DeepSeek