Tagged articles

190 articles

Page 1 of 2

May 19, 2026 · Artificial Intelligence

100k‑Token Natural‑Language Reasoning Enables a 30B‑A3B Model to Reach Olympiad Gold Level

A 30B‑A3B model, trained with reverse‑perplexity supervised fine‑tuning, two‑stage reinforcement learning, and a multi‑round generate‑verify‑revise inference loop, achieves gold‑medal performance on IMO, USAMO and IPhO contests using over 100 k token natural‑language reasoning without external tools.

30B-A3Bnatural language processingolympiad AI

0 likes · 11 min read

100k‑Token Natural‑Language Reasoning Enables a 30B‑A3B Model to Reach Olympiad Gold Level

Architects' Tech Alliance

May 8, 2026 · Artificial Intelligence

Token Fundamentals: A Technical Panorama of AI Language Units

Tokens are the smallest language building blocks that AI models process, representing characters, words, subwords, punctuation or emojis; they determine context window size and generation speed, so tokenization directly impacts model understanding accuracy and efficiency, as explained in the 2026 Token Report.

AI fundamentalsLanguage Modelscontext window

0 likes · 4 min read

Token Fundamentals: A Technical Panorama of AI Language Units

Machine Learning Algorithms & Natural Language Processing

Apr 21, 2026 · Artificial Intelligence

Why Do Papers with a '?' in the Title Achieve a 45% Acceptance Rate? A Five‑Year ICLR Keyword Analysis

Analyzing five years of ICLR submission metadata reveals that titles containing a question mark boost acceptance to 45.5% in 2022, while emerging keywords such as diffusion, sparse, and planning dominate high‑acceptance lists, and older topics like federated learning, adversarial attacks, and security suffer low acceptance and high withdrawal rates.

Data AnalysisICLRMachine Learning

0 likes · 8 min read

Why Do Papers with a '?' in the Title Achieve a 45% Acceptance Rate? A Five‑Year ICLR Keyword Analysis

Machine Learning Algorithms & Natural Language Processing

Apr 17, 2026 · Artificial Intelligence

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

The article explains why conventional Retrieval‑Augmented Generation often produces incorrect answers despite retrieving relevant documents, introduces the Disco‑RAG framework that adds a structured reading step using argument trees and relation graphs, and shows how this three‑step approach dramatically improves performance on long‑document and ambiguous‑question benchmarks without any model training.

Disco-RAGRAGRetrieval-Augmented Generation

0 likes · 13 min read

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

Machine Learning Algorithms & Natural Language Processing

Mar 6, 2026 · Artificial Intelligence

Why Learning from Context Is Harder Than We Thought

The talk examines why large language models, despite impressive performance on knowledge‑based tasks, struggle dramatically when required to learn new information from the immediate input context, analyzes systematic biases behind this limitation, and explores rubric‑based synthesis as a potential remedy.

context learninglarge language modelsnatural language processing

0 likes · 4 min read

Why Learning from Context Is Harder Than We Thought

Qborfy AI

Feb 21, 2026 · Artificial Intelligence

How Self-Attention Powers Modern AI: From Theory to Real-World Impact

This article explains the self‑attention mechanism behind transformers, detailing its core components, mathematical formulation, step‑by‑step example, multi‑head extension, industry use cases, and a thorough comparison with RNN and CNN approaches, all supported by concrete numbers and citations.

Self-AttentionTransformerattention mechanism

0 likes · 8 min read

How Self-Attention Powers Modern AI: From Theory to Real-World Impact

Mingyi World Elasticsearch

Jan 30, 2026 · Backend Development

Text2DSL: Convert Natural Language to Precise Elasticsearch/Easysearch DSL

Text2DSL lets users describe search requirements in plain language, uses DeepSeek to generate Elasticsearch DSL, validates the DSL locally with Elasticsearch/Easysearch, iteratively refines it up to five times, and achieves over 95% first‑try accuracy while cutting query‑building time by at least threefold.

DSL generationDeepSeekEasysearch

0 likes · 12 min read

Text2DSL: Convert Natural Language to Precise Elasticsearch/Easysearch DSL

Cognitive Technology Team

Nov 5, 2025 · Artificial Intelligence

How AI Is Revolutionizing End-to-End Test Automation at Tmall

Leveraging AI and natural language processing, Tmall’s quality assurance team transformed traditional manual testing into a semi‑automated and fully automated pipeline—covering requirement analysis, test case generation, data construction, execution, and validation—resulting in significant efficiency gains, traceability, and continuous improvement across multiple business lines.

AI testingSoftware qualitycontinuous integration

0 likes · 10 min read

How AI Is Revolutionizing End-to-End Test Automation at Tmall

dbaplus Community

Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG

0 likes · 17 min read

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

Data Party THU

Oct 11, 2025 · Artificial Intelligence

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

This article surveys the most influential large language models released since 2017, detailing the core innovations of Transformer, BERT, GPT series, T5, Retrieval‑Augmented Generation, and the latest LLaMA and Meta models, while highlighting their architectures, training paradigms, and impact on NLP research.

LLMModel Scalinglarge language models

0 likes · 21 min read

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

Baidu Geek Talk

Sep 15, 2025 · Artificial Intelligence

How Baidu’s AI Navigation Turns Voice Commands into Precise Actions

This article explains how Baidu Map’s AI navigation system converts spoken queries into accurate map instructions by combining speech recognition, intent parsing, large‑language‑model reasoning, tool calling, and memory‑reflection techniques, showcasing the underlying technologies that enable instant, context‑aware responses.

AILLMMap Services

0 likes · 13 min read

How Baidu’s AI Navigation Turns Voice Commands into Precise Actions

Baidu Maps Tech Team

Jul 31, 2025 · Artificial Intelligence

How Baidu’s AI Voice Assistant Turns Speech into Precise Navigation Commands

This article explains how Baidu Map’s AI voice assistant converts spoken commands into precise navigation actions by detailing the speech‑to‑text pipeline, intent parsing, template and generative approaches, tool‑calling mechanisms, memory and reflection capabilities, and future directions for intelligent agents.

AIIntent ParsingLLM

0 likes · 14 min read

How Baidu’s AI Voice Assistant Turns Speech into Precise Navigation Commands

Programmer Xu Shu

Jun 23, 2025 · Artificial Intelligence

From Bag‑of‑Words to ChatGPT: How Large Language Models Evolved

Tracing the evolution of large language models—from early bag‑of‑words techniques, through word embeddings, RNNs, attention mechanisms, Transformers, BERT, and GPT—this article explains each breakthrough, its limitations, and how they culminated in ChatGPT’s conversational AI.

AI evolutionChatGPTTransformer

0 likes · 12 min read

From Bag‑of‑Words to ChatGPT: How Large Language Models Evolved

MaGe Linux Operations

Jun 15, 2025 · Artificial Intelligence

Mastering Transformers: Key Extensions and Optimization Techniques Explained

This comprehensive guide walks you through the Transformer architecture—from its encoder‑decoder structure and self‑attention mechanism to multi‑head attention, positional embeddings, and practical PyTorch implementations—providing clear visualizations and code examples for deep learning practitioners.

PyTorchSelf-AttentionTransformer

0 likes · 22 min read

Mastering Transformers: Key Extensions and Optimization Techniques Explained

Instant Consumer Technology Team

Jun 4, 2025 · Artificial Intelligence

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

This comprehensive article examines Retrieval‑Augmented Generation (RAG), covering its historical evolution, core theory, implementation variants, practical code examples, diverse applications, current controversies, and future research directions within the AI and NLP landscape.

Artificial IntelligenceGenerative ModelsRAG

0 likes · 21 min read

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

Data Thinking Notes

Jun 2, 2025 · Artificial Intelligence

Why Pre‑Training Powers Modern AI: From Theory to Real‑World Applications

Pre‑training enables AI models to first acquire a universal knowledge map from massive unlabelled text, then quickly adapt to specific tasks with minimal labelled data, offering superior generalization, reduced annotation costs, and versatile applications across chatbots, content creation, retrieval, coding assistance, and more.

AI applicationsTransformerlarge language models

0 likes · 14 min read

Why Pre‑Training Powers Modern AI: From Theory to Real‑World Applications

AntData

May 30, 2025 · Artificial Intelligence

DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution

The article presents an in‑depth overview of DeepInsight Copilot, an AI‑driven business intelligence product that streamlines data, information, insight, and decision‑recommendation stages, detailing its functional modules, intelligent agents, multi‑generation technical evolution, architecture, model fine‑tuning, and future challenges and solutions in data analysis.

AIData AnalysisMachine Learning

0 likes · 21 min read

DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution

Beijing SF i-TECH City Technology Team

Apr 7, 2025 · Artificial Intelligence

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

This article explores the application of Large Language Models (LLM) in text information detection and extraction, focusing on blue-collar recruitment data processing. It details the implementation of LLM through prompt engineering, RAG enhancement, and model fine-tuning to improve data cleaning efficiency and accuracy.

AI applicationsLLMRAG

0 likes · 31 min read

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

Cognitive Technology Team

Mar 30, 2025 · Artificial Intelligence

Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning

Prompt engineering uses in‑context learning to turn large language models into precise, task‑aware assistants by providing well‑crafted prompts that guide the model’s probability distribution, reduce hallucinations, and unlock hidden knowledge without any parameter tuning.

Artificial IntelligenceIn-Context Learninglarge language models

0 likes · 6 min read

Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning

Efficient Ops

Mar 16, 2025 · Artificial Intelligence

How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases

This article explains how AI-powered digital humans can modernize banking by offering modular, multi‑modal interaction, personalized multilingual service, 24‑hour availability, and risk‑aware automation, while detailing the underlying AI foundation, decision engine, visual rendering, and deployment strategies.

AICustomer ServiceFinTech

0 likes · 7 min read

How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases

AI Large Model Application Practice

Feb 14, 2025 · Artificial Intelligence

Why Sub‑word Tokenizers Power Modern LLMs: From Characters to Tokens

This article explains how language models evolved from character‑level embeddings to word‑level and finally to sub‑word tokenizers, highlighting the efficiency, vocabulary coverage, and practical engineering challenges of sub‑word segmentation in modern AI systems.

AI fundamentalsLLMSubword Tokenization

0 likes · 8 min read

Why Sub‑word Tokenizers Power Modern LLMs: From Characters to Tokens

Xiaohongshu Tech REDtech

Jan 16, 2025 · Artificial Intelligence

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

The authors present a semantic‑graph‑enhanced uncertainty modeling framework that captures token, sentence, and paragraph dependencies, propagates uncertainty through entity relations and contradiction probabilities, and achieves roughly a 20 % gain in paragraph‑level hallucination detection on WikiBio and NoteSum compared with existing uncertainty‑based baselines.

Semantic GraphSentence-level ModelingToken-level Modeling

0 likes · 13 min read

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

Java Architecture Diary

Jan 8, 2025 · Backend Development

How to Use Spring AI MCP with Chat2DB for Natural Language Database Queries

This tutorial explains how to integrate Spring AI's Model Context Protocol (MCP) with Chat2DB to enable secure, natural‑language queries against a PostgreSQL database, covering configuration, core Java code, execution flow, and sample SQL queries.

Chat2DBDatabase QueryMCP

0 likes · 8 min read

How to Use Spring AI MCP with Chat2DB for Natural Language Database Queries

Data Thinking Notes

Jan 7, 2025 · Databases

Unlocking LLM-Powered Text-to-SQL: From Basics to Cutting-Edge Techniques

This article provides a comprehensive overview of LLM-based Text-to-SQL technology, covering its background, evolution, challenges, various LLM-driven methods, benchmark datasets, evaluation metrics, and future research directions to guide researchers and practitioners in advancing natural language interfaces for databases.

DatabaseLLMText-to-SQL

0 likes · 18 min read

Unlocking LLM-Powered Text-to-SQL: From Basics to Cutting-Edge Techniques

Baidu Geek Talk

Dec 11, 2024 · Artificial Intelligence

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

This article examines how an AI‑powered essay‑grading platform combines PaddleOCR and Baidu's Wenxin large model to automate scoring, generate personalized feedback, and reduce teachers' grading workload by over six times, while improving student learning outcomes across hundreds of Chinese schools.

AIEducation TechnologyPaddleOCR

0 likes · 11 min read

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

Architect's Alchemy Furnace

Sep 16, 2024 · Artificial Intelligence

Why Transformers Revolutionize AI: From Basics to Advanced Applications

This article explains what AI Transformers are, why they matter, their key components and mechanisms, various applications ranging from language processing to bioinformatics, and how they differ from traditional neural networks, providing a comprehensive overview of Transformer architecture and its impact on modern AI research.

AISelf-AttentionTransformer

0 likes · 20 min read

Why Transformers Revolutionize AI: From Basics to Advanced Applications

Ops Development & AI Practice

Aug 20, 2024 · Artificial Intelligence

How ERobot Redefines No-Code AI Automation with Natural Language

The article examines Hugging Face's ERobot, an AI model that leverages Transformer-based pre‑trained models to execute a wide range of automation tasks through natural‑language commands, discusses its technical foundations, real‑world applications, future prospects, and the challenges it must overcome.

Hugging FaceTask AutomationTransformer

0 likes · 8 min read

How ERobot Redefines No-Code AI Automation with Natural Language

DaTaobao Tech

Aug 16, 2024 · Artificial Intelligence

Effective Prompt Design for Large Language Models

Effective prompt design for large language models requires clear goals, relevant context, explicit input/output formats, evaluation criteria, and illustrative examples, combined with specific language, step‑by‑step instructions, edge‑case handling, ethical considerations, and proper tokenization, encoding, decoding, and post‑processing to produce accurate, concise, low‑hallucination responses.

AIlarge language modelsnatural language processing

0 likes · 33 min read

Effective Prompt Design for Large Language Models

JD Tech

Jul 11, 2024 · Artificial Intelligence

Intelligent Parcel Identification in JD Express Logistics Using Large Language Models

This article examines the challenges of low parcel matching rates in JD Express logistics and proposes a large‑model‑based intelligent identification system, detailing its architecture, accuracy validation, cost‑saving cache strategy, and future prospects for improved efficiency and personalized services.

AI in e-commerceLogisticsOperational Efficiency

0 likes · 24 min read

Intelligent Parcel Identification in JD Express Logistics Using Large Language Models

Huolala Tech

Jul 9, 2024 · Artificial Intelligence

Building an In-Car Voice Assistant: From Wake‑Word to NLP

This article details the end‑to‑end development of an in‑vehicle voice assistant, covering motivation, functional design, technology stack selection, dialogue flow, privacy, third‑party integration, wake‑word detection, on‑device speech recognition, noise filtering, NLP processing, and deployment considerations.

Speech RecognitionVoice Assistantin‑car technology

0 likes · 18 min read

Building an In-Car Voice Assistant: From Wake‑Word to NLP

Baidu Tech Salon

May 27, 2024 · Artificial Intelligence

Intelligent Agent Technology in Commercial Advertising Platforms: Architecture and Applications

The paper describes Baidu’s AI‑native advertising platform that employs a multi‑agent architecture built on large‑language models—combining large‑small model collaboration, domain SOP‑driven coordination, and long‑term memory—to enable natural‑language understanding, proactive planning, execution and human‑like responses, illustrated by GBI analytics and JarvisBot operations, delivering higher consumption, accuracy, speed and efficiency.

AI-native platformsLLM applicationsLong-term Memory

0 likes · 16 min read

Intelligent Agent Technology in Commercial Advertising Platforms: Architecture and Applications

NewBeeNLP

May 16, 2024 · Artificial Intelligence

How Large Language Models Transform Advertising Copy Generation

This article examines the adoption of large language models for intelligent advertising copy creation, detailing business challenges, model selection criteria, training data preparation, fine‑tuning methods, performance evaluation, deployment results, while highlighting the trade‑offs between model size, cost, and output quality.

AI marketingadvertising copyfine-tuning

0 likes · 20 min read

How Large Language Models Transform Advertising Copy Generation

Architect's Guide

May 13, 2024 · Artificial Intelligence

Understanding the Core Principles of Transformer Architecture

This article explains how Transformer models work by detailing the encoder‑decoder structure, self‑attention, multi‑head attention, positional encoding, and feed‑forward networks, and shows their applications in machine translation, recommendation systems, and large language models.

AITransformerattention mechanism

0 likes · 11 min read

Understanding the Core Principles of Transformer Architecture

ITPUB

Apr 20, 2024 · Artificial Intelligence

Unveiling GPT-4’s Magic: How Large Language Models Learn, Reason, and Translate – A Kid‑Friendly Story

This article uses a playful dialogue to demystify how large language models like GPT‑4 work, covering data collection, vectorization, the transformer’s attention mechanism, position encoding, training stages, multilingual translation, reasoning puzzles, and alignment, all illustrated through the tale of a curious learner named Wuming.

Artificial IntelligenceTransformerattention mechanism

0 likes · 50 min read

Unveiling GPT-4’s Magic: How Large Language Models Learn, Reason, and Translate – A Kid‑Friendly Story

New Oriental Technology

Apr 19, 2024 · Artificial Intelligence

Effective Prompt Engineering for Large Language Models

This article explains how large language models work, why well‑crafted prompts are essential, and presents practical strategies—such as clarity, conciseness, focus, role‑setting, delimiters, few‑shot examples, and step‑by‑step instructions—to help users obtain accurate and relevant responses from AI systems.

AILLM strategiesLarge Language Model

0 likes · 12 min read

Effective Prompt Engineering for Large Language Models

DataFunSummit

Mar 17, 2024 · Databases

Exploring Natural Language Interaction Methods for Database Systems

Postdoctoral researcher Fan Yuankai from Fudan University will present his work on enabling natural-language queries for database systems, covering NL2SQL approaches, reliable ranking mechanisms, and guiding large models to generate accurate SQL, aiming to improve usability for users unfamiliar with query languages.

AI for DatabasesDatabase InteractionNL2SQL

0 likes · 3 min read

Exploring Natural Language Interaction Methods for Database Systems

Bilibili Tech

Feb 18, 2024 · Artificial Intelligence

Bilibili Personal Attack Content Governance: Background, Goals, Methods, and Effectiveness

Bilibili combats personal‑attack and trolling comments by combining sector‑specific keyword databases, user‑group analysis, advanced word‑matching (including pinyin and homophone detection) and multiple NLP/graph models, which has cut personal‑attack reports in entertainment, film and gaming by about 32 % and trolling reports by roughly 25 % between June and December 2023.

BilibiliMachine Learningabusive language detection

0 likes · 12 min read

Bilibili Personal Attack Content Governance: Background, Goals, Methods, and Effectiveness

DataFunSummit

Jan 17, 2024 · Artificial Intelligence

Applying Large Language Models in Zhihu’s Jianqiao Enterprise Analytics Platform

This article shares the practical application of large language models within Zhihu’s internal Jianqiao analytics platform, covering business background, knowledge taxonomy organization, natural‑language‑to‑filter conversion, natural‑language data analysis, and summarizing challenges, solutions, and future outlooks.

AI applicationsEnterprise Analyticsknowledge organization

0 likes · 14 min read

Applying Large Language Models in Zhihu’s Jianqiao Enterprise Analytics Platform

DataFunTalk

Dec 26, 2023 · Artificial Intelligence

The Evolution of AI and Its Challenges in the Data Industry

This article reviews the historical development of artificial intelligence, explains how AI technologies such as large language models are reshaping data processing and analysis, and discusses the practical challenges, trust issues, and governance requirements when applying AI to the data industry.

Data IndustryGPTdata governance

0 likes · 10 min read

The Evolution of AI and Its Challenges in the Data Industry

Alibaba Cloud Big Data AI Platform

Dec 13, 2023 · Artificial Intelligence

Boosting Cross-Lingual Machine Reading Comprehension with X-STA: A New Knowledge Transfer Approach

The X-STA algorithm, introduced by Alibaba Cloud’s PAI and researchers from South China University of Technology, leverages gradient‑decomposed knowledge sharing, teacher‑guided attention, and multi‑level alignment to enhance cross‑lingual machine reading comprehension, achieving state‑of‑the‑art results on three multilingual MRC benchmarks.

Knowledge TransferX-STAcross-lingual

0 likes · 7 min read

Boosting Cross-Lingual Machine Reading Comprehension with X-STA: A New Knowledge Transfer Approach

Model Perspective

Sep 11, 2023 · Artificial Intelligence

Why Chinese Word Segmentation Matters: Techniques, Challenges, and Python Demo

This article explores Chinese word segmentation, illustrating its linguistic nuances with a humorous example, explains key methods—including dictionary‑based, statistical, and deep‑learning approaches—and provides Python code using a simple dictionary algorithm and the popular jieba library to demonstrate practical implementation.

Chinese NLPPythonjieba

0 likes · 6 min read

Why Chinese Word Segmentation Matters: Techniques, Challenges, and Python Demo

Model Perspective

Sep 1, 2023 · Fundamentals

Unlocking the Power of Finite State Transducers: From Theory to Python Implementation

This article introduces finite‑state transducers, explains their mathematical definition, illustrates state‑transition examples such as binary counters, word detection, and parentheses matching, explores key applications in speech synthesis, spell‑checking, lemmatization, transliteration, and lexical analysis, and provides a concise Python implementation.

Pythonautomata theoryfinite state transducer

0 likes · 9 min read

Unlocking the Power of Finite State Transducers: From Theory to Python Implementation

DataFunTalk

Aug 19, 2023 · Artificial Intelligence

Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions

This article details how Zhihu's internal Bridge platform integrates large language models for business analysis, knowledge taxonomy, natural‑language‑to‑filter conversion, and ad‑hoc data queries, describing the workflow, technical hurdles, iterative improvements, and future directions.

AI for business analyticsknowledge taxonomylarge language models

0 likes · 12 min read

Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions

Model Perspective

Aug 10, 2023 · Artificial Intelligence

Understanding Word2Vec: Theory, Architecture, and Python Implementation

This article explains the Word2Vec algorithm, its CBOW and Skip‑Gram architectures, cosine similarity mathematics, training process with negative sampling, and provides a concise Python example using the gensim library.

AIGensimMachine Learning

0 likes · 8 min read

Understanding Word2Vec: Theory, Architecture, and Python Implementation

php Courses

Aug 2, 2023 · Artificial Intelligence

Stanford and UC Berkeley Study Finds Significant Decline in GPT-4 Capabilities Across Math, Coding, and Visual Reasoning

A joint Stanford and UC Berkeley study reveals that GPT‑4’s performance on mathematics, code generation, and visual‑reasoning tasks sharply declined between March and June 2023, with accuracy dropping from 97.6% to 2.4% on a prime‑checking benchmark and executable code rates falling from 52% to 10%.

AI evaluationGPT-4Machine Learning

0 likes · 3 min read

Stanford and UC Berkeley Study Finds Significant Decline in GPT-4 Capabilities Across Math, Coding, and Visual Reasoning

NetEase LeiHuo Testing Center

Jul 14, 2023 · Artificial Intelligence

Using GPT-4 for Game Text Review: A Prompt‑Engineering Case Study

This article documents a step‑by‑step case study of applying GPT‑4 to automatically review and filter game dialogue, detailing the initial challenges, iterative prompt refinements, added safety rules, and the resulting improvements compared with earlier models.

AI moderationGPT-4game text review

0 likes · 6 min read

Using GPT-4 for Game Text Review: A Prompt‑Engineering Case Study

Architecture Digest

Jun 29, 2023 · Databases

Chat2DB: Open‑Source AI‑Powered Multi‑Database Client – Installation, Configuration, and Feature Demonstration

This article introduces Chat2DB, an open‑source AI‑enhanced multi‑database client, walks through its download, installation, configuration, demonstrates natural‑language‑to‑SQL, SQL explanation and optimization features using sample MySQL tables, and outlines future roadmap for developers.

AIChat2DBDatabase client

0 likes · 13 min read

Chat2DB: Open‑Source AI‑Powered Multi‑Database Client – Installation, Configuration, and Feature Demonstration

Programmer DD

Jun 12, 2023 · Artificial Intelligence

Master Prompt Engineering: Guide ChatGPT to Deliver Precise Answers

This article explains prompt engineering for large language models like ChatGPT, covering its definition, essential techniques such as diverse prompting strategies, problem restatement, background provision, gradient prompting, example inclusion, role‑playing, and the importance of systematic experimentation and quantitative evaluation to achieve high‑quality, task‑specific AI outputs.

AIChatGPTlarge language models

0 likes · 16 min read

Master Prompt Engineering: Guide ChatGPT to Deliver Precise Answers

Python Crawling & Data Mining

Jun 4, 2023 · Artificial Intelligence

Master Prompt Engineering: Unlock ChatGPT’s Full Potential

This article explains why crafting good prompts matters more than memorizing facts, introduces prompt engineering for large language models, and provides practical techniques—such as diverse prompting strategies, problem restatement, background provision, gradient prompting, role‑playing, and systematic evaluation—to help users get the most out of ChatGPT and similar AI systems.

AIChatGPTnatural language processing

0 likes · 16 min read

Master Prompt Engineering: Unlock ChatGPT’s Full Potential

Tencent Cloud Developer

Jun 1, 2023 · Artificial Intelligence

A Comprehensive Guide to Decoding Strategies for Text Generation with HuggingFace Transformers

This guide thoroughly explains the major decoding strategies for neural text generation in HuggingFace Transformers—including greedy, beam, diverse beam, sampling, top‑k, top‑p, sample‑and‑rank, beam sampling, and group beam search—detailing their principles, Python implementations with LogitsProcessor components, workflow diagrams, comparative analysis, and references to original research.

Beam SearchSamplingText Generation

0 likes · 60 min read

A Comprehensive Guide to Decoding Strategies for Text Generation with HuggingFace Transformers

Architects' Tech Alliance

May 15, 2023 · Artificial Intelligence

How Transformer Powers ChatGPT: A Deep Dive into Attention and Architecture

This article provides a comprehensive analysis of the Transformer model behind ChatGPT, covering its origin, core mechanisms such as embedding, positional encoding, self‑attention, multi‑head attention, a step‑by‑step translation example, and the broader implications for AI research and industry.

AI ArchitectureChatGPTTransformer

0 likes · 19 min read

How Transformer Powers ChatGPT: A Deep Dive into Attention and Architecture

Rare Earth Juejin Tech Community

May 6, 2023 · Artificial Intelligence

The Art of Asking ChatGPT for High‑Quality Answers – A Complete Guide to Prompt Engineering

This article translates Ibrahim John’s book on prompt engineering for ChatGPT, explaining a wide range of prompting techniques—including instruction, role, seed‑word, zero‑shot, few‑shot, and reinforcement‑learning prompts—through clear English descriptions, formulas, and illustrative examples to help readers obtain high‑quality model outputs.

Artificial IntelligenceChatGPTnatural language processing

0 likes · 32 min read

The Art of Asking ChatGPT for High‑Quality Answers – A Complete Guide to Prompt Engineering

dbaplus Community

Apr 15, 2023 · Artificial Intelligence

Why ChatGPT Isn't a New Revolution: Insights from AI Pioneer Wu Jun

In a live talk, AI veteran Wu Jun explains why the hype around ChatGPT is overblown, traces the history of language models from the 1970s, details the massive compute and data requirements, and discusses the real impact of large‑scale AI on society and work.

AI hypeChatGPTcomputational resources

0 likes · 20 min read

Why ChatGPT Isn't a New Revolution: Insights from AI Pioneer Wu Jun

ByteFE

Apr 3, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Building a CLI Command Translator with the ChatGPT API

This comprehensive tutorial demonstrates how to leverage the official ChatGPT API and advanced prompt engineering techniques to develop a command-line tool that translates natural language instructions into precise shell commands, covering iterative prompt design, constraint implementation, and Node.js integration.

AI toolsCLI DevelopmentChatGPT API

0 likes · 15 min read

Mastering Prompt Engineering: Building a CLI Command Translator with the ChatGPT API

21CTO

Apr 2, 2023 · Artificial Intelligence

Which Jobs Will Vanish and Which Will Thrive with ChatGPT?

The article examines how ChatGPT automates many manual tasks, reducing demand for translators, editors, customer support reps, and data analysts, while boosting opportunities for chatbot developers, NLP engineers, data scientists, content creators, and software developers, and outlines broader industry advancements driven by AI.

AI ImpactChatGPTdata-science

0 likes · 7 min read

Which Jobs Will Vanish and Which Will Thrive with ChatGPT?

Tencent Cloud Developer

Mar 29, 2023 · Artificial Intelligence

GPT-4 Capabilities and Limitations: A Comprehensive Analysis

The article reviews GPT‑4’s expanded visual and coding abilities, modest arithmetic gains, and capacity to use external tools, while highlighting persistent shortcomings in planning, long‑range context, complex calculations, and societal risks such as misinformation, concluding that despite impressive advances it remains far from true artificial general intelligence.

AI capabilitiesAI limitationsArithmetic

0 likes · 14 min read

GPT-4 Capabilities and Limitations: A Comprehensive Analysis

DataFunSummit

Feb 26, 2023 · Artificial Intelligence

Fudan University's MOSS: China's First Conversational Large Language Model

Fudan University's Natural Language Processing Lab introduced MOSS, the country's first conversational large language model capable of dialogue generation, programming, factual QA and ethical reasoning, with plans for open‑source release despite current limitations in Chinese language proficiency.

AIFudan UniversityLarge Language Model

0 likes · 3 min read

Fudan University's MOSS: China's First Conversational Large Language Model

Programmer DD

Feb 21, 2023 · Artificial Intelligence

Meet MOSS: China’s Homegrown ChatGPT Rival and Its Capabilities

MOSS, a Chinese large‑language model released by Fudan University, offers ChatGPT‑like functions such as text generation, summarization, translation, and code writing, while being open‑source and free during preview, yet it still lags behind due to limited data, compute, and model size.

AIChatGPTFudan University

0 likes · 11 min read

Meet MOSS: China’s Homegrown ChatGPT Rival and Its Capabilities

DataFunTalk

Feb 16, 2023 · Artificial Intelligence

Fine‑Grained Entity Recognition in Tencent TexSmart: System Overview and Key Techniques

This article presents an in‑depth overview of Tencent's TexSmart natural‑language understanding system, highlighting its fine‑grained NER capabilities, knowledge‑base combination methods, remote‑supervision via similar entities, multi‑source zero‑shot fusion, experimental results, and practical insights from a recent NLP summit.

Entity TypingFine-grained NERTexSmart

0 likes · 12 min read

Fine‑Grained Entity Recognition in Tencent TexSmart: System Overview and Key Techniques

IT Architects Alliance

Feb 6, 2023 · Artificial Intelligence

Understanding the Transformer Model: A Deep Dive into “Attention Is All You Need”

This article provides a comprehensive, plain‑language walkthrough of the 2017 “Attention Is All You Need” paper, explaining the Transformer’s architecture, core mechanisms such as embedding, positional encoding and self‑attention, and discussing its broader impact on AI research and applications.

AITransformerattention mechanism

0 likes · 17 min read

Understanding the Transformer Model: A Deep Dive into “Attention Is All You Need”

DataFunSummit

Feb 5, 2023 · Artificial Intelligence

Interview with ChatGPT: Understanding Large Language Models, Their Limitations, and Societal Implications

The article presents a detailed interview between Time magazine and ChatGPT, exploring how large language models work, their ethical risks, limitations, and potential societal impact while offering guidance on responsible use and future expectations.

AI ethicsChatGPTMachine Learning

0 likes · 16 min read

Interview with ChatGPT: Understanding Large Language Models, Their Limitations, and Societal Implications

DataFunTalk

Dec 17, 2022 · Artificial Intelligence

Multimodal Pre‑training Techniques and Applications – Overview, OPPOVL Dataset, Architecture, and Performance

This article presents a comprehensive overview of multimodal pre‑training, describing its motivation, architecture choices, large‑scale Chinese image‑text dataset construction, training optimizations, performance benchmarks, downstream applications, and a Q&A session that highlights practical deployment considerations.

Model architectureMultimodalcomputer vision

0 likes · 16 min read

Multimodal Pre‑training Techniques and Applications – Overview, OPPOVL Dataset, Architecture, and Performance

Alibaba Cloud Big Data AI Platform

Dec 7, 2022 · Artificial Intelligence

Alibaba Cloud PAI’s Few-Shot NLP Breakthroughs at EMNLP 2022

At EMNLP 2022, Alibaba Cloud’s PAI platform showcased three pioneering few-shot NLP papers—KECP, SpanProto, and UPT—demonstrating advanced prompt-tuning techniques, knowledge-enhanced models, and a unified learning paradigm that push the boundaries of low-resource language understanding.

Alibaba CloudEMNLP 2022natural language processing

0 likes · 8 min read

Alibaba Cloud PAI’s Few-Shot NLP Breakthroughs at EMNLP 2022

Model Perspective

Nov 17, 2022 · Artificial Intelligence

How Mathematics Sparked the Rise of Modern Linguistics and NLP

This article traces the historical convergence of mathematics and linguistics, from 19th‑century pioneers to post‑war computer‑driven research, highlighting how statistical, probabilistic, and formal methods laid the foundation for machine translation, morphological analysis, and contemporary natural language processing.

history of linguisticsmachine translationmathematical linguistics

0 likes · 7 min read

How Mathematics Sparked the Rise of Modern Linguistics and NLP

DataFunTalk

Nov 8, 2022 · Artificial Intelligence

Retrieval-Based Dialogue System Framework for Customer Service: Architecture, Retrieval, Ranking, and Practical Applications

This article presents a comprehensive retrieval‑based dialogue system designed to assist customer‑service agents by recommending candidate replies, detailing its five‑layer architecture, metric suite, text and vector retrieval modules, ranking strategies, and real‑world deployment results across multiple business scenarios.

AICustomer Servicedialogue system

0 likes · 34 min read

Retrieval-Based Dialogue System Framework for Customer Service: Architecture, Retrieval, Ranking, and Practical Applications

DataFunSummit

Oct 20, 2022 · Artificial Intelligence

End-to-End Speech Relation Extraction

This paper presents an end‑to‑end approach for extracting relational triples directly from speech signals, bypassing intermediate transcription, and demonstrates its effectiveness on synthesized speech versions of the CoNLL04 and TACRED datasets, highlighting challenges such as length constraints and cross‑modal alignment.

End-to-EndMultimodalnatural language processing

0 likes · 17 min read

DataFunTalk

Oct 16, 2022 · Artificial Intelligence

Query Understanding and Processing in E‑commerce Search Systems

This article explains the end‑to‑end pipeline of query understanding for e‑commerce search, covering preprocessing, segmentation, spell correction, normalization, and expansion, and discusses both academic research and industry implementations with examples and references.

Query ProcessingQuery Rewritingnatural language processing

0 likes · 13 min read

Query Understanding and Processing in E‑commerce Search Systems

DataFunTalk

Sep 27, 2022 · Artificial Intelligence

Contrastive Learning for Text Generation: Motivation, Methodology, Experiments, and Discussion (CoNT Framework)

This article reviews the integration of contrastive learning into text generation, explains why it helps mitigate exposure bias, introduces the CoNT framework with three key improvements, presents extensive experiments on translation, summarization, code comment and data‑to‑text tasks, and discusses practical deployment considerations.

AICoNTText Generation

0 likes · 21 min read

Contrastive Learning for Text Generation: Motivation, Methodology, Experiments, and Discussion (CoNT Framework)

Model Perspective

Aug 27, 2022 · Artificial Intelligence

How AI Can Summarize Research Papers Instantly with paper-digest

This article introduces paper-digest, an AI‑driven assistant that parses full research papers to generate concise, clear summaries, helping students read more efficiently, researchers share findings faster, and science communicators produce effective overviews.

AIResearch Toolsacademic productivity

0 likes · 3 min read

How AI Can Summarize Research Papers Instantly with paper-digest

Baidu Geek Talk

Aug 15, 2022 · Artificial Intelligence

GEEK TALK: Practical Applications of Augmented Analysis

The article explains Augmented Analysis—using AI‑driven natural‑language queries, intelligent assistants, and automated business insights—to enable non‑technical users to explore data, gain actionable recommendations, and boost business value, illustrated with real‑world use cases and practical guidance for embedding the technology into everyday workflows.

AI in AnalyticsAugmented Analysisbusiness intelligence

0 likes · 9 min read

GEEK TALK: Practical Applications of Augmented Analysis

JD Cloud Developers

Aug 15, 2022 · Artificial Intelligence

How FCA Doubles BERT’s Inference Speed with Less Than 1% Accuracy Loss

This article explains how the Fine‑ and Coarse‑Granularity Hybrid Self‑Attention (FCA) mechanism reduces BERT’s computational cost by over 50% while keeping accuracy loss under 1%, detailing the method, experimental results, and its significance for efficient large‑scale language models.

BERTFCASelf-Attention

0 likes · 8 min read

How FCA Doubles BERT’s Inference Speed with Less Than 1% Accuracy Loss

Huawei Cloud Developer Alliance

Jul 26, 2022 · Artificial Intelligence

Unlock AI-Driven Optimization: Join the NL4Opt Challenge at NeurIPS 2022

The NL4Opt competition at NeurIPS 2022, co‑hosted by Huawei Cloud and two Canadian universities, introduces the first natural‑language‑based mathematical‑modeling dataset, offering named‑entity‑recognition and model‑generation tracks, a detailed schedule, and prize incentives to advance AI‑powered optimization research.

AI competitionNL4OptNeurIPS

0 likes · 5 min read

Unlock AI-Driven Optimization: Join the NL4Opt Challenge at NeurIPS 2022

Meituan Technology Team

Jul 21, 2022 · Artificial Intelligence

Overview of Meituan Technical Team Papers Featured at ACM SIGIR 2022 and Related Works

The article highlights ten representative Meituan technical papers accepted at ACM SIGIR 2022, spanning personalized opinion tagging, cross‑domain sentiment classification, dialogue summarization transfer, universal retrieval, CTR prediction, image behavior modeling, and topic segmentation, each summarized with abstracts and download links for researchers.

Machine LearningRecommendation Systemscross-domain learning

0 likes · 25 min read

Overview of Meituan Technical Team Papers Featured at ACM SIGIR 2022 and Related Works

DataFunSummit

Jun 21, 2022 · Artificial Intelligence

JiuGe: An Automatic Chinese Classical Poetry Generation System – Algorithms and Research Overview

This article presents the JiuGe system developed by THUNLP for automatically generating Chinese classical poetry, detailing its research motivations, model architecture—including salient‑clue, working‑memory, topic‑memory, style‑transfer and reinforcement‑learning components—implementation, applications, and future directions.

Artificial IntelligencePoetry Generationdeep learning

0 likes · 18 min read

JiuGe: An Automatic Chinese Classical Poetry Generation System – Algorithms and Research Overview

360 Quality & Efficiency

Jun 10, 2022 · Artificial Intelligence

Overview of Modern Text Summarization Techniques

This article reviews contemporary text summarization methods, covering extractive approaches such as TextRank and clustering, abstractive models like Seq2Seq with attention, pointer‑generator networks, and recent pre‑trained transformers including BART, CPT and PEGASUS, highlighting their strengths, limitations, and combined strategies.

abstractive modelsextractive methodsnatural language processing

0 likes · 13 min read

Overview of Modern Text Summarization Techniques

Meituan Technology Team

May 26, 2022 · Artificial Intelligence

Span-Level Dialogue Summarization via Distant Supervision and Machine Reading Comprehension (DSMRC‑S)

The paper reviews classic summarization models, then proposes DSMRC‑S, a span-level extractive dialogue summarization method using distant supervision and a machine‑reading‑comprehension framework, with token‑level labeling and density‑based span selection, achieving state‑of‑the‑art BLEU and ROUGE improvements on a large Meituan dialogue dataset.

BERTDialogue Summarizationmachine reading comprehension

0 likes · 33 min read

Span-Level Dialogue Summarization via Distant Supervision and Machine Reading Comprehension (DSMRC‑S)

Java Architect Essentials

May 8, 2022 · Artificial Intelligence

How Tsinghua’s WantWords Reverse Dictionary Works and Why It Matters

WantWords, an open‑source reverse dictionary from Tsinghua University, lets users input a description and receive matching words across Chinese and English, leveraging a multi‑channel model from a AAAI‑20 paper and offering customizable part‑of‑speech and rhyme options.

AI modelOpen SourceTsinghua University

0 likes · 5 min read

How Tsinghua’s WantWords Reverse Dictionary Works and Why It Matters

DataFunTalk

Mar 16, 2022 · Artificial Intelligence

A Survey of Entity Linking: Definitions, Methods, and Applications

This article provides a comprehensive overview of entity linking, detailing its definition, the two-stage pipeline of entity recognition and disambiguation, common methodologies such as candidate generation and ranking, advanced approaches, challenges like unlinkable mentions, and various applications in knowledge graphs, text mining, and question answering.

entity linkinginformation retrievalnatural language processing

0 likes · 15 min read

A Survey of Entity Linking: Definitions, Methods, and Applications

Baobao Algorithm Notes

Mar 3, 2022 · Artificial Intelligence

How Hierarchical Curriculum Learning Improves Dialogue Response Selection

This article explains how treating negative response candidates with varying difficulty through a hierarchical curriculum learning framework—combining corpus‑level and instance‑level curricula—enhances dialogue response selection models, backed by experiments on Douban, Ubuntu, and E‑Commerce datasets.

Curriculum Learningdialogue response selectionhierarchical learning

0 likes · 8 min read

How Hierarchical Curriculum Learning Improves Dialogue Response Selection

Youku Technology

Feb 28, 2022 · Artificial Intelligence

Seq2Path: Generating Sentiment Tuples as Paths of a Tree

Seq2Path treats each sentiment tuple as an independent tree path, training with average path loss and decoding via constrained beam search with a discriminative token, achieving state‑of‑the‑art results on five aspect‑based sentiment analysis datasets and deployment in Alibaba Entertainment AI Brain.

Beam SearchSentiment AnalysisSeq2Seq

0 likes · 3 min read

Seq2Path: Generating Sentiment Tuples as Paths of a Tree

DataFunSummit

Jan 25, 2022 · Artificial Intelligence

Intelligent Lyric Generation for Music: Techniques, Models, and Future Directions

This article explores how AI and natural language processing technologies are applied to music lyric creation, covering background challenges, rhyme retrieval methods, advanced language models such as SongNet, decoding strategies, style transfer, and a multi‑level generation platform that aims to streamline professional songwriting.

AI lyric generationLanguage ModelsSongNet

0 likes · 14 min read

Intelligent Lyric Generation for Music: Techniques, Models, and Future Directions

DataFunTalk

Jan 9, 2022 · Artificial Intelligence

Information Extraction for Unstructured Text: From Closed to Open

This presentation reviews the concepts, tasks, and challenges of information extraction from unstructured text, covering closed and open settings, relation extraction, joint extraction, and open extraction methods, and discusses recent advances such as segment‑attention, global‑rationale models, ETL, TPLinker, and maximal‑clique based approaches with experimental results.

information extractionjoint extractionknowledge graph

0 likes · 18 min read

Information Extraction for Unstructured Text: From Closed to Open

DataFunTalk

Jan 5, 2022 · Artificial Intelligence

Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing

This talk presents a series of graph‑based techniques for natural language processing, including the Story Forest system for hot event discovery, the GIANT framework for ontology creation and user interest modeling, and a divide‑and‑conquer approach to long‑text matching that leverages graph neural networks and community detection.

event detectiongraph mininggraph neural networks

0 likes · 19 min read

Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing

Code DAO

Dec 12, 2021 · Artificial Intelligence

Lightning Flash 0.3 Introduces New Tasks, Visualization Tools, Data Pipelines, and Registry API

Lightning Flash 0.3 expands the PyTorch Lightning ecosystem with eight new computer‑vision and NLP tasks, modular API design, integrated model hubs, visualisation callbacks, customizable data‑source hooks, and a central registry for model backbones, all illustrated with concrete code examples.

Lightning FlashPyTorch Lightningcomputer vision

0 likes · 7 min read

Lightning Flash 0.3 Introduces New Tasks, Visualization Tools, Data Pipelines, and Registry API

Youku Technology

Dec 2, 2021 · Artificial Intelligence

Hybrid Curriculum Learning for Emotion Recognition in Conversation

The paper introduces a hybrid curriculum learning framework that tackles emotion shift and confusing labels in emotion recognition in conversation by applying nested curriculum stages at both conversation and utterance levels, enabling a progressive easy‑to‑hard training that markedly boosts classic ERC model performance across four public datasets and is already deployed in Alibaba’s entertainment AI brain script health‑check service.

Curriculum LearningEmotion Recognitionconversation analysis

0 likes · 2 min read

Hybrid Curriculum Learning for Emotion Recognition in Conversation

DataFunSummit

Nov 27, 2021 · Artificial Intelligence

Knowledge Graph Construction, Applications, and Recent Advances in Entity Linking

This article reviews the fundamentals of knowledge graphs, their practical uses in question answering, search and recommendation, and surveys recent research on entity linking—including dual‑encoder retrieval, BERT‑based models, multilingual approaches, and zero‑shot methods—while also outlining modern knowledge‑graph construction pipelines and open challenges.

Machine Learningentity linkinginformation extraction

0 likes · 21 min read

Knowledge Graph Construction, Applications, and Recent Advances in Entity Linking

JD Retail Technology

Nov 16, 2021 · Artificial Intelligence

Intelligent Online Selling Point Extraction for E‑Commerce Recommendation (IOSPE) Wins AAAI 2022 Innovation Award

The IOSPE system, which uses BERT‑based scoring, transformer‑pointer generation, and personalized distribution to automatically extract and generate selling points for millions of e‑commerce products, earned the AAAI 2022 Artificial Intelligence Innovation Application Award and has boosted click‑through rates and user dwell time across JD.com platforms.

AIBERTInnovation Award

0 likes · 6 min read

Intelligent Online Selling Point Extraction for E‑Commerce Recommendation (IOSPE) Wins AAAI 2022 Innovation Award

58 Tech

Nov 16, 2021 · Artificial Intelligence

Deep Optimization of the 58 Yellow Pages Smart Chat Assistant for Enhanced User Experience and Business Opportunity Conversion

This article details the development and continuous optimization of 58.com’s Yellow Pages smart chat assistant, covering background, metrics, model improvements for QABot and TaskBot, slot extraction, quality assessment, and future directions, resulting in near‑human conversion rates and significant operational savings.

AIBusiness OpportunityChatbot

0 likes · 22 min read

Deep Optimization of the 58 Yellow Pages Smart Chat Assistant for Enhanced User Experience and Business Opportunity Conversion

DataFunTalk

Nov 14, 2021 · Artificial Intelligence

Knowledge Graph Construction and Entity Linking: Techniques, Applications, and Recent Advances

This article provides a comprehensive overview of knowledge graphs and entity linking, covering their definitions, practical uses in question answering, search and recommendation, the standard pipeline of mention detection, candidate generation and scoring, challenges such as scalability and multilinguality, and recent research advances including dual‑encoder, RELIC, deep retrieval, and multilingual BERT‑based models, followed by a discussion of modern knowledge‑graph construction methods.

AIentity linkingknowledge graph

0 likes · 21 min read

Knowledge Graph Construction and Entity Linking: Techniques, Applications, and Recent Advances

Dada Group Technology

Oct 29, 2021 · Artificial Intelligence

Query Understanding in JD Daojia E‑commerce Search: Architecture, Core Algorithms, and Experimental Results

This article presents a comprehensive overview of JD Daojia's query understanding system for e‑commerce search, detailing its overall architecture, core modules such as tokenization, term weighting, query rewriting, intent detection, the algorithms employed, experimental evaluations, and future directions.

natural language processingquery understandingsearch engine

0 likes · 27 min read

Query Understanding in JD Daojia E‑commerce Search: Architecture, Core Algorithms, and Experimental Results

Meituan Technology Team

Sep 30, 2021 · Artificial Intelligence

Meituan's Intelligent Customer Service Technology and Practice

Meituan’s intelligent customer service platform, serving over 630 million users and 7.7 million merchants, integrates six core AI capabilities—including problem recommendation, understanding, dialogue management, answer supply, response recommendation, and session summarization—across pre‑sale, in‑sale, after‑sale and internal scenarios, leveraging multi‑turn dialogue, intent recognition, knowledge‑graph Q&A, and the Moses platform, while targeting future end‑to‑end and emotionally intelligent interactions.

BERTDialogue SystemsIntelligent Customer Service

0 likes · 23 min read

Meituan's Intelligent Customer Service Technology and Practice

Alibaba Cloud Developer

Sep 26, 2021 · Artificial Intelligence

Controllable Text Generation: Models, Techniques, and Real-World Applications

This comprehensive article surveys controllable text generation, covering core NLP concepts, model architectures, evaluation metrics, four main control strategies, recent research trends, and a practical e‑commerce query‑generation case study.

controllable text generationevaluation metricsnatural language processing

0 likes · 33 min read

Controllable Text Generation: Models, Techniques, and Real-World Applications

Volcano Engine Developer Services

Sep 25, 2021 · Artificial Intelligence

Cutting‑Edge AI from ByteDance & OPPO: Audio, NLP, and Translation

The ByteDance Engine Developer Community Meetup featured senior engineers from ByteDance and OPPO who presented the latest advances in intelligent audio signal processing, natural language processing for recommendation, entity linking in knowledge graphs, and multimedia machine translation, highlighting practical applications and performance challenges.

Artificial IntelligenceRecommendation Systemsknowledge graph

0 likes · 4 min read

Cutting‑Edge AI from ByteDance & OPPO: Audio, NLP, and Translation

DataFunTalk

Sep 24, 2021 · Artificial Intelligence

Intelligent Question Answering in QQ Browser Search Engine: KBQA, DeepQA, and IRQA

This article presents the architecture, techniques, and practical solutions behind intelligent question answering in QQ Browser's search engine, covering knowledge‑graph based QA (KBQA), machine‑reading‑comprehension QA (DeepQA), and information‑retrieval QA (IRQA), and discusses system design, model optimization, and future directions.

AIinformation retrievalknowledge graph

0 likes · 23 min read

Intelligent Question Answering in QQ Browser Search Engine: KBQA, DeepQA, and IRQA

DataFunTalk

Sep 22, 2021 · Artificial Intelligence

Baidu Unveils PLATO-XL: A 110‑Billion‑Parameter Bilingual Dialogue Generation Model

Baidu's newly released PLATO‑XL, a 110‑billion‑parameter bilingual pre‑training dialogue model, surpasses previous large‑scale models, introduces multi‑role awareness for consistent multi‑turn conversations, and demonstrates state‑of‑the‑art performance across open‑domain, knowledge‑grounded, and task‑oriented dialogue tasks.

Large Language ModelPLATO-XLbilingual AI

0 likes · 9 min read

Baidu Unveils PLATO-XL: A 110‑Billion‑Parameter Bilingual Dialogue Generation Model

HaoDF Tech Team

Sep 15, 2021 · Artificial Intelligence

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

This article describes how Haodf Online improved its medical question‑answer search by analyzing search challenges, adopting semantic similarity models based on pre‑trained language embeddings, designing contrastive training tasks, and evaluating the resulting increase in click‑through rate and user engagement.

medical AImodel optimizationnatural language processing

0 likes · 12 min read

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

DataFunTalk

Aug 31, 2021 · Artificial Intelligence

Applying Knowledge Graphs for Clinical VTE Risk Assessment: A Case Study from HuiMei Technology

This article describes how HuiMei Technology leverages a medical knowledge graph, natural‑language processing, and AI‑driven scoring to automate venous thromboembolism (VTE) risk assessment in large hospitals, detailing the business background, technical architecture, implementation workflow, and ongoing research directions.

SNOMED-CTVTEclinical risk assessment

0 likes · 16 min read

Applying Knowledge Graphs for Clinical VTE Risk Assessment: A Case Study from HuiMei Technology

DeWu Technology

Jul 18, 2021 · Artificial Intelligence

Deep Learning Techniques for Sentiment Analysis

The article explains how deep‑learning models, particularly convolutional neural networks with token‑level padding, kernel size three, and max‑pooling, can automatically classify e‑commerce product reviews into eight sentiment categories, offering scalable insight for decision‑making and paving the way for recommendation, QA, and risk‑assessment applications.

Sentiment Analysisconvolutional neural networkdeep learning

0 likes · 9 min read

Deep Learning Techniques for Sentiment Analysis

Meituan Technology Team

Jun 24, 2021 · Artificial Intelligence

Construction and Application of Meituan's Common‑Sense Concept Knowledge Graph

The paper describes Meituan’s common‑sense concept knowledge graph, detailing a multi‑stage construction pipeline—concept, hierarchy, attribute, bridging, and POI/SPU linking—using BERT, XGBoost, and graph neural networks, and demonstrates its deployment in category‑word enrichment, search suggestions, and medical‑beauty tagging, achieving over two million concepts, three million relations, and roughly 90 % accuracy.

Machine Learningconcept miningentity linking

0 likes · 26 min read

Construction and Application of Meituan's Common‑Sense Concept Knowledge Graph

DataFunTalk

May 8, 2021 · Artificial Intelligence

Attribute‑Level Sentiment Analysis for E‑commerce: Tasks, Challenges, and System Design

This article presents a comprehensive overview of sentiment analysis in user‑generated content, detailing document‑, sentence‑, and aspect‑level tasks, defining the Aspect Sentiment Triplet Extraction problem for e‑commerce reviews, describing a three‑stage pipeline with pre‑training, multi‑domain modeling and attribute normalization, and reporting significant business improvements such as 400% CTR lift, while also discussing data imbalance, annotation scarcity, and future research directions.

Machine LearningSentiment Analysisaspect based sentiment

0 likes · 15 min read

Attribute‑Level Sentiment Analysis for E‑commerce: Tasks, Challenges, and System Design