Tagged articles
190 articles
Page 1 of 2
Machine Heart
Machine Heart
May 19, 2026 · Artificial Intelligence

100k‑Token Natural‑Language Reasoning Enables a 30B‑A3B Model to Reach Olympiad Gold Level

A 30B‑A3B model, trained with reverse‑perplexity supervised fine‑tuning, two‑stage reinforcement learning, and a multi‑round generate‑verify‑revise inference loop, achieves gold‑medal performance on IMO, USAMO and IPhO contests using over 100 k token natural‑language reasoning without external tools.

30B-A3Bnatural language processingolympiad AI
0 likes · 11 min read
100k‑Token Natural‑Language Reasoning Enables a 30B‑A3B Model to Reach Olympiad Gold Level
Architects' Tech Alliance
Architects' Tech Alliance
May 8, 2026 · Artificial Intelligence

Token Fundamentals: A Technical Panorama of AI Language Units

Tokens are the smallest language building blocks that AI models process, representing characters, words, subwords, punctuation or emojis; they determine context window size and generation speed, so tokenization directly impacts model understanding accuracy and efficiency, as explained in the 2026 Token Report.

AI fundamentalsLanguage Modelscontext window
0 likes · 4 min read
Token Fundamentals: A Technical Panorama of AI Language Units
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 21, 2026 · Artificial Intelligence

Why Do Papers with a '?' in the Title Achieve a 45% Acceptance Rate? A Five‑Year ICLR Keyword Analysis

Analyzing five years of ICLR submission metadata reveals that titles containing a question mark boost acceptance to 45.5% in 2022, while emerging keywords such as diffusion, sparse, and planning dominate high‑acceptance lists, and older topics like federated learning, adversarial attacks, and security suffer low acceptance and high withdrawal rates.

Data AnalysisICLRMachine Learning
0 likes · 8 min read
Why Do Papers with a '?' in the Title Achieve a 45% Acceptance Rate? A Five‑Year ICLR Keyword Analysis
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 17, 2026 · Artificial Intelligence

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

The article explains why conventional Retrieval‑Augmented Generation often produces incorrect answers despite retrieving relevant documents, introduces the Disco‑RAG framework that adds a structured reading step using argument trees and relation graphs, and shows how this three‑step approach dramatically improves performance on long‑document and ambiguous‑question benchmarks without any model training.

Disco-RAGRAGRetrieval-Augmented Generation
0 likes · 13 min read
When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 6, 2026 · Artificial Intelligence

Why Learning from Context Is Harder Than We Thought

The talk examines why large language models, despite impressive performance on knowledge‑based tasks, struggle dramatically when required to learn new information from the immediate input context, analyzes systematic biases behind this limitation, and explores rubric‑based synthesis as a potential remedy.

context learninglarge language modelsnatural language processing
0 likes · 4 min read
Why Learning from Context Is Harder Than We Thought
Qborfy AI
Qborfy AI
Feb 21, 2026 · Artificial Intelligence

How Self-Attention Powers Modern AI: From Theory to Real-World Impact

This article explains the self‑attention mechanism behind transformers, detailing its core components, mathematical formulation, step‑by‑step example, multi‑head extension, industry use cases, and a thorough comparison with RNN and CNN approaches, all supported by concrete numbers and citations.

Self-AttentionTransformerattention mechanism
0 likes · 8 min read
How Self-Attention Powers Modern AI: From Theory to Real-World Impact
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jan 30, 2026 · Backend Development

Text2DSL: Convert Natural Language to Precise Elasticsearch/Easysearch DSL

Text2DSL lets users describe search requirements in plain language, uses DeepSeek to generate Elasticsearch DSL, validates the DSL locally with Elasticsearch/Easysearch, iteratively refines it up to five times, and achieves over 95% first‑try accuracy while cutting query‑building time by at least threefold.

DSL generationDeepSeekEasysearch
0 likes · 12 min read
Text2DSL: Convert Natural Language to Precise Elasticsearch/Easysearch DSL
Cognitive Technology Team
Cognitive Technology Team
Nov 5, 2025 · Artificial Intelligence

How AI Is Revolutionizing End-to-End Test Automation at Tmall

Leveraging AI and natural language processing, Tmall’s quality assurance team transformed traditional manual testing into a semi‑automated and fully automated pipeline—covering requirement analysis, test case generation, data construction, execution, and validation—resulting in significant efficiency gains, traceability, and continuous improvement across multiple business lines.

AI testingSoftware qualitycontinuous integration
0 likes · 10 min read
How AI Is Revolutionizing End-to-End Test Automation at Tmall
dbaplus Community
dbaplus Community
Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG
0 likes · 17 min read
How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms
Data Party THU
Data Party THU
Oct 11, 2025 · Artificial Intelligence

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

This article surveys the most influential large language models released since 2017, detailing the core innovations of Transformer, BERT, GPT series, T5, Retrieval‑Augmented Generation, and the latest LLaMA and Meta models, while highlighting their architectures, training paradigms, and impact on NLP research.

LLMModel Scalinglarge language models
0 likes · 21 min read
From Transformers to LLaMA 4: A Journey Through the Biggest LLMs
Baidu Geek Talk
Baidu Geek Talk
Sep 15, 2025 · Artificial Intelligence

How Baidu’s AI Navigation Turns Voice Commands into Precise Actions

This article explains how Baidu Map’s AI navigation system converts spoken queries into accurate map instructions by combining speech recognition, intent parsing, large‑language‑model reasoning, tool calling, and memory‑reflection techniques, showcasing the underlying technologies that enable instant, context‑aware responses.

AILLMMap Services
0 likes · 13 min read
How Baidu’s AI Navigation Turns Voice Commands into Precise Actions
Baidu Maps Tech Team
Baidu Maps Tech Team
Jul 31, 2025 · Artificial Intelligence

How Baidu’s AI Voice Assistant Turns Speech into Precise Navigation Commands

This article explains how Baidu Map’s AI voice assistant converts spoken commands into precise navigation actions by detailing the speech‑to‑text pipeline, intent parsing, template and generative approaches, tool‑calling mechanisms, memory and reflection capabilities, and future directions for intelligent agents.

AIIntent ParsingLLM
0 likes · 14 min read
How Baidu’s AI Voice Assistant Turns Speech into Precise Navigation Commands
Programmer Xu Shu
Programmer Xu Shu
Jun 23, 2025 · Artificial Intelligence

From Bag‑of‑Words to ChatGPT: How Large Language Models Evolved

Tracing the evolution of large language models—from early bag‑of‑words techniques, through word embeddings, RNNs, attention mechanisms, Transformers, BERT, and GPT—this article explains each breakthrough, its limitations, and how they culminated in ChatGPT’s conversational AI.

AI evolutionChatGPTTransformer
0 likes · 12 min read
From Bag‑of‑Words to ChatGPT: How Large Language Models Evolved
MaGe Linux Operations
MaGe Linux Operations
Jun 15, 2025 · Artificial Intelligence

Mastering Transformers: Key Extensions and Optimization Techniques Explained

This comprehensive guide walks you through the Transformer architecture—from its encoder‑decoder structure and self‑attention mechanism to multi‑head attention, positional embeddings, and practical PyTorch implementations—providing clear visualizations and code examples for deep learning practitioners.

PyTorchSelf-AttentionTransformer
0 likes · 22 min read
Mastering Transformers: Key Extensions and Optimization Techniques Explained
Data Thinking Notes
Data Thinking Notes
Jun 2, 2025 · Artificial Intelligence

Why Pre‑Training Powers Modern AI: From Theory to Real‑World Applications

Pre‑training enables AI models to first acquire a universal knowledge map from massive unlabelled text, then quickly adapt to specific tasks with minimal labelled data, offering superior generalization, reduced annotation costs, and versatile applications across chatbots, content creation, retrieval, coding assistance, and more.

AI applicationsTransformerlarge language models
0 likes · 14 min read
Why Pre‑Training Powers Modern AI: From Theory to Real‑World Applications
AntData
AntData
May 30, 2025 · Artificial Intelligence

DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution

The article presents an in‑depth overview of DeepInsight Copilot, an AI‑driven business intelligence product that streamlines data, information, insight, and decision‑recommendation stages, detailing its functional modules, intelligent agents, multi‑generation technical evolution, architecture, model fine‑tuning, and future challenges and solutions in data analysis.

AIData AnalysisMachine Learning
0 likes · 21 min read
DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution
Beijing SF i-TECH City Technology Team
Beijing SF i-TECH City Technology Team
Apr 7, 2025 · Artificial Intelligence

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

This article explores the application of Large Language Models (LLM) in text information detection and extraction, focusing on blue-collar recruitment data processing. It details the implementation of LLM through prompt engineering, RAG enhancement, and model fine-tuning to improve data cleaning efficiency and accuracy.

AI applicationsLLMRAG
0 likes · 31 min read
LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing
Cognitive Technology Team
Cognitive Technology Team
Mar 30, 2025 · Artificial Intelligence

Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning

Prompt engineering uses in‑context learning to turn large language models into precise, task‑aware assistants by providing well‑crafted prompts that guide the model’s probability distribution, reduce hallucinations, and unlock hidden knowledge without any parameter tuning.

Artificial IntelligenceIn-Context Learninglarge language models
0 likes · 6 min read
Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning
Efficient Ops
Efficient Ops
Mar 16, 2025 · Artificial Intelligence

How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases

This article explains how AI-powered digital humans can modernize banking by offering modular, multi‑modal interaction, personalized multilingual service, 24‑hour availability, and risk‑aware automation, while detailing the underlying AI foundation, decision engine, visual rendering, and deployment strategies.

AICustomer ServiceFinTech
0 likes · 7 min read
How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jan 16, 2025 · Artificial Intelligence

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

The authors present a semantic‑graph‑enhanced uncertainty modeling framework that captures token, sentence, and paragraph dependencies, propagates uncertainty through entity relations and contradiction probabilities, and achieves roughly a 20 % gain in paragraph‑level hallucination detection on WikiBio and NoteSum compared with existing uncertainty‑based baselines.

Semantic GraphSentence-level ModelingToken-level Modeling
0 likes · 13 min read
Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Data Thinking Notes
Data Thinking Notes
Jan 7, 2025 · Databases

Unlocking LLM-Powered Text-to-SQL: From Basics to Cutting-Edge Techniques

This article provides a comprehensive overview of LLM-based Text-to-SQL technology, covering its background, evolution, challenges, various LLM-driven methods, benchmark datasets, evaluation metrics, and future research directions to guide researchers and practitioners in advancing natural language interfaces for databases.

DatabaseLLMText-to-SQL
0 likes · 18 min read
Unlocking LLM-Powered Text-to-SQL: From Basics to Cutting-Edge Techniques
Baidu Geek Talk
Baidu Geek Talk
Dec 11, 2024 · Artificial Intelligence

How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform

This article examines how an AI‑powered essay‑grading platform combines PaddleOCR and Baidu's Wenxin large model to automate scoring, generate personalized feedback, and reduce teachers' grading workload by over six times, while improving student learning outcomes across hundreds of Chinese schools.

AIEducation TechnologyPaddleOCR
0 likes · 11 min read
How AI Cuts Essay Grading Time by 6×: Inside the Smart Writing Platform
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Sep 16, 2024 · Artificial Intelligence

Why Transformers Revolutionize AI: From Basics to Advanced Applications

This article explains what AI Transformers are, why they matter, their key components and mechanisms, various applications ranging from language processing to bioinformatics, and how they differ from traditional neural networks, providing a comprehensive overview of Transformer architecture and its impact on modern AI research.

AISelf-AttentionTransformer
0 likes · 20 min read
Why Transformers Revolutionize AI: From Basics to Advanced Applications
Ops Development & AI Practice
Ops Development & AI Practice
Aug 20, 2024 · Artificial Intelligence

How ERobot Redefines No-Code AI Automation with Natural Language

The article examines Hugging Face's ERobot, an AI model that leverages Transformer-based pre‑trained models to execute a wide range of automation tasks through natural‑language commands, discusses its technical foundations, real‑world applications, future prospects, and the challenges it must overcome.

Hugging FaceTask AutomationTransformer
0 likes · 8 min read
How ERobot Redefines No-Code AI Automation with Natural Language
DaTaobao Tech
DaTaobao Tech
Aug 16, 2024 · Artificial Intelligence

Effective Prompt Design for Large Language Models

Effective prompt design for large language models requires clear goals, relevant context, explicit input/output formats, evaluation criteria, and illustrative examples, combined with specific language, step‑by‑step instructions, edge‑case handling, ethical considerations, and proper tokenization, encoding, decoding, and post‑processing to produce accurate, concise, low‑hallucination responses.

AIlarge language modelsnatural language processing
0 likes · 33 min read
Effective Prompt Design for Large Language Models
JD Tech
JD Tech
Jul 11, 2024 · Artificial Intelligence

Intelligent Parcel Identification in JD Express Logistics Using Large Language Models

This article examines the challenges of low parcel matching rates in JD Express logistics and proposes a large‑model‑based intelligent identification system, detailing its architecture, accuracy validation, cost‑saving cache strategy, and future prospects for improved efficiency and personalized services.

AI in e-commerceLogisticsOperational Efficiency
0 likes · 24 min read
Intelligent Parcel Identification in JD Express Logistics Using Large Language Models
Huolala Tech
Huolala Tech
Jul 9, 2024 · Artificial Intelligence

Building an In-Car Voice Assistant: From Wake‑Word to NLP

This article details the end‑to‑end development of an in‑vehicle voice assistant, covering motivation, functional design, technology stack selection, dialogue flow, privacy, third‑party integration, wake‑word detection, on‑device speech recognition, noise filtering, NLP processing, and deployment considerations.

Speech RecognitionVoice Assistantin‑car technology
0 likes · 18 min read
Building an In-Car Voice Assistant: From Wake‑Word to NLP
Baidu Tech Salon
Baidu Tech Salon
May 27, 2024 · Artificial Intelligence

Intelligent Agent Technology in Commercial Advertising Platforms: Architecture and Applications

The paper describes Baidu’s AI‑native advertising platform that employs a multi‑agent architecture built on large‑language models—combining large‑small model collaboration, domain SOP‑driven coordination, and long‑term memory—to enable natural‑language understanding, proactive planning, execution and human‑like responses, illustrated by GBI analytics and JarvisBot operations, delivering higher consumption, accuracy, speed and efficiency.

AI-native platformsLLM applicationsLong-term Memory
0 likes · 16 min read
Intelligent Agent Technology in Commercial Advertising Platforms: Architecture and Applications
NewBeeNLP
NewBeeNLP
May 16, 2024 · Artificial Intelligence

How Large Language Models Transform Advertising Copy Generation

This article examines the adoption of large language models for intelligent advertising copy creation, detailing business challenges, model selection criteria, training data preparation, fine‑tuning methods, performance evaluation, deployment results, while highlighting the trade‑offs between model size, cost, and output quality.

AI marketingadvertising copyfine-tuning
0 likes · 20 min read
How Large Language Models Transform Advertising Copy Generation
Architect's Guide
Architect's Guide
May 13, 2024 · Artificial Intelligence

Understanding the Core Principles of Transformer Architecture

This article explains how Transformer models work by detailing the encoder‑decoder structure, self‑attention, multi‑head attention, positional encoding, and feed‑forward networks, and shows their applications in machine translation, recommendation systems, and large language models.

AITransformerattention mechanism
0 likes · 11 min read
Understanding the Core Principles of Transformer Architecture
ITPUB
ITPUB
Apr 20, 2024 · Artificial Intelligence

Unveiling GPT-4’s Magic: How Large Language Models Learn, Reason, and Translate – A Kid‑Friendly Story

This article uses a playful dialogue to demystify how large language models like GPT‑4 work, covering data collection, vectorization, the transformer’s attention mechanism, position encoding, training stages, multilingual translation, reasoning puzzles, and alignment, all illustrated through the tale of a curious learner named Wuming.

Artificial IntelligenceTransformerattention mechanism
0 likes · 50 min read
Unveiling GPT-4’s Magic: How Large Language Models Learn, Reason, and Translate – A Kid‑Friendly Story
New Oriental Technology
New Oriental Technology
Apr 19, 2024 · Artificial Intelligence

Effective Prompt Engineering for Large Language Models

This article explains how large language models work, why well‑crafted prompts are essential, and presents practical strategies—such as clarity, conciseness, focus, role‑setting, delimiters, few‑shot examples, and step‑by‑step instructions—to help users obtain accurate and relevant responses from AI systems.

AILLM strategiesLarge Language Model
0 likes · 12 min read
Effective Prompt Engineering for Large Language Models
DataFunSummit
DataFunSummit
Mar 17, 2024 · Databases

Exploring Natural Language Interaction Methods for Database Systems

Postdoctoral researcher Fan Yuankai from Fudan University will present his work on enabling natural-language queries for database systems, covering NL2SQL approaches, reliable ranking mechanisms, and guiding large models to generate accurate SQL, aiming to improve usability for users unfamiliar with query languages.

AI for DatabasesDatabase InteractionNL2SQL
0 likes · 3 min read
Exploring Natural Language Interaction Methods for Database Systems
Bilibili Tech
Bilibili Tech
Feb 18, 2024 · Artificial Intelligence

Bilibili Personal Attack Content Governance: Background, Goals, Methods, and Effectiveness

Bilibili combats personal‑attack and trolling comments by combining sector‑specific keyword databases, user‑group analysis, advanced word‑matching (including pinyin and homophone detection) and multiple NLP/graph models, which has cut personal‑attack reports in entertainment, film and gaming by about 32 % and trolling reports by roughly 25 % between June and December 2023.

BilibiliMachine Learningabusive language detection
0 likes · 12 min read
Bilibili Personal Attack Content Governance: Background, Goals, Methods, and Effectiveness
DataFunSummit
DataFunSummit
Jan 17, 2024 · Artificial Intelligence

Applying Large Language Models in Zhihu’s Jianqiao Enterprise Analytics Platform

This article shares the practical application of large language models within Zhihu’s internal Jianqiao analytics platform, covering business background, knowledge taxonomy organization, natural‑language‑to‑filter conversion, natural‑language data analysis, and summarizing challenges, solutions, and future outlooks.

AI applicationsEnterprise Analyticsknowledge organization
0 likes · 14 min read
Applying Large Language Models in Zhihu’s Jianqiao Enterprise Analytics Platform
DataFunTalk
DataFunTalk
Dec 26, 2023 · Artificial Intelligence

The Evolution of AI and Its Challenges in the Data Industry

This article reviews the historical development of artificial intelligence, explains how AI technologies such as large language models are reshaping data processing and analysis, and discusses the practical challenges, trust issues, and governance requirements when applying AI to the data industry.

Data IndustryGPTdata governance
0 likes · 10 min read
The Evolution of AI and Its Challenges in the Data Industry
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 13, 2023 · Artificial Intelligence

Boosting Cross-Lingual Machine Reading Comprehension with X-STA: A New Knowledge Transfer Approach

The X-STA algorithm, introduced by Alibaba Cloud’s PAI and researchers from South China University of Technology, leverages gradient‑decomposed knowledge sharing, teacher‑guided attention, and multi‑level alignment to enhance cross‑lingual machine reading comprehension, achieving state‑of‑the‑art results on three multilingual MRC benchmarks.

Knowledge TransferX-STAcross-lingual
0 likes · 7 min read
Boosting Cross-Lingual Machine Reading Comprehension with X-STA: A New Knowledge Transfer Approach
Model Perspective
Model Perspective
Sep 11, 2023 · Artificial Intelligence

Why Chinese Word Segmentation Matters: Techniques, Challenges, and Python Demo

This article explores Chinese word segmentation, illustrating its linguistic nuances with a humorous example, explains key methods—including dictionary‑based, statistical, and deep‑learning approaches—and provides Python code using a simple dictionary algorithm and the popular jieba library to demonstrate practical implementation.

Chinese NLPPythonjieba
0 likes · 6 min read
Why Chinese Word Segmentation Matters: Techniques, Challenges, and Python Demo
Model Perspective
Model Perspective
Sep 1, 2023 · Fundamentals

Unlocking the Power of Finite State Transducers: From Theory to Python Implementation

This article introduces finite‑state transducers, explains their mathematical definition, illustrates state‑transition examples such as binary counters, word detection, and parentheses matching, explores key applications in speech synthesis, spell‑checking, lemmatization, transliteration, and lexical analysis, and provides a concise Python implementation.

Pythonautomata theoryfinite state transducer
0 likes · 9 min read
Unlocking the Power of Finite State Transducers: From Theory to Python Implementation
DataFunTalk
DataFunTalk
Aug 19, 2023 · Artificial Intelligence

Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions

This article details how Zhihu's internal Bridge platform integrates large language models for business analysis, knowledge taxonomy, natural‑language‑to‑filter conversion, and ad‑hoc data queries, describing the workflow, technical hurdles, iterative improvements, and future directions.

AI for business analyticsknowledge taxonomylarge language models
0 likes · 12 min read
Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions
php Courses
php Courses
Aug 2, 2023 · Artificial Intelligence

Stanford and UC Berkeley Study Finds Significant Decline in GPT-4 Capabilities Across Math, Coding, and Visual Reasoning

A joint Stanford and UC Berkeley study reveals that GPT‑4’s performance on mathematics, code generation, and visual‑reasoning tasks sharply declined between March and June 2023, with accuracy dropping from 97.6% to 2.4% on a prime‑checking benchmark and executable code rates falling from 52% to 10%.

AI evaluationGPT-4Machine Learning
0 likes · 3 min read
Stanford and UC Berkeley Study Finds Significant Decline in GPT-4 Capabilities Across Math, Coding, and Visual Reasoning
Architecture Digest
Architecture Digest
Jun 29, 2023 · Databases

Chat2DB: Open‑Source AI‑Powered Multi‑Database Client – Installation, Configuration, and Feature Demonstration

This article introduces Chat2DB, an open‑source AI‑enhanced multi‑database client, walks through its download, installation, configuration, demonstrates natural‑language‑to‑SQL, SQL explanation and optimization features using sample MySQL tables, and outlines future roadmap for developers.

AIChat2DBDatabase client
0 likes · 13 min read
Chat2DB: Open‑Source AI‑Powered Multi‑Database Client – Installation, Configuration, and Feature Demonstration
Programmer DD
Programmer DD
Jun 12, 2023 · Artificial Intelligence

Master Prompt Engineering: Guide ChatGPT to Deliver Precise Answers

This article explains prompt engineering for large language models like ChatGPT, covering its definition, essential techniques such as diverse prompting strategies, problem restatement, background provision, gradient prompting, example inclusion, role‑playing, and the importance of systematic experimentation and quantitative evaluation to achieve high‑quality, task‑specific AI outputs.

AIChatGPTlarge language models
0 likes · 16 min read
Master Prompt Engineering: Guide ChatGPT to Deliver Precise Answers
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 4, 2023 · Artificial Intelligence

Master Prompt Engineering: Unlock ChatGPT’s Full Potential

This article explains why crafting good prompts matters more than memorizing facts, introduces prompt engineering for large language models, and provides practical techniques—such as diverse prompting strategies, problem restatement, background provision, gradient prompting, role‑playing, and systematic evaluation—to help users get the most out of ChatGPT and similar AI systems.

AIChatGPTnatural language processing
0 likes · 16 min read
Master Prompt Engineering: Unlock ChatGPT’s Full Potential
Tencent Cloud Developer
Tencent Cloud Developer
Jun 1, 2023 · Artificial Intelligence

A Comprehensive Guide to Decoding Strategies for Text Generation with HuggingFace Transformers

This guide thoroughly explains the major decoding strategies for neural text generation in HuggingFace Transformers—including greedy, beam, diverse beam, sampling, top‑k, top‑p, sample‑and‑rank, beam sampling, and group beam search—detailing their principles, Python implementations with LogitsProcessor components, workflow diagrams, comparative analysis, and references to original research.

Beam SearchSamplingText Generation
0 likes · 60 min read
A Comprehensive Guide to Decoding Strategies for Text Generation with HuggingFace Transformers
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 6, 2023 · Artificial Intelligence

The Art of Asking ChatGPT for High‑Quality Answers – A Complete Guide to Prompt Engineering

This article translates Ibrahim John’s book on prompt engineering for ChatGPT, explaining a wide range of prompting techniques—including instruction, role, seed‑word, zero‑shot, few‑shot, and reinforcement‑learning prompts—through clear English descriptions, formulas, and illustrative examples to help readers obtain high‑quality model outputs.

Artificial IntelligenceChatGPTnatural language processing
0 likes · 32 min read
The Art of Asking ChatGPT for High‑Quality Answers – A Complete Guide to Prompt Engineering
dbaplus Community
dbaplus Community
Apr 15, 2023 · Artificial Intelligence

Why ChatGPT Isn't a New Revolution: Insights from AI Pioneer Wu Jun

In a live talk, AI veteran Wu Jun explains why the hype around ChatGPT is overblown, traces the history of language models from the 1970s, details the massive compute and data requirements, and discusses the real impact of large‑scale AI on society and work.

AI hypeChatGPTcomputational resources
0 likes · 20 min read
Why ChatGPT Isn't a New Revolution: Insights from AI Pioneer Wu Jun
ByteFE
ByteFE
Apr 3, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Building a CLI Command Translator with the ChatGPT API

This comprehensive tutorial demonstrates how to leverage the official ChatGPT API and advanced prompt engineering techniques to develop a command-line tool that translates natural language instructions into precise shell commands, covering iterative prompt design, constraint implementation, and Node.js integration.

AI toolsCLI DevelopmentChatGPT API
0 likes · 15 min read
Mastering Prompt Engineering: Building a CLI Command Translator with the ChatGPT API
21CTO
21CTO
Apr 2, 2023 · Artificial Intelligence

Which Jobs Will Vanish and Which Will Thrive with ChatGPT?

The article examines how ChatGPT automates many manual tasks, reducing demand for translators, editors, customer support reps, and data analysts, while boosting opportunities for chatbot developers, NLP engineers, data scientists, content creators, and software developers, and outlines broader industry advancements driven by AI.

AI ImpactChatGPTdata-science
0 likes · 7 min read
Which Jobs Will Vanish and Which Will Thrive with ChatGPT?
Tencent Cloud Developer
Tencent Cloud Developer
Mar 29, 2023 · Artificial Intelligence

GPT-4 Capabilities and Limitations: A Comprehensive Analysis

The article reviews GPT‑4’s expanded visual and coding abilities, modest arithmetic gains, and capacity to use external tools, while highlighting persistent shortcomings in planning, long‑range context, complex calculations, and societal risks such as misinformation, concluding that despite impressive advances it remains far from true artificial general intelligence.

AI capabilitiesAI limitationsArithmetic
0 likes · 14 min read
GPT-4 Capabilities and Limitations: A Comprehensive Analysis
DataFunSummit
DataFunSummit
Feb 26, 2023 · Artificial Intelligence

Fudan University's MOSS: China's First Conversational Large Language Model

Fudan University's Natural Language Processing Lab introduced MOSS, the country's first conversational large language model capable of dialogue generation, programming, factual QA and ethical reasoning, with plans for open‑source release despite current limitations in Chinese language proficiency.

AIFudan UniversityLarge Language Model
0 likes · 3 min read
Fudan University's MOSS: China's First Conversational Large Language Model
Programmer DD
Programmer DD
Feb 21, 2023 · Artificial Intelligence

Meet MOSS: China’s Homegrown ChatGPT Rival and Its Capabilities

MOSS, a Chinese large‑language model released by Fudan University, offers ChatGPT‑like functions such as text generation, summarization, translation, and code writing, while being open‑source and free during preview, yet it still lags behind due to limited data, compute, and model size.

AIChatGPTFudan University
0 likes · 11 min read
Meet MOSS: China’s Homegrown ChatGPT Rival and Its Capabilities
DataFunTalk
DataFunTalk
Feb 16, 2023 · Artificial Intelligence

Fine‑Grained Entity Recognition in Tencent TexSmart: System Overview and Key Techniques

This article presents an in‑depth overview of Tencent's TexSmart natural‑language understanding system, highlighting its fine‑grained NER capabilities, knowledge‑base combination methods, remote‑supervision via similar entities, multi‑source zero‑shot fusion, experimental results, and practical insights from a recent NLP summit.

Entity TypingFine-grained NERTexSmart
0 likes · 12 min read
Fine‑Grained Entity Recognition in Tencent TexSmart: System Overview and Key Techniques
IT Architects Alliance
IT Architects Alliance
Feb 6, 2023 · Artificial Intelligence

Understanding the Transformer Model: A Deep Dive into “Attention Is All You Need”

This article provides a comprehensive, plain‑language walkthrough of the 2017 “Attention Is All You Need” paper, explaining the Transformer’s architecture, core mechanisms such as embedding, positional encoding and self‑attention, and discussing its broader impact on AI research and applications.

AITransformerattention mechanism
0 likes · 17 min read
Understanding the Transformer Model: A Deep Dive into “Attention Is All You Need”
DataFunTalk
DataFunTalk
Dec 17, 2022 · Artificial Intelligence

Multimodal Pre‑training Techniques and Applications – Overview, OPPOVL Dataset, Architecture, and Performance

This article presents a comprehensive overview of multimodal pre‑training, describing its motivation, architecture choices, large‑scale Chinese image‑text dataset construction, training optimizations, performance benchmarks, downstream applications, and a Q&A session that highlights practical deployment considerations.

Model architectureMultimodalcomputer vision
0 likes · 16 min read
Multimodal Pre‑training Techniques and Applications – Overview, OPPOVL Dataset, Architecture, and Performance
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 7, 2022 · Artificial Intelligence

Alibaba Cloud PAI’s Few-Shot NLP Breakthroughs at EMNLP 2022

At EMNLP 2022, Alibaba Cloud’s PAI platform showcased three pioneering few-shot NLP papers—KECP, SpanProto, and UPT—demonstrating advanced prompt-tuning techniques, knowledge-enhanced models, and a unified learning paradigm that push the boundaries of low-resource language understanding.

Alibaba CloudEMNLP 2022natural language processing
0 likes · 8 min read
Alibaba Cloud PAI’s Few-Shot NLP Breakthroughs at EMNLP 2022
Model Perspective
Model Perspective
Nov 17, 2022 · Artificial Intelligence

How Mathematics Sparked the Rise of Modern Linguistics and NLP

This article traces the historical convergence of mathematics and linguistics, from 19th‑century pioneers to post‑war computer‑driven research, highlighting how statistical, probabilistic, and formal methods laid the foundation for machine translation, morphological analysis, and contemporary natural language processing.

history of linguisticsmachine translationmathematical linguistics
0 likes · 7 min read
How Mathematics Sparked the Rise of Modern Linguistics and NLP
DataFunTalk
DataFunTalk
Nov 8, 2022 · Artificial Intelligence

Retrieval-Based Dialogue System Framework for Customer Service: Architecture, Retrieval, Ranking, and Practical Applications

This article presents a comprehensive retrieval‑based dialogue system designed to assist customer‑service agents by recommending candidate replies, detailing its five‑layer architecture, metric suite, text and vector retrieval modules, ranking strategies, and real‑world deployment results across multiple business scenarios.

AICustomer Servicedialogue system
0 likes · 34 min read
Retrieval-Based Dialogue System Framework for Customer Service: Architecture, Retrieval, Ranking, and Practical Applications
DataFunSummit
DataFunSummit
Oct 20, 2022 · Artificial Intelligence

End-to-End Speech Relation Extraction

This paper presents an end‑to‑end approach for extracting relational triples directly from speech signals, bypassing intermediate transcription, and demonstrates its effectiveness on synthesized speech versions of the CoNLL04 and TACRED datasets, highlighting challenges such as length constraints and cross‑modal alignment.

End-to-EndMultimodalnatural language processing
0 likes · 17 min read
End-to-End Speech Relation Extraction
DataFunTalk
DataFunTalk
Oct 16, 2022 · Artificial Intelligence

Query Understanding and Processing in E‑commerce Search Systems

This article explains the end‑to‑end pipeline of query understanding for e‑commerce search, covering preprocessing, segmentation, spell correction, normalization, and expansion, and discusses both academic research and industry implementations with examples and references.

Query ProcessingQuery Rewritingnatural language processing
0 likes · 13 min read
Query Understanding and Processing in E‑commerce Search Systems
DataFunTalk
DataFunTalk
Sep 27, 2022 · Artificial Intelligence

Contrastive Learning for Text Generation: Motivation, Methodology, Experiments, and Discussion (CoNT Framework)

This article reviews the integration of contrastive learning into text generation, explains why it helps mitigate exposure bias, introduces the CoNT framework with three key improvements, presents extensive experiments on translation, summarization, code comment and data‑to‑text tasks, and discusses practical deployment considerations.

AICoNTText Generation
0 likes · 21 min read
Contrastive Learning for Text Generation: Motivation, Methodology, Experiments, and Discussion (CoNT Framework)
Model Perspective
Model Perspective
Aug 27, 2022 · Artificial Intelligence

How AI Can Summarize Research Papers Instantly with paper-digest

This article introduces paper-digest, an AI‑driven assistant that parses full research papers to generate concise, clear summaries, helping students read more efficiently, researchers share findings faster, and science communicators produce effective overviews.

AIResearch Toolsacademic productivity
0 likes · 3 min read
How AI Can Summarize Research Papers Instantly with paper-digest
Baidu Geek Talk
Baidu Geek Talk
Aug 15, 2022 · Artificial Intelligence

GEEK TALK: Practical Applications of Augmented Analysis

The article explains Augmented Analysis—using AI‑driven natural‑language queries, intelligent assistants, and automated business insights—to enable non‑technical users to explore data, gain actionable recommendations, and boost business value, illustrated with real‑world use cases and practical guidance for embedding the technology into everyday workflows.

AI in AnalyticsAugmented Analysisbusiness intelligence
0 likes · 9 min read
GEEK TALK: Practical Applications of Augmented Analysis
JD Cloud Developers
JD Cloud Developers
Aug 15, 2022 · Artificial Intelligence

How FCA Doubles BERT’s Inference Speed with Less Than 1% Accuracy Loss

This article explains how the Fine‑ and Coarse‑Granularity Hybrid Self‑Attention (FCA) mechanism reduces BERT’s computational cost by over 50% while keeping accuracy loss under 1%, detailing the method, experimental results, and its significance for efficient large‑scale language models.

BERTFCASelf-Attention
0 likes · 8 min read
How FCA Doubles BERT’s Inference Speed with Less Than 1% Accuracy Loss
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jul 26, 2022 · Artificial Intelligence

Unlock AI-Driven Optimization: Join the NL4Opt Challenge at NeurIPS 2022

The NL4Opt competition at NeurIPS 2022, co‑hosted by Huawei Cloud and two Canadian universities, introduces the first natural‑language‑based mathematical‑modeling dataset, offering named‑entity‑recognition and model‑generation tracks, a detailed schedule, and prize incentives to advance AI‑powered optimization research.

AI competitionNL4OptNeurIPS
0 likes · 5 min read
Unlock AI-Driven Optimization: Join the NL4Opt Challenge at NeurIPS 2022
Meituan Technology Team
Meituan Technology Team
Jul 21, 2022 · Artificial Intelligence

Overview of Meituan Technical Team Papers Featured at ACM SIGIR 2022 and Related Works

The article highlights ten representative Meituan technical papers accepted at ACM SIGIR 2022, spanning personalized opinion tagging, cross‑domain sentiment classification, dialogue summarization transfer, universal retrieval, CTR prediction, image behavior modeling, and topic segmentation, each summarized with abstracts and download links for researchers.

Machine LearningRecommendation Systemscross-domain learning
0 likes · 25 min read
Overview of Meituan Technical Team Papers Featured at ACM SIGIR 2022 and Related Works
DataFunSummit
DataFunSummit
Jun 21, 2022 · Artificial Intelligence

JiuGe: An Automatic Chinese Classical Poetry Generation System – Algorithms and Research Overview

This article presents the JiuGe system developed by THUNLP for automatically generating Chinese classical poetry, detailing its research motivations, model architecture—including salient‑clue, working‑memory, topic‑memory, style‑transfer and reinforcement‑learning components—implementation, applications, and future directions.

Artificial IntelligencePoetry Generationdeep learning
0 likes · 18 min read
JiuGe: An Automatic Chinese Classical Poetry Generation System – Algorithms and Research Overview
360 Quality & Efficiency
360 Quality & Efficiency
Jun 10, 2022 · Artificial Intelligence

Overview of Modern Text Summarization Techniques

This article reviews contemporary text summarization methods, covering extractive approaches such as TextRank and clustering, abstractive models like Seq2Seq with attention, pointer‑generator networks, and recent pre‑trained transformers including BART, CPT and PEGASUS, highlighting their strengths, limitations, and combined strategies.

abstractive modelsextractive methodsnatural language processing
0 likes · 13 min read
Overview of Modern Text Summarization Techniques
Meituan Technology Team
Meituan Technology Team
May 26, 2022 · Artificial Intelligence

Span-Level Dialogue Summarization via Distant Supervision and Machine Reading Comprehension (DSMRC‑S)

The paper reviews classic summarization models, then proposes DSMRC‑S, a span-level extractive dialogue summarization method using distant supervision and a machine‑reading‑comprehension framework, with token‑level labeling and density‑based span selection, achieving state‑of‑the‑art BLEU and ROUGE improvements on a large Meituan dialogue dataset.

BERTDialogue Summarizationmachine reading comprehension
0 likes · 33 min read
Span-Level Dialogue Summarization via Distant Supervision and Machine Reading Comprehension (DSMRC‑S)
DataFunTalk
DataFunTalk
Mar 16, 2022 · Artificial Intelligence

A Survey of Entity Linking: Definitions, Methods, and Applications

This article provides a comprehensive overview of entity linking, detailing its definition, the two-stage pipeline of entity recognition and disambiguation, common methodologies such as candidate generation and ranking, advanced approaches, challenges like unlinkable mentions, and various applications in knowledge graphs, text mining, and question answering.

entity linkinginformation retrievalnatural language processing
0 likes · 15 min read
A Survey of Entity Linking: Definitions, Methods, and Applications
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 3, 2022 · Artificial Intelligence

How Hierarchical Curriculum Learning Improves Dialogue Response Selection

This article explains how treating negative response candidates with varying difficulty through a hierarchical curriculum learning framework—combining corpus‑level and instance‑level curricula—enhances dialogue response selection models, backed by experiments on Douban, Ubuntu, and E‑Commerce datasets.

Curriculum Learningdialogue response selectionhierarchical learning
0 likes · 8 min read
How Hierarchical Curriculum Learning Improves Dialogue Response Selection
Youku Technology
Youku Technology
Feb 28, 2022 · Artificial Intelligence

Seq2Path: Generating Sentiment Tuples as Paths of a Tree

Seq2Path treats each sentiment tuple as an independent tree path, training with average path loss and decoding via constrained beam search with a discriminative token, achieving state‑of‑the‑art results on five aspect‑based sentiment analysis datasets and deployment in Alibaba Entertainment AI Brain.

Beam SearchSentiment AnalysisSeq2Seq
0 likes · 3 min read
Seq2Path: Generating Sentiment Tuples as Paths of a Tree
DataFunSummit
DataFunSummit
Jan 25, 2022 · Artificial Intelligence

Intelligent Lyric Generation for Music: Techniques, Models, and Future Directions

This article explores how AI and natural language processing technologies are applied to music lyric creation, covering background challenges, rhyme retrieval methods, advanced language models such as SongNet, decoding strategies, style transfer, and a multi‑level generation platform that aims to streamline professional songwriting.

AI lyric generationLanguage ModelsSongNet
0 likes · 14 min read
Intelligent Lyric Generation for Music: Techniques, Models, and Future Directions
DataFunTalk
DataFunTalk
Jan 9, 2022 · Artificial Intelligence

Information Extraction for Unstructured Text: From Closed to Open

This presentation reviews the concepts, tasks, and challenges of information extraction from unstructured text, covering closed and open settings, relation extraction, joint extraction, and open extraction methods, and discusses recent advances such as segment‑attention, global‑rationale models, ETL, TPLinker, and maximal‑clique based approaches with experimental results.

information extractionjoint extractionknowledge graph
0 likes · 18 min read
Information Extraction for Unstructured Text: From Closed to Open
DataFunTalk
DataFunTalk
Jan 5, 2022 · Artificial Intelligence

Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing

This talk presents a series of graph‑based techniques for natural language processing, including the Story Forest system for hot event discovery, the GIANT framework for ontology creation and user interest modeling, and a divide‑and‑conquer approach to long‑text matching that leverages graph neural networks and community detection.

event detectiongraph mininggraph neural networks
0 likes · 19 min read
Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing
Code DAO
Code DAO
Dec 12, 2021 · Artificial Intelligence

Lightning Flash 0.3 Introduces New Tasks, Visualization Tools, Data Pipelines, and Registry API

Lightning Flash 0.3 expands the PyTorch Lightning ecosystem with eight new computer‑vision and NLP tasks, modular API design, integrated model hubs, visualisation callbacks, customizable data‑source hooks, and a central registry for model backbones, all illustrated with concrete code examples.

Lightning FlashPyTorch Lightningcomputer vision
0 likes · 7 min read
Lightning Flash 0.3 Introduces New Tasks, Visualization Tools, Data Pipelines, and Registry API
Youku Technology
Youku Technology
Dec 2, 2021 · Artificial Intelligence

Hybrid Curriculum Learning for Emotion Recognition in Conversation

The paper introduces a hybrid curriculum learning framework that tackles emotion shift and confusing labels in emotion recognition in conversation by applying nested curriculum stages at both conversation and utterance levels, enabling a progressive easy‑to‑hard training that markedly boosts classic ERC model performance across four public datasets and is already deployed in Alibaba’s entertainment AI brain script health‑check service.

Curriculum LearningEmotion Recognitionconversation analysis
0 likes · 2 min read
Hybrid Curriculum Learning for Emotion Recognition in Conversation
DataFunSummit
DataFunSummit
Nov 27, 2021 · Artificial Intelligence

Knowledge Graph Construction, Applications, and Recent Advances in Entity Linking

This article reviews the fundamentals of knowledge graphs, their practical uses in question answering, search and recommendation, and surveys recent research on entity linking—including dual‑encoder retrieval, BERT‑based models, multilingual approaches, and zero‑shot methods—while also outlining modern knowledge‑graph construction pipelines and open challenges.

Machine Learningentity linkinginformation extraction
0 likes · 21 min read
Knowledge Graph Construction, Applications, and Recent Advances in Entity Linking
JD Retail Technology
JD Retail Technology
Nov 16, 2021 · Artificial Intelligence

Intelligent Online Selling Point Extraction for E‑Commerce Recommendation (IOSPE) Wins AAAI 2022 Innovation Award

The IOSPE system, which uses BERT‑based scoring, transformer‑pointer generation, and personalized distribution to automatically extract and generate selling points for millions of e‑commerce products, earned the AAAI 2022 Artificial Intelligence Innovation Application Award and has boosted click‑through rates and user dwell time across JD.com platforms.

AIBERTInnovation Award
0 likes · 6 min read
Intelligent Online Selling Point Extraction for E‑Commerce Recommendation (IOSPE) Wins AAAI 2022 Innovation Award
58 Tech
58 Tech
Nov 16, 2021 · Artificial Intelligence

Deep Optimization of the 58 Yellow Pages Smart Chat Assistant for Enhanced User Experience and Business Opportunity Conversion

This article details the development and continuous optimization of 58.com’s Yellow Pages smart chat assistant, covering background, metrics, model improvements for QABot and TaskBot, slot extraction, quality assessment, and future directions, resulting in near‑human conversion rates and significant operational savings.

AIBusiness OpportunityChatbot
0 likes · 22 min read
Deep Optimization of the 58 Yellow Pages Smart Chat Assistant for Enhanced User Experience and Business Opportunity Conversion
DataFunTalk
DataFunTalk
Nov 14, 2021 · Artificial Intelligence

Knowledge Graph Construction and Entity Linking: Techniques, Applications, and Recent Advances

This article provides a comprehensive overview of knowledge graphs and entity linking, covering their definitions, practical uses in question answering, search and recommendation, the standard pipeline of mention detection, candidate generation and scoring, challenges such as scalability and multilinguality, and recent research advances including dual‑encoder, RELIC, deep retrieval, and multilingual BERT‑based models, followed by a discussion of modern knowledge‑graph construction methods.

AIentity linkingknowledge graph
0 likes · 21 min read
Knowledge Graph Construction and Entity Linking: Techniques, Applications, and Recent Advances
Dada Group Technology
Dada Group Technology
Oct 29, 2021 · Artificial Intelligence

Query Understanding in JD Daojia E‑commerce Search: Architecture, Core Algorithms, and Experimental Results

This article presents a comprehensive overview of JD Daojia's query understanding system for e‑commerce search, detailing its overall architecture, core modules such as tokenization, term weighting, query rewriting, intent detection, the algorithms employed, experimental evaluations, and future directions.

natural language processingquery understandingsearch engine
0 likes · 27 min read
Query Understanding in JD Daojia E‑commerce Search: Architecture, Core Algorithms, and Experimental Results
Meituan Technology Team
Meituan Technology Team
Sep 30, 2021 · Artificial Intelligence

Meituan's Intelligent Customer Service Technology and Practice

Meituan’s intelligent customer service platform, serving over 630 million users and 7.7 million merchants, integrates six core AI capabilities—including problem recommendation, understanding, dialogue management, answer supply, response recommendation, and session summarization—across pre‑sale, in‑sale, after‑sale and internal scenarios, leveraging multi‑turn dialogue, intent recognition, knowledge‑graph Q&A, and the Moses platform, while targeting future end‑to‑end and emotionally intelligent interactions.

BERTDialogue SystemsIntelligent Customer Service
0 likes · 23 min read
Meituan's Intelligent Customer Service Technology and Practice
Volcano Engine Developer Services
Volcano Engine Developer Services
Sep 25, 2021 · Artificial Intelligence

Cutting‑Edge AI from ByteDance & OPPO: Audio, NLP, and Translation

The ByteDance Engine Developer Community Meetup featured senior engineers from ByteDance and OPPO who presented the latest advances in intelligent audio signal processing, natural language processing for recommendation, entity linking in knowledge graphs, and multimedia machine translation, highlighting practical applications and performance challenges.

Artificial IntelligenceRecommendation Systemsknowledge graph
0 likes · 4 min read
Cutting‑Edge AI from ByteDance & OPPO: Audio, NLP, and Translation
DataFunTalk
DataFunTalk
Sep 24, 2021 · Artificial Intelligence

Intelligent Question Answering in QQ Browser Search Engine: KBQA, DeepQA, and IRQA

This article presents the architecture, techniques, and practical solutions behind intelligent question answering in QQ Browser's search engine, covering knowledge‑graph based QA (KBQA), machine‑reading‑comprehension QA (DeepQA), and information‑retrieval QA (IRQA), and discusses system design, model optimization, and future directions.

AIinformation retrievalknowledge graph
0 likes · 23 min read
Intelligent Question Answering in QQ Browser Search Engine: KBQA, DeepQA, and IRQA
DataFunTalk
DataFunTalk
Sep 22, 2021 · Artificial Intelligence

Baidu Unveils PLATO-XL: A 110‑Billion‑Parameter Bilingual Dialogue Generation Model

Baidu's newly released PLATO‑XL, a 110‑billion‑parameter bilingual pre‑training dialogue model, surpasses previous large‑scale models, introduces multi‑role awareness for consistent multi‑turn conversations, and demonstrates state‑of‑the‑art performance across open‑domain, knowledge‑grounded, and task‑oriented dialogue tasks.

Large Language ModelPLATO-XLbilingual AI
0 likes · 9 min read
Baidu Unveils PLATO-XL: A 110‑Billion‑Parameter Bilingual Dialogue Generation Model
HaoDF Tech Team
HaoDF Tech Team
Sep 15, 2021 · Artificial Intelligence

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

This article describes how Haodf Online improved its medical question‑answer search by analyzing search challenges, adopting semantic similarity models based on pre‑trained language embeddings, designing contrastive training tasks, and evaluating the resulting increase in click‑through rate and user engagement.

medical AImodel optimizationnatural language processing
0 likes · 12 min read
Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study
DataFunTalk
DataFunTalk
Aug 31, 2021 · Artificial Intelligence

Applying Knowledge Graphs for Clinical VTE Risk Assessment: A Case Study from HuiMei Technology

This article describes how HuiMei Technology leverages a medical knowledge graph, natural‑language processing, and AI‑driven scoring to automate venous thromboembolism (VTE) risk assessment in large hospitals, detailing the business background, technical architecture, implementation workflow, and ongoing research directions.

SNOMED-CTVTEclinical risk assessment
0 likes · 16 min read
Applying Knowledge Graphs for Clinical VTE Risk Assessment: A Case Study from HuiMei Technology
DeWu Technology
DeWu Technology
Jul 18, 2021 · Artificial Intelligence

Deep Learning Techniques for Sentiment Analysis

The article explains how deep‑learning models, particularly convolutional neural networks with token‑level padding, kernel size three, and max‑pooling, can automatically classify e‑commerce product reviews into eight sentiment categories, offering scalable insight for decision‑making and paving the way for recommendation, QA, and risk‑assessment applications.

Sentiment Analysisconvolutional neural networkdeep learning
0 likes · 9 min read
Deep Learning Techniques for Sentiment Analysis
Meituan Technology Team
Meituan Technology Team
Jun 24, 2021 · Artificial Intelligence

Construction and Application of Meituan's Common‑Sense Concept Knowledge Graph

The paper describes Meituan’s common‑sense concept knowledge graph, detailing a multi‑stage construction pipeline—concept, hierarchy, attribute, bridging, and POI/SPU linking—using BERT, XGBoost, and graph neural networks, and demonstrates its deployment in category‑word enrichment, search suggestions, and medical‑beauty tagging, achieving over two million concepts, three million relations, and roughly 90 % accuracy.

Machine Learningconcept miningentity linking
0 likes · 26 min read
Construction and Application of Meituan's Common‑Sense Concept Knowledge Graph
DataFunTalk
DataFunTalk
May 8, 2021 · Artificial Intelligence

Attribute‑Level Sentiment Analysis for E‑commerce: Tasks, Challenges, and System Design

This article presents a comprehensive overview of sentiment analysis in user‑generated content, detailing document‑, sentence‑, and aspect‑level tasks, defining the Aspect Sentiment Triplet Extraction problem for e‑commerce reviews, describing a three‑stage pipeline with pre‑training, multi‑domain modeling and attribute normalization, and reporting significant business improvements such as 400% CTR lift, while also discussing data imbalance, annotation scarcity, and future research directions.

Machine LearningSentiment Analysisaspect based sentiment
0 likes · 15 min read
Attribute‑Level Sentiment Analysis for E‑commerce: Tasks, Challenges, and System Design