Tagged articles
1067 articles
Page 9 of 11
Ops Development & AI Practice
Ops Development & AI Practice
Aug 16, 2024 · Industry Insights

How LLMs Are Evolving from Language Mimicry to Real-World Simulation

Recent breakthroughs in AI, from large language models gaining real-world simulation abilities to rapid AI-chip advancements and the surge of open-source models, are reshaping industries, highlighting both unprecedented opportunities and the need for ethical, secure deployment across sectors.

AI chipsArtificial IntelligenceOpen Source
0 likes · 7 min read
How LLMs Are Evolving from Language Mimicry to Real-World Simulation
DaTaobao Tech
DaTaobao Tech
Aug 16, 2024 · Artificial Intelligence

Effective Prompt Design for Large Language Models

Effective prompt design for large language models requires clear goals, relevant context, explicit input/output formats, evaluation criteria, and illustrative examples, combined with specific language, step‑by‑step instructions, edge‑case handling, ethical considerations, and proper tokenization, encoding, decoding, and post‑processing to produce accurate, concise, low‑hallucination responses.

AIlarge language modelsnatural language processing
0 likes · 33 min read
Effective Prompt Design for Large Language Models
DataFunSummit
DataFunSummit
Aug 16, 2024 · Artificial Intelligence

Educational Large Language Model Research and Product Applications for Youth Programming

The presentation outlines the challenges of sparse data and delayed learning effects in youth programming education, introduces three technical breakthroughs—dual‑data model training, hierarchical knowledge‑graph prompting, and reinforcement‑based cognitive recommendation—and showcases product implementations such as the Frog Programming Platform, AI learning machine, and digital‑human recorded courses.

AIeducationlarge language models
0 likes · 19 min read
Educational Large Language Model Research and Product Applications for Youth Programming
AntTech
AntTech
Aug 13, 2024 · Artificial Intelligence

Ant Group Contributions to ACL 2024: Summaries of 14 Accepted Papers Across NLP and AI

From August 11‑16, 2024 the ACL conference in Bangkok featured 14 Ant Group papers covering large‑scale information extraction, decomposed LLMs for semantic search, multimodal hallucination detection, long‑context attention mechanisms, concept‑reasoning datasets, knowledge‑graph alignment, and more, highlighting the group's breadth in natural language processing and AI research.

ACL2024MultimodalNLP
0 likes · 20 min read
Ant Group Contributions to ACL 2024: Summaries of 14 Accepted Papers Across NLP and AI
DaTaobao Tech
DaTaobao Tech
Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AIOptimizationRAG
0 likes · 22 min read
Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)
DataFunSummit
DataFunSummit
Aug 8, 2024 · Artificial Intelligence

Exploring Training and Alignment Techniques for Financial Large Models

The announcement details a DataFun Summit 2024 session where Du Xiaoman AI researcher Huo Liangyu will present on the challenges, development, and alignment methods of the Xuan Yuan financial large language model, highlighting RLHF techniques, data collection, and real‑world deployment insights for the finance sector.

AIFinanceFinancial AI
0 likes · 6 min read
Exploring Training and Alignment Techniques for Financial Large Models
Data Thinking Notes
Data Thinking Notes
Aug 6, 2024 · Artificial Intelligence

How Large Language Models Are Revolutionizing R&D Operations and Telecom Networks

Large language models are increasingly applied in research and development operations, boosting efficiency and automating processes such as coding assistance, testing, requirement analysis, documentation, knowledge management, and network traffic analysis, while also enhancing security and enabling intelligent transformation across industries, especially telecom.

AI in telecomR&D automationlarge language models
0 likes · 3 min read
How Large Language Models Are Revolutionizing R&D Operations and Telecom Networks
DeWu Technology
DeWu Technology
Aug 5, 2024 · Frontend Development

Large Model Innovations Redefining Frontend Development – Key Takeaways

The July 14 DeWu tech salon showcased how large language models are reshaping frontend development, featuring insights from NetEase, Alibaba, and DeWu experts on AI‑driven low‑code platforms, intelligent coding assistants, and practical implementation strategies, with over 20,000 online viewers.

AIFrontendLow‑code
0 likes · 8 min read
Large Model Innovations Redefining Frontend Development – Key Takeaways
Software Development Quality
Software Development Quality
Aug 5, 2024 · Artificial Intelligence

How Large Language Models Can Transform Software Testing

This article explores how large language models can automate test case generation, predict defects, analyze results, optimize strategies, execute intelligent testing, and assist compatibility checks, while providing practical tools, real-world case studies, and a step‑by‑step GPT‑4 testing workflow.

AI testingdefect predictionlarge language models
0 likes · 15 min read
How Large Language Models Can Transform Software Testing
DataFunSummit
DataFunSummit
Aug 4, 2024 · Artificial Intelligence

Graph Technology Overview and Applications – From GraphGPT to Graph Databases

This article presents a comprehensive overview of recent advances in graph technology, covering GraphGPT for large language models, knowledge transfer on complex graphs, financial fraud detection, telecom network optimization, graph foundation models, Baidu's multi‑domain recommendation, high‑availability graph databases, and Kuaishou's efficient recommendation architecture.

Recommendation Systemsfinancial fraud detectiongraph databases
0 likes · 4 min read
Graph Technology Overview and Applications – From GraphGPT to Graph Databases
NewBeeNLP
NewBeeNLP
Aug 3, 2024 · Artificial Intelligence

Extending LLM Context to 1M Tokens: SAMBA, CoPE, RoPE, Retrieval Heads & Infini‑Attention

This article reviews recent research on extending large language model context windows to millions of tokens, covering SAMBA's hybrid architecture, Contextual Position Encoding (CoPE), RoPE base length theory, Retrieval Head analysis, and the memory‑efficient Infini‑Attention mechanism.

Efficient AttentionLLM researchlarge language models
0 likes · 10 min read
Extending LLM Context to 1M Tokens: SAMBA, CoPE, RoPE, Retrieval Heads & Infini‑Attention
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 1, 2024 · Artificial Intelligence

Xiaohongshu Search Advertising Recall: Practices, Metrics, and Large‑Model Integration

Xiaohongshu’s search advertising recall system evolves from keyword bidding to BERT‑based vector retrieval and LLM‑enhanced query rewriting, using dual semantic and efficiency models, water‑level metrics, and GPU‑accelerated engineering to achieve 80 % click coverage, 60 % conversion coverage and a 5 % CPM lift.

Artificial Intelligenceefficiency optimizationlarge language models
0 likes · 33 min read
Xiaohongshu Search Advertising Recall: Practices, Metrics, and Large‑Model Integration
Sohu Tech Products
Sohu Tech Products
Jul 31, 2024 · Artificial Intelligence

MMEvalPro: A Trustworthy Benchmark for Evaluating Multimodal Large Models

MMEvalPro, a new benchmark created by researchers from Peking University, Chinese Academy of Medical Sciences, CUHK and Alibaba, augments existing multimodal datasets with perception and knowledge questions and introduces a Genuine Accuracy metric, revealing that top multimodal models still lag far behind humans and exposing shortcut‑driven performance on prior tests.

MMEvalProMultimodal Evaluationbenchmark
0 likes · 11 min read
MMEvalPro: A Trustworthy Benchmark for Evaluating Multimodal Large Models
NewBeeNLP
NewBeeNLP
Jul 31, 2024 · Artificial Intelligence

Training 7B–13B LLMs: Practical Tips, Hyperparameters, and Scaling Challenges

The article shares hands‑on experience training 7‑ and 13‑billion‑parameter language models, covering essential hyper‑parameters, hardware requirements, data quality considerations, open dataset resources, and the systemic difficulties that arise when scaling to trillion‑parameter models.

LLM traininghyperparameterslarge language models
0 likes · 8 min read
Training 7B–13B LLMs: Practical Tips, Hyperparameters, and Scaling Challenges
Tencent Cloud Developer
Tencent Cloud Developer
Jul 30, 2024 · Artificial Intelligence

A Systematic Guide to Prompt Engineering: From Zero to One

This guide walks readers from beginner to proficient Prompt Engineer by outlining the evolution of prompting, introducing a universal four‑component template, and detailing a five‑step workflow—including refinement, retrieval‑augmented generation, chain‑of‑thought reasoning, and advanced tuning techniques—plus evaluation metrics for LLM performance.

AI promptingLLM optimizationRAG
0 likes · 51 min read
A Systematic Guide to Prompt Engineering: From Zero to One
21CTO
21CTO
Jul 28, 2024 · Artificial Intelligence

How Anaconda Is Building an AI Operating System with High‑Performance Python

At PyCon US 2024, Anaconda’s Peter Wang outlined the company’s strategy to create an AI operating system by accelerating Python, launching the Anaconda Toolbox and AI Navigator, and addressing the challenges of integrating data, code, and large‑language models for enterprise AI workloads.

AI NavigatorAI Operating SystemAnaconda
0 likes · 6 min read
How Anaconda Is Building an AI Operating System with High‑Performance Python
DataFunSummit
DataFunSummit
Jul 28, 2024 · Artificial Intelligence

Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions

This article reviews why large language models can be applied to graph learning, outlines their capabilities and graph data characteristics, surveys current research across different graph types and LLM roles, and proposes future research directions for unified cross‑domain graph learning.

AIMultimodalResearch Directions
0 likes · 19 min read
Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions
DataFunSummit
DataFunSummit
Jul 25, 2024 · Artificial Intelligence

LOGIN: Large‑Model‑Assisted Graph Neural Networks for User Behavior Risk Control

This article presents the latest advances from the Chinese Academy of Sciences in graph machine learning for user behavior risk control, introducing the LOGIN framework that leverages large language models as consultants to iteratively enhance GNN training, and demonstrates its effectiveness through extensive experiments on homogeneous and heterogeneous graph benchmarks.

Machine Learninggraph neural networkslarge language models
0 likes · 14 min read
LOGIN: Large‑Model‑Assisted Graph Neural Networks for User Behavior Risk Control
21CTO
21CTO
Jul 24, 2024 · Artificial Intelligence

Meta’s Llama 3.1 405B: How the Open‑Source Giant Stands Up to GPT‑4 and Claude 3.5

Meta’s newly released Llama 3.1 series, highlighted by the 405B model trained on 150 trillion tokens, claims state‑of‑the‑art performance in coding, mathematics, and multilingual summarization while offering an open‑source alternative to GPT‑4o and Claude 3.5 Sonnet.

AI competitionLlama 3.1large language models
0 likes · 6 min read
Meta’s Llama 3.1 405B: How the Open‑Source Giant Stands Up to GPT‑4 and Claude 3.5
Kuaishou Tech
Kuaishou Tech
Jul 23, 2024 · Artificial Intelligence

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

This paper introduces Parrot, a system that enhances large language models' (LLMs) multi-turn instruction following capabilities through context-aware preference optimization (CaPO) and synthetic data generation, achieving significant performance improvements with limited training data.

CaPONLPdata synthesis
0 likes · 9 min read
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models
DataFunTalk
DataFunTalk
Jul 21, 2024 · Artificial Intelligence

Integrating DataOps with Large Language Models for Text2SQL: Practices, Challenges, and Future Directions

This article presents a comprehensive overview of how DataOps principles combined with large language models such as GPT‑4 enable more agile and intelligent data engineering workflows, focusing on Text2SQL applications, schema‑linking techniques, practical product implementations, and future research challenges.

AIDataOpsSchema Linking
0 likes · 23 min read
Integrating DataOps with Large Language Models for Text2SQL: Practices, Challenges, and Future Directions
IT Services Circle
IT Services Circle
Jul 17, 2024 · Artificial Intelligence

Why Large Language Models Mistake 9.11 > 9.9: Prompting, Tokenizer Effects, and Recent Findings

The article examines why leading large language models such as GPT‑4o, Gemini Advanced, and Claude 3.5 incorrectly claim that 9.11 is larger than 9.9, analyzes tokenization and prompting strategies that cause the error, and discusses recent research and OpenAI model updates.

AI reasoningNumerical Comparisonlarge language models
0 likes · 7 min read
Why Large Language Models Mistake 9.11 > 9.9: Prompting, Tokenizer Effects, and Recent Findings
Java Tech Enthusiast
Java Tech Enthusiast
Jul 12, 2024 · Artificial Intelligence

Why Alibaba’s Qwen‑2 Is Outperforming Global LLMs and What It Means for AI

After OpenAI halted API access in China, Alibaba’s Tongyi Qwen‑2 quickly rose to the top of global open‑source LLM leaderboards, surpassing Meta’s Llama‑3 and other contenders, with detailed benchmark scores, performance gains over previous versions, and implications for China’s AI ecosystem.

AI BenchmarkAlibabaChina AI
0 likes · 5 min read
Why Alibaba’s Qwen‑2 Is Outperforming Global LLMs and What It Means for AI
Kuaishou Large Model
Kuaishou Large Model
Jul 11, 2024 · Artificial Intelligence

Pipeline-Aware Offloading & Balanced Checkpointing Accelerate LLM Training

Researchers from Kwai’s large-model team present a novel training system that combines pipeline-parallel-aware activation offloading with a compute-memory balanced checkpointing strategy, enabling lossless acceleration of large language models, achieving up to 42.7% MFU on 256 NVIDIA H800 GPUs while reducing memory usage.

GPU trainingKwaiPerformance Modeling
0 likes · 13 min read
Pipeline-Aware Offloading & Balanced Checkpointing Accelerate LLM Training
JD Tech
JD Tech
Jul 11, 2024 · Artificial Intelligence

Intelligent Parcel Identification in JD Express Logistics Using Large Language Models

This article examines the challenges of low parcel matching rates in JD Express logistics and proposes a large‑model‑based intelligent identification system, detailing its architecture, accuracy validation, cost‑saving cache strategy, and future prospects for improved efficiency and personalized services.

AI in e-commerceLogisticsOperational Efficiency
0 likes · 24 min read
Intelligent Parcel Identification in JD Express Logistics Using Large Language Models
NewBeeNLP
NewBeeNLP
Jul 10, 2024 · Artificial Intelligence

Can Large Language Models Master Co‑Temporal Reasoning? Introducing COTEMPQA

This article presents the COTEMPQA benchmark for evaluating large language models on co‑temporal reasoning, details its four scenario types, construction pipeline, experimental results across models, error analysis, and proposes the MR‑COT strategy that leverages mathematical reasoning to significantly improve performance.

LLM evaluationMR-COTbenchmark dataset
0 likes · 11 min read
Can Large Language Models Master Co‑Temporal Reasoning? Introducing COTEMPQA
DataFunSummit
DataFunSummit
Jul 9, 2024 · Artificial Intelligence

Applying Large Language Models to Recommendation Systems at Ant Group

This article details Ant Group's research on integrating large language models into recommendation pipelines, covering background challenges, knowledge extraction, teacher‑student distillation, experimental results, and practical Q&A for improving bias, efficiency, and cold‑start performance.

AI researchAnt Groupknowledge extraction
0 likes · 14 min read
Applying Large Language Models to Recommendation Systems at Ant Group
DataFunSummit
DataFunSummit
Jul 6, 2024 · Artificial Intelligence

Synergy Between Large Language Models and Knowledge Graphs: Recent Advances, Evaluation, and Future Integration

This article reviews the rapid progress of large language models and their complementary relationship with knowledge graphs, covering comparative strengths, knowledge extraction and completion, evaluation benchmarks, deployment benefits, complex reasoning support, and prospects for interactive fusion toward more reliable and explainable AI systems.

AI evaluationKnowledge Graphsknowledge extraction
0 likes · 12 min read
Synergy Between Large Language Models and Knowledge Graphs: Recent Advances, Evaluation, and Future Integration
21CTO
21CTO
Jul 5, 2024 · Artificial Intelligence

15 Real-World Ways Companies Leverage Large Language Models

This article explores fifteen detailed examples of how major companies across sectors—from streaming and e‑commerce to transportation and social platforms—are harnessing large language models to improve search, personalize communications, detect fraud, and enhance operational efficiency.

AI case studiesLLM applicationsMachine Learning
0 likes · 9 min read
15 Real-World Ways Companies Leverage Large Language Models
Bilibili Tech
Bilibili Tech
Jul 5, 2024 · Artificial Intelligence

Bilibili's AI Innovations at WAIC 2024: Empowering Creators and Transforming Content

At WAIC 2024, Bilibili unveiled a suite of AI tools—including a 1:1 digital‑avatar generator, dynamic comic technology, a customized voice library for virtual singer Luo Tianyi, the BiliStudio video‑audio model, and the Index‑1.9B large‑language models—empowering creators, cutting production costs, and serving its 80 million‑plus monthly users with advanced content‑creation and commercial‑marketing solutions.

AI content creationAI marketingBilibili
0 likes · 7 min read
Bilibili's AI Innovations at WAIC 2024: Empowering Creators and Transforming Content
JD Tech
JD Tech
Jul 5, 2024 · Artificial Intelligence

Generative Recommendation Systems for JD Alliance Advertising: Architecture, Implementation, and Experimental Evaluation

This article surveys how large language models reshape recommendation systems, presents a generative RS framework tailored for JD Alliance advertising, details material representation, model input, training and inference pipelines, and reports extensive offline and online experiments demonstrating its effectiveness on sparse user data.

Generative RecommendationLLMe-commerce advertising
0 likes · 27 min read
Generative Recommendation Systems for JD Alliance Advertising: Architecture, Implementation, and Experimental Evaluation
Meituan Technology Team
Meituan Technology Team
Jul 4, 2024 · Artificial Intelligence

Meituan Search Advertising: Evolution of Recall Strategies and Generative Approaches

Meituan’s search advertising has progressed from rule‑based keyword mining to hierarchical recall that partitions traffic and supply, and now to generative recall using large language models, chain‑of‑thought generation, diffusion‑enhanced multimodal vectors, and knowledge distillation, expanding the decision space while tackling compute and ROI challenges.

Generative ModelsMeituanMultimodal Retrieval
0 likes · 19 min read
Meituan Search Advertising: Evolution of Recall Strategies and Generative Approaches
360 Smart Cloud
360 Smart Cloud
Jul 4, 2024 · Artificial Intelligence

Optimizing Mixture-of-Experts (MoE) Training with the QLM Framework

This article introduces the background and challenges of large language model training, explains the Mixture-of-Experts (MoE) architecture, and details several optimization techniques implemented in the QLM framework—including fine-grained and shared experts, top‑k gating, token distribution, expert parallelism, and grouped GEMM – to improve training efficiency and performance.

AIMixture of ExpertsQLM
0 likes · 10 min read
Optimizing Mixture-of-Experts (MoE) Training with the QLM Framework
NewBeeNLP
NewBeeNLP
Jul 3, 2024 · Industry Insights

What Dominated the AI Landscape in Q2 2024? From Llama 3 to GPT‑4o and Global Price Wars

The second quarter of 2024 saw a whirlwind of AI developments—including Meta’s open‑source Llama 3, Microsoft’s fleeting WizardLM‑2, a wave of model price cuts, major IPOs, legislative restrictions, and the debut of OpenAI’s multimodal GPT‑4o—painting a vivid picture of rapid innovation, fierce competition, and shifting market dynamics across the global AI ecosystem.

AI modelsAI policyOpen Source
0 likes · 24 min read
What Dominated the AI Landscape in Q2 2024? From Llama 3 to GPT‑4o and Global Price Wars
JD Cloud Developers
JD Cloud Developers
Jul 2, 2024 · Operations

How Large Language Models Are Transforming Modern IT Operations

From manual server management to automated scripts, AIOps, and ChatOps, this article traces the evolution of IT operations and demonstrates how large language models boost efficiency, enable intelligent assistants, automated diagnostics, and smart log analysis, aiming for rapid fault detection, localization, and resolution.

ChatOpsOperationsaiops
0 likes · 7 min read
How Large Language Models Are Transforming Modern IT Operations
Continuous Delivery 2.0
Continuous Delivery 2.0
Jul 2, 2024 · Artificial Intelligence

Dynamic Integrated Developer Activity (DIDACT): Large Sequence Models for Software Development

The article introduces DIDACT, a large‑scale multitask machine‑learning framework that trains on the full software‑development workflow—including edits, builds, reviews, and tool interactions—to create AI assistants that can predict and suggest developer actions throughout the coding process.

AI for CodeMachine Learningdeveloper assistance
0 likes · 11 min read
Dynamic Integrated Developer Activity (DIDACT): Large Sequence Models for Software Development
DaTaobao Tech
DaTaobao Tech
Jul 1, 2024 · Artificial Intelligence

Recent Progress in Vision-Language Models (VLMs)

Over the past year, Vision‑Language Models have surged from early multimodal experiments to competitive open‑source systems rivaling GPT‑4, driven by higher‑resolution processing, richer vision encoders, better projection layers, and larger curated datasets, yet they still face evaluation difficulties, hallucinations, speed limits, and limited multimodal output.

Vision-Language Modelscomputer visiondeep learning
0 likes · 24 min read
Recent Progress in Vision-Language Models (VLMs)
JD Tech
JD Tech
Jun 28, 2024 · Artificial Intelligence

An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI

This article provides a comprehensive introduction to large language models, covering their historical development, core architecture, training process, prompt engineering techniques, Retrieval‑Augmented Generation, agent frameworks, multimodal capabilities, safety challenges, and future research directions.

AI agentsAI safetyMultimodal
0 likes · 22 min read
An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI
DataFunSummit
DataFunSummit
Jun 26, 2024 · Artificial Intelligence

2026 Roadmap for Recommendation Systems: Challenges, Research Directions, and OneRec Integration

This article outlines the current bottlenecks of conventional recommendation pipelines and proposes a comprehensive 2026 research agenda covering retention improvement, user growth, content ecosystem, multi‑objective Pareto optimization, long‑term value modeling, whole‑site optimization, interactive recommendation, personalized modeling, decision‑theoretic formulation, and the OneRec multi‑source fusion framework.

User Retentionlarge language modelsmulti-objective optimization
0 likes · 18 min read
2026 Roadmap for Recommendation Systems: Challenges, Research Directions, and OneRec Integration
JD Tech Talk
JD Tech Talk
Jun 25, 2024 · Artificial Intelligence

Understanding Large Language Models: From Parameters to Transformer Architecture

This article explains the fundamental concepts behind large language models, including their two-file structure, training process, neural network basics, perceptron examples, weight and threshold calculations, the TensorFlow Playground, and a detailed walkthrough of the Transformer architecture with tokenization, positional encoding, self‑attention, normalization, and feed‑forward layers.

AIMachine LearningSelf-Attention
0 likes · 20 min read
Understanding Large Language Models: From Parameters to Transformer Architecture
NewBeeNLP
NewBeeNLP
Jun 24, 2024 · Artificial Intelligence

How Domain Large Models Are Shaping the Future of AI: Challenges and Solutions

This article reviews Fudan University's Knowledge Factory Lab research on domain large models, covering background, three major deployment challenges, data‑selection strategies, ability‑enhancement techniques, collaborative workflows, and retrieval‑augmented generation methods that aim to make large models practical for real‑world tasks.

domain adaptationknowledge extractionlarge language models
0 likes · 18 min read
How Domain Large Models Are Shaping the Future of AI: Challenges and Solutions
DataFunSummit
DataFunSummit
Jun 23, 2024 · Artificial Intelligence

Tongyi Xingchen Personalized Large Model: Technical Overview and Applications

This article summarizes the development background of large language models, Alibaba's progression in foundational and personalized AI, the design and capabilities of the Tongyi Xingchen personalized model, its multimodal and agent-based architecture, various industry use cases, and the safety and responsibility measures applied to ensure trustworthy AI deployment.

AI safetylarge language modelsmodel agents
0 likes · 13 min read
Tongyi Xingchen Personalized Large Model: Technical Overview and Applications
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 20, 2024 · Artificial Intelligence

Xiaohongshu 2024 Large Model Frontier Paper Sharing Live Event

On June 27, 2024, Xiaohongshu’s technical team will livestream a two‑hour session across WeChat Channels, Bilibili, Douyin and Xiaohongshu, showcasing six top‑conference papers on large‑model advances—including early‑stopping and fine‑grained self‑consistency, novel evaluation methods, negative‑sample‑assisted distillation, and LLM‑based note recommendation—followed by a Q&A and recruitment briefing.

AI researchKnowledge DistillationRecommendation Systems
0 likes · 12 min read
Xiaohongshu 2024 Large Model Frontier Paper Sharing Live Event
JD Tech Talk
JD Tech Talk
Jun 20, 2024 · Artificial Intelligence

Applying Large Language Models to Courier Operations: Intelligent Operations, Q&A, Prompting, and Agents

This article describes how large language models such as ChatGPT are integrated into courier terminal systems to automate tasks, enhance intelligent voice operations, enable retrieval‑augmented question answering, generate smart prompts, and explore agent‑based workflows, supported by code examples for data extraction, splitting, and embedding.

AI for logisticsIntelligent OperationsRetrieval-Augmented Generation
0 likes · 14 min read
Applying Large Language Models to Courier Operations: Intelligent Operations, Q&A, Prompting, and Agents
DataFunSummit
DataFunSummit
Jun 18, 2024 · Artificial Intelligence

Conditional and Multimodal Knowledge Graph Construction, Extraction, and Integration with Large Models

This article presents a comprehensive overview of conditional and multimodal knowledge graphs, covering their background, construction pipelines, extraction techniques, dataset creation, semi‑supervised learning strategies, and how they can be fused with large language models for enhanced reasoning and application in tasks such as intelligent QA and video scene graph generation.

AIconditional KGinformation extraction
0 likes · 23 min read
Conditional and Multimodal Knowledge Graph Construction, Extraction, and Integration with Large Models
DataFunTalk
DataFunTalk
Jun 15, 2024 · Artificial Intelligence

Research on Domain Large Models by Fudan University Knowledge Factory Lab

This article presents Fudan University's Knowledge Factory Lab research on domain large models, covering background, challenges, data selection, source‑enhanced tagging, capability improvements, self‑correction, collaborative workflows, and retrieval‑augmented generation for practical AI deployment.

AI researchdomain adaptationknowledge graph
0 likes · 16 min read
Research on Domain Large Models by Fudan University Knowledge Factory Lab
Baidu Tech Salon
Baidu Tech Salon
Jun 14, 2024 · Artificial Intelligence

Why Large Models Signal the Dawn of General AI: Insights from Baidu’s CTO

In a keynote at the 2024 Beijing Zhiyuan Conference, Baidu’s CTO Wang Haifeng explained how large‑model universality and comprehensive capabilities are driving artificial general intelligence forward, highlighting scale laws, multimodal advances, agent technologies, and the industrial‑scale production of AI.

AI industrializationAI trendsGeneral AI
0 likes · 7 min read
Why Large Models Signal the Dawn of General AI: Insights from Baidu’s CTO
DataFunTalk
DataFunTalk
Jun 14, 2024 · Artificial Intelligence

Shopee's E‑commerce Knowledge Graph Construction and Integration with Large Models

This article presents Shopee's comprehensive exploration of building an e‑commerce knowledge graph, detailing its challenges, construction pipeline, AI‑driven extraction and fusion techniques, multilingual and multimodal modeling, and practical applications ranging from search and recommendation to AI assistants and real‑time updates.

AI applicationsMultimodale-commerce
0 likes · 21 min read
Shopee's E‑commerce Knowledge Graph Construction and Integration with Large Models
AntTech
AntTech
Jun 13, 2024 · Artificial Intelligence

Exploring Multi‑Agent Applications in Financial Scenarios and the agentUniverse Framework

The article reviews the evolution from large language models to stateful agents, discusses the specific challenges of information‑dense, knowledge‑dense, and decision‑dense financial tasks, and introduces the open‑source agentUniverse multi‑agent framework with its PEER collaboration model and real‑world investment‑research applications.

AI Research AssistantFinancial AIPEER framework
0 likes · 18 min read
Exploring Multi‑Agent Applications in Financial Scenarios and the agentUniverse Framework
AntTech
AntTech
Jun 6, 2024 · Information Security

AIGC Era Trends in Next‑Generation Identity Recognition: DeepFake Risks, AIGC as a New Production Force, and Cross‑Terminal Interaction

The talk at the 18th Security Identification Technology Expo and Summit outlines three emerging trends for identity verification in the AIGC era: the surge of deep‑fake attacks, the use of generative AI as a new data‑production engine, and the shift toward cross‑device, agent‑based authentication paradigms.

AIGCBiometricsIdentity verification
0 likes · 10 min read
AIGC Era Trends in Next‑Generation Identity Recognition: DeepFake Risks, AIGC as a New Production Force, and Cross‑Terminal Interaction
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 3, 2024 · Artificial Intelligence

Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?

Recent arXiv work introduces ATM, an adversarially‑tuned multi‑agent system that iteratively pits a fake‑knowledge attacker against a generator, dramatically improving retrieval‑augmented language models’ resistance to hallucinated content and boosting performance on knowledge‑intensive benchmarks, even with noisy or irrelevant documents.

Hallucination MitigationRAGadversarial training
0 likes · 12 min read
Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?
58 Tech
58 Tech
Jun 3, 2024 · Artificial Intelligence

Parameter-Efficient Fine-Tuning (PEFT) Methods for Large Language Models: LoRA, QLoRA, AdaLoRA, SoRA, and Training Acceleration with Unsloth

This article systematically analyzes popular parameter‑efficient fine‑tuning (PEFT) techniques for large language models—including Adapter Tuning, Prefix Tuning, LoRA, QLoRA, AdaLoRA, and SoRA—detailing their principles, implementation code, experimental results on NLU tasks, and practical acceleration using the Unsloth library.

AdaLoRALoRAPEFT
0 likes · 39 min read
Parameter-Efficient Fine-Tuning (PEFT) Methods for Large Language Models: LoRA, QLoRA, AdaLoRA, SoRA, and Training Acceleration with Unsloth
DataFunSummit
DataFunSummit
Jun 1, 2024 · Artificial Intelligence

Graph Foundation Models: Concepts, Progress, and Future Directions

This article provides a comprehensive overview of Graph Foundation Models (GFMs), covering their definition, key characteristics, historical development of graph machine learning, recent research trends such as PT‑HGNN, Specformer, and GraphTranslator, and discusses future challenges and research directions.

Machine Learningfoundation modelsgraph neural networks
0 likes · 23 min read
Graph Foundation Models: Concepts, Progress, and Future Directions
DataFunTalk
DataFunTalk
May 31, 2024 · Artificial Intelligence

The Role of Knowledge Graphs in Industry: Importance, Product Forms, and Practical Cases

This article explains why knowledge graphs are crucial for industrial applications, describes the main product forms and architectural considerations, and shares real‑world case studies illustrating how AI, large models, and graph databases can be combined to improve knowledge management and decision‑making.

AIGraph DatabaseIndustrial Applications
0 likes · 20 min read
The Role of Knowledge Graphs in Industry: Importance, Product Forms, and Practical Cases
Kuaishou Tech
Kuaishou Tech
May 27, 2024 · Artificial Intelligence

What Kuaishou’s Four ACL Papers Reveal About the Future of Large Language Models

The 62nd ACL conference accepted four papers from Kuaishou that explore multi‑turn instruction following, self‑agreement reasoning, fine‑grained reinforcement learning, and dynamic routing in Mixture‑of‑Experts models, each with detailed methods, experimental results, author lists, and public arXiv links.

ACL 2024Kuaishou ResearchMixture of Experts
0 likes · 11 min read
What Kuaishou’s Four ACL Papers Reveal About the Future of Large Language Models
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
May 27, 2024 · Databases

Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets

An exclusive interview with Baidu’s senior database architects reveals the motivations behind building a dedicated enterprise vector database, details its novel column‑store engine, C++‑based retrieval stack, performance gains over open‑source solutions, multi‑modal support, RAG integration, and future research directions.

AIRAGStorage Engine
0 likes · 28 min read
Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets
Baidu Tech Salon
Baidu Tech Salon
May 20, 2024 · Artificial Intelligence

Boosting Ad Efficiency with Baidu’s Multi‑Agent AI Architecture

In the AI‑native era, Baidu's ad platform adopts a multi‑agent architecture that combines large and small LLMs, SOP‑driven workflows, long‑term memory, and vector databases to achieve high query accuracy, low latency, and significant business gains while tackling challenges such as hallucination, planning, execution, and personalization.

AI agentsIndustry InsightsLLM optimization
0 likes · 18 min read
Boosting Ad Efficiency with Baidu’s Multi‑Agent AI Architecture
NewBeeNLP
NewBeeNLP
May 16, 2024 · Artificial Intelligence

How Large Language Models Transform Advertising Copy Generation

This article examines the adoption of large language models for intelligent advertising copy creation, detailing business challenges, model selection criteria, training data preparation, fine‑tuning methods, performance evaluation, deployment results, while highlighting the trade‑offs between model size, cost, and output quality.

AI marketingadvertising copyfine-tuning
0 likes · 20 min read
How Large Language Models Transform Advertising Copy Generation
DeWu Technology
DeWu Technology
May 15, 2024 · Artificial Intelligence

Accelerating Large Language Model Inference: Techniques and Framework Recommendations

Deploying a dedicated inference cluster and applying four key optimizations—FlashAttention‑based attention computation, PageAttention KV‑cache management, Mixture‑of‑Experts parameter reduction, and tensor parallelism—can accelerate large language model inference by up to 50% for models as large as 70 B parameters while cutting deployment costs.

FlashAttentionInference AccelerationMixture of Experts
0 likes · 17 min read
Accelerating Large Language Model Inference: Techniques and Framework Recommendations
Baidu Geek Talk
Baidu Geek Talk
May 15, 2024 · Artificial Intelligence

Accelerating Large Model Training and Inference with Baidu Baige AIAK‑LLM: Challenges, Techniques, and Optimizations

The talk outlines how Baidu’s Baige AIAK‑LLM suite tackles the exploding compute demands of trillion‑parameter models by boosting Model FLOPS Utilization through advanced parallelism, memory‑saving recompute, zero‑offload, adaptive scheduling, and cross‑chip orchestration, delivering 30‑60% training and inference speedups and a unified cloud product.

AI infrastructureBaiduInference Optimization
0 likes · 25 min read
Accelerating Large Model Training and Inference with Baidu Baige AIAK‑LLM: Challenges, Techniques, and Optimizations
NewBeeNLP
NewBeeNLP
May 15, 2024 · Artificial Intelligence

How Large Language Models and Knowledge Graphs Can Boost Each Other

This talk reviews recent advances in large language models, compares them with knowledge graphs, explores how LLMs enhance knowledge extraction and completion, examines how knowledge graphs aid LLM evaluation and safe deployment, and outlines future interactive integration between the two technologies.

AI researchKnowledge Graphsknowledge extraction
0 likes · 13 min read
How Large Language Models and Knowledge Graphs Can Boost Each Other
21CTO
21CTO
May 11, 2024 · Artificial Intelligence

Will U.S. AI Export Controls Stall Global Large‑Model Development?

The United States is drafting a bipartisan bill to impose export controls on advanced proprietary AI models, aiming to shield its technology from China, Russia, North Korea and Iran, while confronting challenges of open‑source model regulation and potential geopolitical retaliation.

AI policyChinaExport controls
0 likes · 7 min read
Will U.S. AI Export Controls Stall Global Large‑Model Development?
Baidu Tech Salon
Baidu Tech Salon
May 10, 2024 · Artificial Intelligence

Baidu Comate: Core Capabilities of Intelligent Code Assistant

The article surveys Baidu Comate, an AI‑powered code assistant built on the Wenxin (ERNIE) large model, tracing software development from the 1950s crisis through the internet and open‑source era to today’s AI‑driven tools, and highlights its features and demonstration at a global development conference.

AI codingBaidu ComateIDE plugin
0 likes · 7 min read
Baidu Comate: Core Capabilities of Intelligent Code Assistant
Architects' Tech Alliance
Architects' Tech Alliance
May 9, 2024 · Artificial Intelligence

AI Servers: Market Opportunities, Architecture, and Future Demand Driven by Generative AI

The article examines how the surge of generative AI (AIGC) is fueling rapid growth in AI server demand, detailing the emerging AIGC ecosystem, server hardware composition, model scaling, heterogeneous computing, training vs. inference workloads, market size forecasts, and the competitive landscape of AI server manufacturers.

AI infrastructureAI serversGPU
0 likes · 15 min read
AI Servers: Market Opportunities, Architecture, and Future Demand Driven by Generative AI
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 2, 2024 · Artificial Intelligence

Understanding Large Language Models: Principles, Training, Risks, and Application Security

This article provides a comprehensive overview of large language models (LLMs), explaining their core concepts, transformer architecture, training stages, known shortcomings such as hallucination and reversal curse, and highlights emerging security threats like prompt injection and jailbreaking, offering guidance for safe deployment.

AI safetyLLMjailbreaking
0 likes · 21 min read
Understanding Large Language Models: Principles, Training, Risks, and Application Security
21CTO
21CTO
Apr 28, 2024 · Artificial Intelligence

5 Transformative Business Use Cases for Conversational AI

This article explores how conversational AI, powered by large language models, is reshaping enterprise operations across five key scenarios—from customer support assistants and AI‑driven data interfaces to HR bots, unstructured data processing, and multi‑agent digital assistants—highlighting benefits, implementation considerations, and privacy challenges.

Conversational AIbusiness applicationscustomer support
0 likes · 13 min read
5 Transformative Business Use Cases for Conversational AI
DataFunTalk
DataFunTalk
Apr 26, 2024 · Artificial Intelligence

Large Language Models in the Automotive Industry: Overview, Impact, and Practical Exploration

This article examines how large language models such as GPT and Transformer‑based architectures are reshaping the automotive sector by enhancing in‑vehicle intelligence, streamlining product development, improving customer service, and redefining data analyst roles, while also presenting practical experiments, deployment challenges, and future directions.

Automotive AIData AnalysisGPT
0 likes · 18 min read
Large Language Models in the Automotive Industry: Overview, Impact, and Practical Exploration
Sohu Tech Products
Sohu Tech Products
Apr 24, 2024 · Artificial Intelligence

Evolution, Architecture, Training Data, Methods, and Performance of Meta's Llama Series (Llama 1, 2, 3)

Meta's Llama series has progressed from the 7‑65B Llama‑1 in early 2023 to the 8B and 70B Llama‑3 in 2024, scaling token counts from 1 T to over 15 T, adopting decoder‑only Transformers with RMSNorm, SwiGLU, RoPE and GQA, and adding supervised fine‑tuning, RLHF and DPO, resulting in state‑of‑the‑art benchmark performance and a vibrant open‑source ecosystem.

AILLaMAPerformance Evaluation
0 likes · 25 min read
Evolution, Architecture, Training Data, Methods, and Performance of Meta's Llama Series (Llama 1, 2, 3)
21CTO
21CTO
Apr 23, 2024 · Artificial Intelligence

Deploy Large Language Models with vLLM and Quantization for Low Latency

This guide explains how to deploy open‑source large language models using vLLM, benchmark latency and throughput, and apply 8‑bit/4‑bit quantization techniques such as BitsandBytes and NF4 to achieve faster inference on limited‑GPU hardware.

LLM deploymentPythonlarge language models
0 likes · 13 min read
Deploy Large Language Models with vLLM and Quantization for Low Latency
MoonWebTeam
MoonWebTeam
Apr 23, 2024 · Artificial Intelligence

Exploring Devika AI: An Open‑Source AI Programmer’s Capabilities and Limits

Devika AI, an open‑source AI programmer from Stition AI, is examined for its architecture, supported actions, installation steps, and real‑world performance across tasks such as building a Snake game, Conway’s Game of Life, Vue3 components, and unit‑test generation, highlighting strengths, weaknesses, and future potential.

Devika AITool Evaluationlarge language models
0 likes · 21 min read
Exploring Devika AI: An Open‑Source AI Programmer’s Capabilities and Limits
NewBeeNLP
NewBeeNLP
Apr 22, 2024 · Artificial Intelligence

Why LLAMA‑3’s Scaling Laws Signal the Next AI Frontier

The article analyzes LLAMA‑3’s architectural tweaks, massive data expansion, scaling‑law implications, open‑source versus closed‑source dynamics, and the critical role of synthetic data in sustaining large‑model progress beyond 2025.

LLAMA-3large language modelsopen-source AI
0 likes · 10 min read
Why LLAMA‑3’s Scaling Laws Signal the Next AI Frontier
Xiaohe Frontend Team
Xiaohe Frontend Team
Apr 21, 2024 · Artificial Intelligence

What’s New in Generative AI? VASA‑1, Llama‑3, Stable Diffusion 3 & More

The article reviews the latest breakthroughs in generative AI, including Microsoft’s VASA‑1 video synthesis model, Meta’s open‑source Llama‑3 large language model, Stability AI’s Stable Diffusion 3 API, Adobe’s integration of third‑party AI video tools into Premiere Pro, and a free image‑style‑recreation platform from Freepik, highlighting their technical details and potential applications.

AI toolsdiffusion modelsgenerative AI
0 likes · 13 min read
What’s New in Generative AI? VASA‑1, Llama‑3, Stable Diffusion 3 & More
AntTech
AntTech
Apr 19, 2024 · Artificial Intelligence

OneKE: Open-Source Bilingual Knowledge Extraction Framework for Large Language Models

OneKE, an open‑source bilingual (Chinese‑English) knowledge extraction framework jointly developed by Ant Group and Zhejiang University, enables efficient extraction of entities, relations, and events to build domain knowledge graphs that enhance large language models’ reasoning, reduce hallucinations, and support applications in medical, financial, and governmental sectors.

Artificial IntelligenceKnowledge Graphsbilingual
0 likes · 5 min read
OneKE: Open-Source Bilingual Knowledge Extraction Framework for Large Language Models
DevOps
DevOps
Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringRAGenterprise AI
0 likes · 20 min read
Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning
AntTech
AntTech
Apr 17, 2024 · Artificial Intelligence

LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs

LLMRG introduces a novel framework that leverages large language models to construct personalized reasoning graphs, integrating chain reasoning, self‑verification, divergent extension, and knowledge‑base self‑improvement, thereby enhancing recommendation accuracy, interpretability, and performance across multiple benchmark datasets without additional user or item information.

AIInterpretabilityRecommendation Systems
0 likes · 9 min read
LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs
360 Tech Engineering
360 Tech Engineering
Apr 15, 2024 · Artificial Intelligence

Fine‑Tuning Large Language Models: A Practical Guide Using Qwen‑14B on the 360AI Platform

This article explains the concept, motivations, and step‑by‑step workflow for fine‑tuning large language models—specifically Qwen‑14B—covering data preparation, training commands with DeepSpeed, hyper‑parameter settings, evaluation, and deployment via FastChat, all illustrated with code snippets and configuration details.

AIDeepSpeedFastChat
0 likes · 10 min read
Fine‑Tuning Large Language Models: A Practical Guide Using Qwen‑14B on the 360AI Platform
DataFunSummit
DataFunSummit
Apr 13, 2024 · Artificial Intelligence

Understanding and Mitigating Hallucinations in Large Language Model Industry Q&A with Knowledge Graphs

This article examines why large language models often produce hallucinations in industry question‑answering, defines the phenomenon, explores its data and training origins, proposes evaluation metrics, and presents practical strategies—including high‑quality fine‑tuning data, honest refusal mechanisms, advanced decoding methods, and external knowledge‑graph augmentation—to reduce hallucinations and improve reliability.

AI evaluationhallucinationknowledge graph
0 likes · 21 min read
Understanding and Mitigating Hallucinations in Large Language Model Industry Q&A with Knowledge Graphs
NewBeeNLP
NewBeeNLP
Apr 13, 2024 · Artificial Intelligence

How a Multimodal ‘Joke‑King’ Model Beats GPT‑4 at Humor Generation

A research team from Sun Yat‑sen University, Sea AI Lab and Harvard built a multimodal large model that learns to generate creative jokes and memes by training on the Oogiri‑GO dataset, introducing a Leap‑of‑Thought (LoT) paradigm and CLoT fine‑tuning, which outperforms GPT‑4 and other state‑of‑the‑art models in humor tasks.

CLoTLeap-of-ThoughtOogiri-GO dataset
0 likes · 9 min read
How a Multimodal ‘Joke‑King’ Model Beats GPT‑4 at Humor Generation
Data Thinking Notes
Data Thinking Notes
Apr 11, 2024 · Artificial Intelligence

How Financial Institutions Are Building Their Own Large Language Models

This article explores how the finance sector is creating specialized large language models—covering the shift from generic to domain‑specific models, training innovations, evaluation methods, and real‑world applications such as marketing, customer service, risk control, and operational analytics.

Applicationsfinance AIlarge language models
0 likes · 16 min read
How Financial Institutions Are Building Their Own Large Language Models
Cloud Native Technology Community
Cloud Native Technology Community
Apr 11, 2024 · Cloud Native

Why Kubernetes Is the Ideal Platform for Deploying Large Language Models

Deploying large language models demands massive compute, flexible scaling, and robust resource management, and this article explains how Kubernetes’s auto‑scaling, portability, cloud‑native features, observability tools, and multi‑tenant isolation make it the optimal platform for training, serving, and iterating LLM workloads.

Cloud NativeKubernetesResource Management
0 likes · 17 min read
Why Kubernetes Is the Ideal Platform for Deploying Large Language Models
DataFunSummit
DataFunSummit
Apr 9, 2024 · Artificial Intelligence

Knowledge Map for Large Model Application Development

This article outlines a comprehensive knowledge map for building large‑model applications, detailing a four‑layer technical architecture, development lifecycle, core elements such as prompt engineering and fine‑tuning, evaluation methods, and real‑world case studies across various AI use cases.

AI application developmentlarge language modelsmodel fine-tuning
0 likes · 12 min read
Knowledge Map for Large Model Application Development
NewBeeNLP
NewBeeNLP
Apr 8, 2024 · Artificial Intelligence

What Will Recommendation Systems Look Like in 2026? Emerging Trends and Challenges

This article analyzes the current bottlenecks of conventional recommendation systems and outlines ten forward‑looking research directions for 2026, including retention improvement, user growth, content ecosystem, multi‑objective Pareto optimization, long‑term value estimation, site‑wide optimization, interactive recommendation, personalized modeling, decision‑theoretic framing, and the integration of large language models via the OneRec framework.

User Retentioninteractive recommendationlarge language models
0 likes · 18 min read
What Will Recommendation Systems Look Like in 2026? Emerging Trends and Challenges
DataFunTalk
DataFunTalk
Apr 4, 2024 · Artificial Intelligence

Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework

This article reviews the challenges of textual‑only large language model interaction, introduces benchmark environments such as AFL World and ScienceWorld, compares baseline reinforcement‑learning approaches, and presents SwiftSage—a hybrid system that combines a fast T5‑based small model with a powerful LLM for planning and grounding, demonstrating superior performance, efficiency, and cost‑effectiveness while outlining current limitations and future research directions.

AISwiftSageinteractive agents
0 likes · 22 min read
Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework
DataFunTalk
DataFunTalk
Apr 3, 2024 · Artificial Intelligence

Future Directions of Recommendation Systems: Retention, User Growth, Content Ecosystem, Multi‑Objective Optimization, and Large‑Model Fusion

This presentation outlines the current bottlenecks of conventional recommendation pipelines and proposes a 2026 roadmap that includes retention improvement, user‑growth strategies, content‑ecosystem metrics, Pareto‑optimal multi‑objective optimization, long‑term value modeling, site‑wide spatial optimization, interactive recommendation, personalized modeling, and the integration of large‑model fusion through the OneRec framework.

Recommendation SystemsUser Retentioninteractive recommendation
0 likes · 18 min read
Future Directions of Recommendation Systems: Retention, User Growth, Content Ecosystem, Multi‑Objective Optimization, and Large‑Model Fusion
DataFunTalk
DataFunTalk
Apr 2, 2024 · Artificial Intelligence

User Portrait Algorithms: From Ontology‑Based Methods to Deep Learning and Future Directions

This article provides a comprehensive overview of user portrait algorithms, covering their historical development, ontology‑based traditional approaches, deep‑learning enhancements, representation‑learning techniques such as lookalike, active‑learning driven iteration, and the integration of large‑model world knowledge, while also discussing current challenges and future research directions.

Recommendation Systemsactive learningdeep learning
0 likes · 26 min read
User Portrait Algorithms: From Ontology‑Based Methods to Deep Learning and Future Directions
DataFunSummit
DataFunSummit
Mar 31, 2024 · Artificial Intelligence

Challenges and Techniques in Distributed Training of Large Language Models

This article reviews the rapid development of large language models since 2019, outlines the historical background, identifies key challenges such as massive compute demand, memory constraints, and system complexity, and then details distributed training technologies—including data parallelism, pipeline parallelism, and advanced optimization strategies—while also discussing future research directions and answering common questions.

AI infrastructureData ParallelismDeepSpeed
0 likes · 23 min read
Challenges and Techniques in Distributed Training of Large Language Models
DaTaobao Tech
DaTaobao Tech
Mar 29, 2024 · Artificial Intelligence

Text-to-SQL with Large Language Models: DIN-SQL Approach

The DIN‑SQL approach enhances Text‑to‑SQL performance by using large language models in a decomposed in‑context learning framework with schema linking, query classification, SQL generation, and self‑correction modules, achieving state‑of‑the‑art 85.3% execution accuracy on the Spider benchmark by breaking complex queries into manageable sub‑tasks.

AI researchData AnalysisDatabase Querying
0 likes · 34 min read
Text-to-SQL with Large Language Models: DIN-SQL Approach
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI accelerationNVIDIANeMo Framework
0 likes · 16 min read
NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 26, 2024 · Artificial Intelligence

MoE LLMs: How Alibaba Cloud & NVIDIA Megatron-Core Accelerate Training

This article reviews the evolution of Mixture-of-Experts (MoE) models, details Alibaba Cloud’s collaboration with NVIDIA’s Megatron-Core to build a high-performance MoE framework, and presents extensive training optimizations, benchmark results, conversion tools, and best-practice guidelines for large-scale LLM development and deployment.

Alibaba CloudMegatron-CoreMoE
0 likes · 18 min read
MoE LLMs: How Alibaba Cloud & NVIDIA Megatron-Core Accelerate Training
NewBeeNLP
NewBeeNLP
Mar 21, 2024 · Artificial Intelligence

Mastering Large Language Model Training: Key Challenges and Optimization Strategies

This article examines the resource and efficiency challenges of scaling large language model training, explains data, model, pipeline, and tensor parallelism, and provides practical I/O, communication, and stability optimization techniques—including high‑availability storage, RDMA networking, NCCL tuning, and fault‑tolerant recovery—to improve throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization
0 likes · 15 min read
Mastering Large Language Model Training: Key Challenges and Optimization Strategies
TAL Education Technology
TAL Education Technology
Mar 20, 2024 · Artificial Intelligence

Understanding AI: From Brain Differences to Data Science Practices and Large Model Applications

This article explains why current AI cannot achieve self‑awareness, outlines data‑science steps for large models—including preprocessing, exploratory analysis, modeling, and evaluation—then surveys general and vertical applications of large language models and details a complete machine‑learning workflow with transformer fine‑tuning techniques.

AIApplicationsMachine Learning
0 likes · 14 min read
Understanding AI: From Brain Differences to Data Science Practices and Large Model Applications