Tagged articles

1067 articles

Page 9 of 11

Aug 16, 2024 · Industry Insights

How LLMs Are Evolving from Language Mimicry to Real-World Simulation

Recent breakthroughs in AI, from large language models gaining real-world simulation abilities to rapid AI-chip advancements and the surge of open-source models, are reshaping industries, highlighting both unprecedented opportunities and the need for ethical, secure deployment across sectors.

AI chipsArtificial IntelligenceOpen Source

0 likes · 7 min read

How LLMs Are Evolving from Language Mimicry to Real-World Simulation

DaTaobao Tech

Aug 16, 2024 · Artificial Intelligence

Effective Prompt Design for Large Language Models

Effective prompt design for large language models requires clear goals, relevant context, explicit input/output formats, evaluation criteria, and illustrative examples, combined with specific language, step‑by‑step instructions, edge‑case handling, ethical considerations, and proper tokenization, encoding, decoding, and post‑processing to produce accurate, concise, low‑hallucination responses.

AIlarge language modelsnatural language processing

0 likes · 33 min read

Effective Prompt Design for Large Language Models

DataFunSummit

Aug 16, 2024 · Artificial Intelligence

Educational Large Language Model Research and Product Applications for Youth Programming

The presentation outlines the challenges of sparse data and delayed learning effects in youth programming education, introduces three technical breakthroughs—dual‑data model training, hierarchical knowledge‑graph prompting, and reinforcement‑based cognitive recommendation—and showcases product implementations such as the Frog Programming Platform, AI learning machine, and digital‑human recorded courses.

AIeducationlarge language models

0 likes · 19 min read

Educational Large Language Model Research and Product Applications for Youth Programming

Qunhe Technology Quality Tech

Aug 14, 2024 · Artificial Intelligence

Should Your Testing Team Build a Private LLM or Use RAG with a General Model?

This article compares the high costs and technical challenges of building a private large language model with the benefits, flexibility, and lower risk of using Retrieval‑Augmented Generation (RAG) on a general LLM, offering practical guidance for testing teams seeking AI assistance.

AIModel DeploymentRAG

0 likes · 11 min read

Should Your Testing Team Build a Private LLM or Use RAG with a General Model?

AntTech

Aug 13, 2024 · Artificial Intelligence

Ant Group Contributions to ACL 2024: Summaries of 14 Accepted Papers Across NLP and AI

From August 11‑16, 2024 the ACL conference in Bangkok featured 14 Ant Group papers covering large‑scale information extraction, decomposed LLMs for semantic search, multimodal hallucination detection, long‑context attention mechanisms, concept‑reasoning datasets, knowledge‑graph alignment, and more, highlighting the group's breadth in natural language processing and AI research.

ACL2024MultimodalNLP

0 likes · 20 min read

Ant Group Contributions to ACL 2024: Summaries of 14 Accepted Papers Across NLP and AI

DaTaobao Tech

Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AIOptimizationRAG

0 likes · 22 min read

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

DataFunSummit

Aug 8, 2024 · Artificial Intelligence

Exploring Training and Alignment Techniques for Financial Large Models

The announcement details a DataFun Summit 2024 session where Du Xiaoman AI researcher Huo Liangyu will present on the challenges, development, and alignment methods of the Xuan Yuan financial large language model, highlighting RLHF techniques, data collection, and real‑world deployment insights for the finance sector.

AIFinanceFinancial AI

0 likes · 6 min read

Exploring Training and Alignment Techniques for Financial Large Models

Data Thinking Notes

Aug 6, 2024 · Artificial Intelligence

How Large Language Models Are Revolutionizing R&D Operations and Telecom Networks

Large language models are increasingly applied in research and development operations, boosting efficiency and automating processes such as coding assistance, testing, requirement analysis, documentation, knowledge management, and network traffic analysis, while also enhancing security and enabling intelligent transformation across industries, especially telecom.

AI in telecomR&D automationlarge language models

0 likes · 3 min read

How Large Language Models Are Revolutionizing R&D Operations and Telecom Networks

DeWu Technology

Aug 5, 2024 · Frontend Development

Large Model Innovations Redefining Frontend Development – Key Takeaways

The July 14 DeWu tech salon showcased how large language models are reshaping frontend development, featuring insights from NetEase, Alibaba, and DeWu experts on AI‑driven low‑code platforms, intelligent coding assistants, and practical implementation strategies, with over 20,000 online viewers.

AIFrontendLow‑code

0 likes · 8 min read

Large Model Innovations Redefining Frontend Development – Key Takeaways

Software Development Quality

Aug 5, 2024 · Artificial Intelligence

How Large Language Models Can Transform Software Testing

This article explores how large language models can automate test case generation, predict defects, analyze results, optimize strategies, execute intelligent testing, and assist compatibility checks, while providing practical tools, real-world case studies, and a step‑by‑step GPT‑4 testing workflow.

AI testingdefect predictionlarge language models

0 likes · 15 min read

How Large Language Models Can Transform Software Testing

DataFunSummit

Aug 4, 2024 · Artificial Intelligence

Graph Technology Overview and Applications – From GraphGPT to Graph Databases

This article presents a comprehensive overview of recent advances in graph technology, covering GraphGPT for large language models, knowledge transfer on complex graphs, financial fraud detection, telecom network optimization, graph foundation models, Baidu's multi‑domain recommendation, high‑availability graph databases, and Kuaishou's efficient recommendation architecture.

Recommendation Systemsfinancial fraud detectiongraph databases

0 likes · 4 min read

Graph Technology Overview and Applications – From GraphGPT to Graph Databases

NewBeeNLP

Aug 3, 2024 · Artificial Intelligence

Extending LLM Context to 1M Tokens: SAMBA, CoPE, RoPE, Retrieval Heads & Infini‑Attention

This article reviews recent research on extending large language model context windows to millions of tokens, covering SAMBA's hybrid architecture, Contextual Position Encoding (CoPE), RoPE base length theory, Retrieval Head analysis, and the memory‑efficient Infini‑Attention mechanism.

Efficient AttentionLLM researchlarge language models

0 likes · 10 min read

Extending LLM Context to 1M Tokens: SAMBA, CoPE, RoPE, Retrieval Heads & Infini‑Attention

Data Thinking Notes

Aug 1, 2024 · Artificial Intelligence

Unlocking Vertical Domain LLMs: Advantages, Challenges, and Alignment Strategies

Over the past year our team explored applying large language models to specialized domains, detailing their professional benefits, unique challenges such as accuracy and knowledge‑base maintenance, and presenting solutions like alignment enhancement via BPO, Text2API, RAG, and advanced SFT/DPO techniques.

RAGSFTText2API

0 likes · 10 min read

Unlocking Vertical Domain LLMs: Advantages, Challenges, and Alignment Strategies

Xiaohongshu Tech REDtech

Aug 1, 2024 · Artificial Intelligence

Xiaohongshu Search Advertising Recall: Practices, Metrics, and Large‑Model Integration

Xiaohongshu’s search advertising recall system evolves from keyword bidding to BERT‑based vector retrieval and LLM‑enhanced query rewriting, using dual semantic and efficiency models, water‑level metrics, and GPU‑accelerated engineering to achieve 80 % click coverage, 60 % conversion coverage and a 5 % CPM lift.

Artificial Intelligenceefficiency optimizationlarge language models

0 likes · 33 min read

Xiaohongshu Search Advertising Recall: Practices, Metrics, and Large‑Model Integration

Sohu Tech Products

Jul 31, 2024 · Artificial Intelligence

MMEvalPro: A Trustworthy Benchmark for Evaluating Multimodal Large Models

MMEvalPro, a new benchmark created by researchers from Peking University, Chinese Academy of Medical Sciences, CUHK and Alibaba, augments existing multimodal datasets with perception and knowledge questions and introduces a Genuine Accuracy metric, revealing that top multimodal models still lag far behind humans and exposing shortcut‑driven performance on prior tests.

MMEvalProMultimodal Evaluationbenchmark

0 likes · 11 min read

MMEvalPro: A Trustworthy Benchmark for Evaluating Multimodal Large Models

NewBeeNLP

Jul 31, 2024 · Artificial Intelligence

Training 7B–13B LLMs: Practical Tips, Hyperparameters, and Scaling Challenges

The article shares hands‑on experience training 7‑ and 13‑billion‑parameter language models, covering essential hyper‑parameters, hardware requirements, data quality considerations, open dataset resources, and the systemic difficulties that arise when scaling to trillion‑parameter models.

LLM traininghyperparameterslarge language models

0 likes · 8 min read

Training 7B–13B LLMs: Practical Tips, Hyperparameters, and Scaling Challenges

Tencent Cloud Developer

Jul 30, 2024 · Artificial Intelligence

A Systematic Guide to Prompt Engineering: From Zero to One

This guide walks readers from beginner to proficient Prompt Engineer by outlining the evolution of prompting, introducing a universal four‑component template, and detailing a five‑step workflow—including refinement, retrieval‑augmented generation, chain‑of‑thought reasoning, and advanced tuning techniques—plus evaluation metrics for LLM performance.

AI promptingLLM optimizationRAG

0 likes · 51 min read

A Systematic Guide to Prompt Engineering: From Zero to One

21CTO

Jul 28, 2024 · Artificial Intelligence

How Anaconda Is Building an AI Operating System with High‑Performance Python

At PyCon US 2024, Anaconda’s Peter Wang outlined the company’s strategy to create an AI operating system by accelerating Python, launching the Anaconda Toolbox and AI Navigator, and addressing the challenges of integrating data, code, and large‑language models for enterprise AI workloads.

AI NavigatorAI Operating SystemAnaconda

0 likes · 6 min read

How Anaconda Is Building an AI Operating System with High‑Performance Python

DataFunSummit

Jul 28, 2024 · Artificial Intelligence

Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions

This article reviews why large language models can be applied to graph learning, outlines their capabilities and graph data characteristics, surveys current research across different graph types and LLM roles, and proposes future research directions for unified cross‑domain graph learning.

AIMultimodalResearch Directions

0 likes · 19 min read

Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions

DataFunSummit

Jul 25, 2024 · Artificial Intelligence

LOGIN: Large‑Model‑Assisted Graph Neural Networks for User Behavior Risk Control

This article presents the latest advances from the Chinese Academy of Sciences in graph machine learning for user behavior risk control, introducing the LOGIN framework that leverages large language models as consultants to iteratively enhance GNN training, and demonstrates its effectiveness through extensive experiments on homogeneous and heterogeneous graph benchmarks.

Machine Learninggraph neural networkslarge language models

0 likes · 14 min read

LOGIN: Large‑Model‑Assisted Graph Neural Networks for User Behavior Risk Control

21CTO

Jul 24, 2024 · Artificial Intelligence

Meta’s Llama 3.1 405B: How the Open‑Source Giant Stands Up to GPT‑4 and Claude 3.5

Meta’s newly released Llama 3.1 series, highlighted by the 405B model trained on 150 trillion tokens, claims state‑of‑the‑art performance in coding, mathematics, and multilingual summarization while offering an open‑source alternative to GPT‑4o and Claude 3.5 Sonnet.

AI competitionLlama 3.1large language models

0 likes · 6 min read

Meta’s Llama 3.1 405B: How the Open‑Source Giant Stands Up to GPT‑4 and Claude 3.5

Kuaishou Tech

Jul 23, 2024 · Artificial Intelligence

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

This paper introduces Parrot, a system that enhances large language models' (LLMs) multi-turn instruction following capabilities through context-aware preference optimization (CaPO) and synthetic data generation, achieving significant performance improvements with limited training data.

CaPONLPdata synthesis

0 likes · 9 min read

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

DataFunTalk

Jul 21, 2024 · Artificial Intelligence

Integrating DataOps with Large Language Models for Text2SQL: Practices, Challenges, and Future Directions

This article presents a comprehensive overview of how DataOps principles combined with large language models such as GPT‑4 enable more agile and intelligent data engineering workflows, focusing on Text2SQL applications, schema‑linking techniques, practical product implementations, and future research challenges.

AIDataOpsSchema Linking

0 likes · 23 min read

Integrating DataOps with Large Language Models for Text2SQL: Practices, Challenges, and Future Directions

IT Services Circle

Jul 17, 2024 · Artificial Intelligence

Why Large Language Models Mistake 9.11 > 9.9: Prompting, Tokenizer Effects, and Recent Findings

The article examines why leading large language models such as GPT‑4o, Gemini Advanced, and Claude 3.5 incorrectly claim that 9.11 is larger than 9.9, analyzes tokenization and prompting strategies that cause the error, and discusses recent research and OpenAI model updates.

AI reasoningNumerical Comparisonlarge language models

0 likes · 7 min read

Why Large Language Models Mistake 9.11 > 9.9: Prompting, Tokenizer Effects, and Recent Findings

Java Tech Enthusiast

Jul 12, 2024 · Artificial Intelligence

Why Alibaba’s Qwen‑2 Is Outperforming Global LLMs and What It Means for AI

After OpenAI halted API access in China, Alibaba’s Tongyi Qwen‑2 quickly rose to the top of global open‑source LLM leaderboards, surpassing Meta’s Llama‑3 and other contenders, with detailed benchmark scores, performance gains over previous versions, and implications for China’s AI ecosystem.

AI BenchmarkAlibabaChina AI

0 likes · 5 min read

Why Alibaba’s Qwen‑2 Is Outperforming Global LLMs and What It Means for AI

Kuaishou Large Model

Jul 11, 2024 · Artificial Intelligence

Pipeline-Aware Offloading & Balanced Checkpointing Accelerate LLM Training

Researchers from Kwai’s large-model team present a novel training system that combines pipeline-parallel-aware activation offloading with a compute-memory balanced checkpointing strategy, enabling lossless acceleration of large language models, achieving up to 42.7% MFU on 256 NVIDIA H800 GPUs while reducing memory usage.

GPU trainingKwaiPerformance Modeling

0 likes · 13 min read

Pipeline-Aware Offloading & Balanced Checkpointing Accelerate LLM Training

Kuaishou Tech

Jul 11, 2024 · Artificial Intelligence

Kuaishou Open-Sources Kolors: A High-Performance Text-to-Image Model Rivaling Midjourney v6

Kuaishou has officially open-sourced Kolors, a state-of-the-art text-to-image diffusion model that leverages ChatGLM3 for advanced bilingual text understanding and employs a two-stage training strategy to achieve photographic image quality rivaling leading proprietary systems.

generative AIlarge language modelsopen-source AI

0 likes · 8 min read

Kuaishou Open-Sources Kolors: A High-Performance Text-to-Image Model Rivaling Midjourney v6

JD Tech

Jul 11, 2024 · Artificial Intelligence

Intelligent Parcel Identification in JD Express Logistics Using Large Language Models

This article examines the challenges of low parcel matching rates in JD Express logistics and proposes a large‑model‑based intelligent identification system, detailing its architecture, accuracy validation, cost‑saving cache strategy, and future prospects for improved efficiency and personalized services.

AI in e-commerceLogisticsOperational Efficiency

0 likes · 24 min read

Intelligent Parcel Identification in JD Express Logistics Using Large Language Models

NewBeeNLP

Jul 10, 2024 · Artificial Intelligence

Can Large Language Models Master Co‑Temporal Reasoning? Introducing COTEMPQA

This article presents the COTEMPQA benchmark for evaluating large language models on co‑temporal reasoning, details its four scenario types, construction pipeline, experimental results across models, error analysis, and proposes the MR‑COT strategy that leverages mathematical reasoning to significantly improve performance.

LLM evaluationMR-COTbenchmark dataset

0 likes · 11 min read

Can Large Language Models Master Co‑Temporal Reasoning? Introducing COTEMPQA

DataFunSummit

Jul 9, 2024 · Artificial Intelligence

Applying Large Language Models to Recommendation Systems at Ant Group

This article details Ant Group's research on integrating large language models into recommendation pipelines, covering background challenges, knowledge extraction, teacher‑student distillation, experimental results, and practical Q&A for improving bias, efficiency, and cold‑start performance.

AI researchAnt Groupknowledge extraction

0 likes · 14 min read

Applying Large Language Models to Recommendation Systems at Ant Group

DataFunSummit

Jul 6, 2024 · Artificial Intelligence

Synergy Between Large Language Models and Knowledge Graphs: Recent Advances, Evaluation, and Future Integration

This article reviews the rapid progress of large language models and their complementary relationship with knowledge graphs, covering comparative strengths, knowledge extraction and completion, evaluation benchmarks, deployment benefits, complex reasoning support, and prospects for interactive fusion toward more reliable and explainable AI systems.

AI evaluationKnowledge Graphsknowledge extraction

0 likes · 12 min read

Synergy Between Large Language Models and Knowledge Graphs: Recent Advances, Evaluation, and Future Integration

21CTO

Jul 5, 2024 · Artificial Intelligence

15 Real-World Ways Companies Leverage Large Language Models

This article explores fifteen detailed examples of how major companies across sectors—from streaming and e‑commerce to transportation and social platforms—are harnessing large language models to improve search, personalize communications, detect fraud, and enhance operational efficiency.

AI case studiesLLM applicationsMachine Learning

0 likes · 9 min read

15 Real-World Ways Companies Leverage Large Language Models

Bilibili Tech

Jul 5, 2024 · Artificial Intelligence

Bilibili's AI Innovations at WAIC 2024: Empowering Creators and Transforming Content

At WAIC 2024, Bilibili unveiled a suite of AI tools—including a 1:1 digital‑avatar generator, dynamic comic technology, a customized voice library for virtual singer Luo Tianyi, the BiliStudio video‑audio model, and the Index‑1.9B large‑language models—empowering creators, cutting production costs, and serving its 80 million‑plus monthly users with advanced content‑creation and commercial‑marketing solutions.

AI content creationAI marketingBilibili

0 likes · 7 min read

Bilibili's AI Innovations at WAIC 2024: Empowering Creators and Transforming Content

JD Tech

Jul 5, 2024 · Artificial Intelligence

Generative Recommendation Systems for JD Alliance Advertising: Architecture, Implementation, and Experimental Evaluation

This article surveys how large language models reshape recommendation systems, presents a generative RS framework tailored for JD Alliance advertising, details material representation, model input, training and inference pipelines, and reports extensive offline and online experiments demonstrating its effectiveness on sparse user data.

Generative RecommendationLLMe-commerce advertising

0 likes · 27 min read

Generative Recommendation Systems for JD Alliance Advertising: Architecture, Implementation, and Experimental Evaluation

Meituan Technology Team

Jul 4, 2024 · Artificial Intelligence

Meituan Search Advertising: Evolution of Recall Strategies and Generative Approaches

Meituan’s search advertising has progressed from rule‑based keyword mining to hierarchical recall that partitions traffic and supply, and now to generative recall using large language models, chain‑of‑thought generation, diffusion‑enhanced multimodal vectors, and knowledge distillation, expanding the decision space while tackling compute and ROI challenges.

Generative ModelsMeituanMultimodal Retrieval

0 likes · 19 min read

Meituan Search Advertising: Evolution of Recall Strategies and Generative Approaches

360 Smart Cloud

Jul 4, 2024 · Artificial Intelligence

Optimizing Mixture-of-Experts (MoE) Training with the QLM Framework

This article introduces the background and challenges of large language model training, explains the Mixture-of-Experts (MoE) architecture, and details several optimization techniques implemented in the QLM framework—including fine-grained and shared experts, top‑k gating, token distribution, expert parallelism, and grouped GEMM – to improve training efficiency and performance.

AIMixture of ExpertsQLM

0 likes · 10 min read

Optimizing Mixture-of-Experts (MoE) Training with the QLM Framework

NewBeeNLP

Jul 3, 2024 · Industry Insights

What Dominated the AI Landscape in Q2 2024? From Llama 3 to GPT‑4o and Global Price Wars

The second quarter of 2024 saw a whirlwind of AI developments—including Meta’s open‑source Llama 3, Microsoft’s fleeting WizardLM‑2, a wave of model price cuts, major IPOs, legislative restrictions, and the debut of OpenAI’s multimodal GPT‑4o—painting a vivid picture of rapid innovation, fierce competition, and shifting market dynamics across the global AI ecosystem.

AI modelsAI policyOpen Source

0 likes · 24 min read

What Dominated the AI Landscape in Q2 2024? From Llama 3 to GPT‑4o and Global Price Wars

JD Cloud Developers

Jul 2, 2024 · Operations

How Large Language Models Are Transforming Modern IT Operations

From manual server management to automated scripts, AIOps, and ChatOps, this article traces the evolution of IT operations and demonstrates how large language models boost efficiency, enable intelligent assistants, automated diagnostics, and smart log analysis, aiming for rapid fault detection, localization, and resolution.

ChatOpsOperationsaiops

0 likes · 7 min read

How Large Language Models Are Transforming Modern IT Operations

Continuous Delivery 2.0

Jul 2, 2024 · Artificial Intelligence

Dynamic Integrated Developer Activity (DIDACT): Large Sequence Models for Software Development

The article introduces DIDACT, a large‑scale multitask machine‑learning framework that trains on the full software‑development workflow—including edits, builds, reviews, and tool interactions—to create AI assistants that can predict and suggest developer actions throughout the coding process.

AI for CodeMachine Learningdeveloper assistance

0 likes · 11 min read

Dynamic Integrated Developer Activity (DIDACT): Large Sequence Models for Software Development

DaTaobao Tech

Jul 1, 2024 · Artificial Intelligence

Recent Progress in Vision-Language Models (VLMs)

Over the past year, Vision‑Language Models have surged from early multimodal experiments to competitive open‑source systems rivaling GPT‑4, driven by higher‑resolution processing, richer vision encoders, better projection layers, and larger curated datasets, yet they still face evaluation difficulties, hallucinations, speed limits, and limited multimodal output.

Vision-Language Modelscomputer visiondeep learning

0 likes · 24 min read

Recent Progress in Vision-Language Models (VLMs)

JD Tech

Jun 28, 2024 · Artificial Intelligence

An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI

This article provides a comprehensive introduction to large language models, covering their historical development, core architecture, training process, prompt engineering techniques, Retrieval‑Augmented Generation, agent frameworks, multimodal capabilities, safety challenges, and future research directions.

AI agentsAI safetyMultimodal

0 likes · 22 min read

An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI

DataFunSummit

Jun 26, 2024 · Artificial Intelligence

2026 Roadmap for Recommendation Systems: Challenges, Research Directions, and OneRec Integration

This article outlines the current bottlenecks of conventional recommendation pipelines and proposes a comprehensive 2026 research agenda covering retention improvement, user growth, content ecosystem, multi‑objective Pareto optimization, long‑term value modeling, whole‑site optimization, interactive recommendation, personalized modeling, decision‑theoretic formulation, and the OneRec multi‑source fusion framework.

User Retentionlarge language modelsmulti-objective optimization

0 likes · 18 min read

2026 Roadmap for Recommendation Systems: Challenges, Research Directions, and OneRec Integration

JD Tech Talk

Jun 25, 2024 · Artificial Intelligence

Understanding Large Language Models: From Parameters to Transformer Architecture

This article explains the fundamental concepts behind large language models, including their two-file structure, training process, neural network basics, perceptron examples, weight and threshold calculations, the TensorFlow Playground, and a detailed walkthrough of the Transformer architecture with tokenization, positional encoding, self‑attention, normalization, and feed‑forward layers.

AIMachine LearningSelf-Attention

0 likes · 20 min read

Understanding Large Language Models: From Parameters to Transformer Architecture

NewBeeNLP

Jun 24, 2024 · Artificial Intelligence

How Domain Large Models Are Shaping the Future of AI: Challenges and Solutions

This article reviews Fudan University's Knowledge Factory Lab research on domain large models, covering background, three major deployment challenges, data‑selection strategies, ability‑enhancement techniques, collaborative workflows, and retrieval‑augmented generation methods that aim to make large models practical for real‑world tasks.

domain adaptationknowledge extractionlarge language models

0 likes · 18 min read

How Domain Large Models Are Shaping the Future of AI: Challenges and Solutions

DataFunSummit

Jun 23, 2024 · Artificial Intelligence

Tongyi Xingchen Personalized Large Model: Technical Overview and Applications

This article summarizes the development background of large language models, Alibaba's progression in foundational and personalized AI, the design and capabilities of the Tongyi Xingchen personalized model, its multimodal and agent-based architecture, various industry use cases, and the safety and responsibility measures applied to ensure trustworthy AI deployment.

AI safetylarge language modelsmodel agents

0 likes · 13 min read

Tongyi Xingchen Personalized Large Model: Technical Overview and Applications

Baidu Intelligent Cloud Tech Hub

Jun 21, 2024 · Databases

How Vector Databases and Large Models are Transforming AI-Driven Database Operations

This article reviews the evolution of databases and large models, explains the role of vector databases and Retrieval‑Augmented Generation (RAG) in AI‑enhanced data management, and showcases Baidu Cloud's VectorDB and DBSC solutions for intelligent database operations and knowledge‑driven services.

AI4DBDatabase operationsRAG

0 likes · 15 min read

How Vector Databases and Large Models are Transforming AI-Driven Database Operations

Xiaohongshu Tech REDtech

Jun 20, 2024 · Artificial Intelligence

Xiaohongshu 2024 Large Model Frontier Paper Sharing Live Event

On June 27, 2024, Xiaohongshu’s technical team will livestream a two‑hour session across WeChat Channels, Bilibili, Douyin and Xiaohongshu, showcasing six top‑conference papers on large‑model advances—including early‑stopping and fine‑grained self‑consistency, novel evaluation methods, negative‑sample‑assisted distillation, and LLM‑based note recommendation—followed by a Q&A and recruitment briefing.

AI researchKnowledge DistillationRecommendation Systems

0 likes · 12 min read

Xiaohongshu 2024 Large Model Frontier Paper Sharing Live Event

JD Tech Talk

Jun 20, 2024 · Artificial Intelligence

Applying Large Language Models to Courier Operations: Intelligent Operations, Q&A, Prompting, and Agents

This article describes how large language models such as ChatGPT are integrated into courier terminal systems to automate tasks, enhance intelligent voice operations, enable retrieval‑augmented question answering, generate smart prompts, and explore agent‑based workflows, supported by code examples for data extraction, splitting, and embedding.

AI for logisticsIntelligent OperationsRetrieval-Augmented Generation

0 likes · 14 min read

Applying Large Language Models to Courier Operations: Intelligent Operations, Q&A, Prompting, and Agents

DataFunSummit

Jun 18, 2024 · Artificial Intelligence

Conditional and Multimodal Knowledge Graph Construction, Extraction, and Integration with Large Models

This article presents a comprehensive overview of conditional and multimodal knowledge graphs, covering their background, construction pipelines, extraction techniques, dataset creation, semi‑supervised learning strategies, and how they can be fused with large language models for enhanced reasoning and application in tasks such as intelligent QA and video scene graph generation.

AIconditional KGinformation extraction

0 likes · 23 min read

Conditional and Multimodal Knowledge Graph Construction, Extraction, and Integration with Large Models

DataFunTalk

Jun 15, 2024 · Artificial Intelligence

Research on Domain Large Models by Fudan University Knowledge Factory Lab

This article presents Fudan University's Knowledge Factory Lab research on domain large models, covering background, challenges, data selection, source‑enhanced tagging, capability improvements, self‑correction, collaborative workflows, and retrieval‑augmented generation for practical AI deployment.

AI researchdomain adaptationknowledge graph

0 likes · 16 min read

Research on Domain Large Models by Fudan University Knowledge Factory Lab

Baidu Tech Salon

Jun 14, 2024 · Artificial Intelligence

Why Large Models Signal the Dawn of General AI: Insights from Baidu’s CTO

In a keynote at the 2024 Beijing Zhiyuan Conference, Baidu’s CTO Wang Haifeng explained how large‑model universality and comprehensive capabilities are driving artificial general intelligence forward, highlighting scale laws, multimodal advances, agent technologies, and the industrial‑scale production of AI.

AI industrializationAI trendsGeneral AI

0 likes · 7 min read

Why Large Models Signal the Dawn of General AI: Insights from Baidu’s CTO

DataFunTalk

Jun 14, 2024 · Artificial Intelligence

Shopee's E‑commerce Knowledge Graph Construction and Integration with Large Models

This article presents Shopee's comprehensive exploration of building an e‑commerce knowledge graph, detailing its challenges, construction pipeline, AI‑driven extraction and fusion techniques, multilingual and multimodal modeling, and practical applications ranging from search and recommendation to AI assistants and real‑time updates.

AI applicationsMultimodale-commerce

0 likes · 21 min read

Shopee's E‑commerce Knowledge Graph Construction and Integration with Large Models

AntTech

Jun 13, 2024 · Artificial Intelligence

Exploring Multi‑Agent Applications in Financial Scenarios and the agentUniverse Framework

The article reviews the evolution from large language models to stateful agents, discusses the specific challenges of information‑dense, knowledge‑dense, and decision‑dense financial tasks, and introduces the open‑source agentUniverse multi‑agent framework with its PEER collaboration model and real‑world investment‑research applications.

AI Research AssistantFinancial AIPEER framework

0 likes · 18 min read

Exploring Multi‑Agent Applications in Financial Scenarios and the agentUniverse Framework

AntTech

Jun 6, 2024 · Information Security

AIGC Era Trends in Next‑Generation Identity Recognition: DeepFake Risks, AIGC as a New Production Force, and Cross‑Terminal Interaction

The talk at the 18th Security Identification Technology Expo and Summit outlines three emerging trends for identity verification in the AIGC era: the surge of deep‑fake attacks, the use of generative AI as a new data‑production engine, and the shift toward cross‑device, agent‑based authentication paradigms.

AIGCBiometricsIdentity verification

0 likes · 10 min read

AIGC Era Trends in Next‑Generation Identity Recognition: DeepFake Risks, AIGC as a New Production Force, and Cross‑Terminal Interaction

Baobao Algorithm Notes

Jun 3, 2024 · Artificial Intelligence

Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?

Recent arXiv work introduces ATM, an adversarially‑tuned multi‑agent system that iteratively pits a fake‑knowledge attacker against a generator, dramatically improving retrieval‑augmented language models’ resistance to hallucinated content and boosting performance on knowledge‑intensive benchmarks, even with noisy or irrelevant documents.

Hallucination MitigationRAGadversarial training

0 likes · 12 min read

Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?

58 Tech

Jun 3, 2024 · Artificial Intelligence

Parameter-Efficient Fine-Tuning (PEFT) Methods for Large Language Models: LoRA, QLoRA, AdaLoRA, SoRA, and Training Acceleration with Unsloth

This article systematically analyzes popular parameter‑efficient fine‑tuning (PEFT) techniques for large language models—including Adapter Tuning, Prefix Tuning, LoRA, QLoRA, AdaLoRA, and SoRA—detailing their principles, implementation code, experimental results on NLU tasks, and practical acceleration using the Unsloth library.

AdaLoRALoRAPEFT

0 likes · 39 min read

Parameter-Efficient Fine-Tuning (PEFT) Methods for Large Language Models: LoRA, QLoRA, AdaLoRA, SoRA, and Training Acceleration with Unsloth

DataFunSummit

Jun 1, 2024 · Artificial Intelligence

Graph Foundation Models: Concepts, Progress, and Future Directions

This article provides a comprehensive overview of Graph Foundation Models (GFMs), covering their definition, key characteristics, historical development of graph machine learning, recent research trends such as PT‑HGNN, Specformer, and GraphTranslator, and discusses future challenges and research directions.

Machine Learningfoundation modelsgraph neural networks

0 likes · 23 min read

Graph Foundation Models: Concepts, Progress, and Future Directions

DataFunTalk

May 31, 2024 · Artificial Intelligence

The Role of Knowledge Graphs in Industry: Importance, Product Forms, and Practical Cases

This article explains why knowledge graphs are crucial for industrial applications, describes the main product forms and architectural considerations, and shares real‑world case studies illustrating how AI, large models, and graph databases can be combined to improve knowledge management and decision‑making.

AIGraph DatabaseIndustrial Applications

0 likes · 20 min read

The Role of Knowledge Graphs in Industry: Importance, Product Forms, and Practical Cases

Kuaishou Tech

May 27, 2024 · Artificial Intelligence

What Kuaishou’s Four ACL Papers Reveal About the Future of Large Language Models

The 62nd ACL conference accepted four papers from Kuaishou that explore multi‑turn instruction following, self‑agreement reasoning, fine‑grained reinforcement learning, and dynamic routing in Mixture‑of‑Experts models, each with detailed methods, experimental results, author lists, and public arXiv links.

ACL 2024Kuaishou ResearchMixture of Experts

0 likes · 11 min read

What Kuaishou’s Four ACL Papers Reveal About the Future of Large Language Models

Baidu Intelligent Cloud Tech Hub

May 27, 2024 · Databases

Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets

An exclusive interview with Baidu’s senior database architects reveals the motivations behind building a dedicated enterprise vector database, details its novel column‑store engine, C++‑based retrieval stack, performance gains over open‑source solutions, multi‑modal support, RAG integration, and future research directions.

AIRAGStorage Engine

0 likes · 28 min read

Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets

Baidu Tech Salon

May 20, 2024 · Artificial Intelligence

Boosting Ad Efficiency with Baidu’s Multi‑Agent AI Architecture

In the AI‑native era, Baidu's ad platform adopts a multi‑agent architecture that combines large and small LLMs, SOP‑driven workflows, long‑term memory, and vector databases to achieve high query accuracy, low latency, and significant business gains while tackling challenges such as hallucination, planning, execution, and personalization.

AI agentsIndustry InsightsLLM optimization

0 likes · 18 min read

Boosting Ad Efficiency with Baidu’s Multi‑Agent AI Architecture

NewBeeNLP

May 16, 2024 · Artificial Intelligence

How Large Language Models Transform Advertising Copy Generation

This article examines the adoption of large language models for intelligent advertising copy creation, detailing business challenges, model selection criteria, training data preparation, fine‑tuning methods, performance evaluation, deployment results, while highlighting the trade‑offs between model size, cost, and output quality.

AI marketingadvertising copyfine-tuning

0 likes · 20 min read

How Large Language Models Transform Advertising Copy Generation

DeWu Technology

May 15, 2024 · Artificial Intelligence

Accelerating Large Language Model Inference: Techniques and Framework Recommendations

Deploying a dedicated inference cluster and applying four key optimizations—FlashAttention‑based attention computation, PageAttention KV‑cache management, Mixture‑of‑Experts parameter reduction, and tensor parallelism—can accelerate large language model inference by up to 50% for models as large as 70 B parameters while cutting deployment costs.

FlashAttentionInference AccelerationMixture of Experts

0 likes · 17 min read

Accelerating Large Language Model Inference: Techniques and Framework Recommendations

Baidu Geek Talk

May 15, 2024 · Artificial Intelligence

Accelerating Large Model Training and Inference with Baidu Baige AIAK‑LLM: Challenges, Techniques, and Optimizations

The talk outlines how Baidu’s Baige AIAK‑LLM suite tackles the exploding compute demands of trillion‑parameter models by boosting Model FLOPS Utilization through advanced parallelism, memory‑saving recompute, zero‑offload, adaptive scheduling, and cross‑chip orchestration, delivering 30‑60% training and inference speedups and a unified cloud product.

AI infrastructureBaiduInference Optimization

0 likes · 25 min read

Accelerating Large Model Training and Inference with Baidu Baige AIAK‑LLM: Challenges, Techniques, and Optimizations

NewBeeNLP

May 15, 2024 · Artificial Intelligence

How Large Language Models and Knowledge Graphs Can Boost Each Other

This talk reviews recent advances in large language models, compares them with knowledge graphs, explores how LLMs enhance knowledge extraction and completion, examines how knowledge graphs aid LLM evaluation and safe deployment, and outlines future interactive integration between the two technologies.

AI researchKnowledge Graphsknowledge extraction

0 likes · 13 min read

How Large Language Models and Knowledge Graphs Can Boost Each Other

21CTO

May 11, 2024 · Artificial Intelligence

Will U.S. AI Export Controls Stall Global Large‑Model Development?

The United States is drafting a bipartisan bill to impose export controls on advanced proprietary AI models, aiming to shield its technology from China, Russia, North Korea and Iran, while confronting challenges of open‑source model regulation and potential geopolitical retaliation.

AI policyChinaExport controls

0 likes · 7 min read

Will U.S. AI Export Controls Stall Global Large‑Model Development?

Baidu Tech Salon

May 10, 2024 · Artificial Intelligence

Baidu Comate: Core Capabilities of Intelligent Code Assistant

The article surveys Baidu Comate, an AI‑powered code assistant built on the Wenxin (ERNIE) large model, tracing software development from the 1950s crisis through the internet and open‑source era to today’s AI‑driven tools, and highlights its features and demonstration at a global development conference.

AI codingBaidu ComateIDE plugin

0 likes · 7 min read

Baidu Comate: Core Capabilities of Intelligent Code Assistant

Architects' Tech Alliance

May 9, 2024 · Artificial Intelligence

AI Servers: Market Opportunities, Architecture, and Future Demand Driven by Generative AI

The article examines how the surge of generative AI (AIGC) is fueling rapid growth in AI server demand, detailing the emerging AIGC ecosystem, server hardware composition, model scaling, heterogeneous computing, training vs. inference workloads, market size forecasts, and the competitive landscape of AI server manufacturers.

AI infrastructureAI serversGPU

0 likes · 15 min read

AI Servers: Market Opportunities, Architecture, and Future Demand Driven by Generative AI

DataFunTalk

May 7, 2024 · Artificial Intelligence

Large Language Models and Knowledge Graphs: Recent Advances, Synergies, and Future Directions

This article reviews the rapid progress of large language models, compares them with knowledge graphs, explores how LLMs can aid knowledge extraction and completion, discusses how knowledge graphs can evaluate and enhance LLMs, and outlines future interactive integration between the two technologies.

AIKnowledge Graphsevaluation

0 likes · 12 min read

Large Language Models and Knowledge Graphs: Recent Advances, Synergies, and Future Directions

Rare Earth Juejin Tech Community

May 2, 2024 · Artificial Intelligence

Understanding Large Language Models: Principles, Training, Risks, and Application Security

This article provides a comprehensive overview of large language models (LLMs), explaining their core concepts, transformer architecture, training stages, known shortcomings such as hallucination and reversal curse, and highlights emerging security threats like prompt injection and jailbreaking, offering guidance for safe deployment.

AI safetyLLMjailbreaking

0 likes · 21 min read

Understanding Large Language Models: Principles, Training, Risks, and Application Security

21CTO

Apr 28, 2024 · Artificial Intelligence

5 Transformative Business Use Cases for Conversational AI

This article explores how conversational AI, powered by large language models, is reshaping enterprise operations across five key scenarios—from customer support assistants and AI‑driven data interfaces to HR bots, unstructured data processing, and multi‑agent digital assistants—highlighting benefits, implementation considerations, and privacy challenges.

Conversational AIbusiness applicationscustomer support

0 likes · 13 min read

5 Transformative Business Use Cases for Conversational AI

DataFunTalk

Apr 26, 2024 · Artificial Intelligence

Large Language Models in the Automotive Industry: Overview, Impact, and Practical Exploration

This article examines how large language models such as GPT and Transformer‑based architectures are reshaping the automotive sector by enhancing in‑vehicle intelligence, streamlining product development, improving customer service, and redefining data analyst roles, while also presenting practical experiments, deployment challenges, and future directions.

Automotive AIData AnalysisGPT

0 likes · 18 min read

Large Language Models in the Automotive Industry: Overview, Impact, and Practical Exploration

Sohu Tech Products

Apr 24, 2024 · Artificial Intelligence

Evolution, Architecture, Training Data, Methods, and Performance of Meta's Llama Series (Llama 1, 2, 3)

Meta's Llama series has progressed from the 7‑65B Llama‑1 in early 2023 to the 8B and 70B Llama‑3 in 2024, scaling token counts from 1 T to over 15 T, adopting decoder‑only Transformers with RMSNorm, SwiGLU, RoPE and GQA, and adding supervised fine‑tuning, RLHF and DPO, resulting in state‑of‑the‑art benchmark performance and a vibrant open‑source ecosystem.

AILLaMAPerformance Evaluation

0 likes · 25 min read

Evolution, Architecture, Training Data, Methods, and Performance of Meta's Llama Series (Llama 1, 2, 3)

21CTO

Apr 23, 2024 · Artificial Intelligence

Deploy Large Language Models with vLLM and Quantization for Low Latency

This guide explains how to deploy open‑source large language models using vLLM, benchmark latency and throughput, and apply 8‑bit/4‑bit quantization techniques such as BitsandBytes and NF4 to achieve faster inference on limited‑GPU hardware.

LLM deploymentPythonlarge language models

0 likes · 13 min read

Deploy Large Language Models with vLLM and Quantization for Low Latency

MoonWebTeam

Apr 23, 2024 · Artificial Intelligence

Exploring Devika AI: An Open‑Source AI Programmer’s Capabilities and Limits

Devika AI, an open‑source AI programmer from Stition AI, is examined for its architecture, supported actions, installation steps, and real‑world performance across tasks such as building a Snake game, Conway’s Game of Life, Vue3 components, and unit‑test generation, highlighting strengths, weaknesses, and future potential.

Devika AITool Evaluationlarge language models

0 likes · 21 min read

Exploring Devika AI: An Open‑Source AI Programmer’s Capabilities and Limits

NewBeeNLP

Apr 22, 2024 · Artificial Intelligence

Why LLAMA‑3’s Scaling Laws Signal the Next AI Frontier

The article analyzes LLAMA‑3’s architectural tweaks, massive data expansion, scaling‑law implications, open‑source versus closed‑source dynamics, and the critical role of synthetic data in sustaining large‑model progress beyond 2025.

LLAMA-3large language modelsopen-source AI

0 likes · 10 min read

Why LLAMA‑3’s Scaling Laws Signal the Next AI Frontier

Baobao Algorithm Notes

Apr 21, 2024 · Artificial Intelligence

Why Llama 3’s Open‑Source Release Could Redefine Large‑Model Scaling and Synthetic Data

The article analyzes Llama 3’s architecture, training data expansion, model variants, Meta’s open‑source strategy, the evolving gap between open and closed models, and how future breakthroughs in synthetic data will shape scaling laws and large‑model progress through 2025 and beyond.

AI trendsLlama3Open Source

0 likes · 12 min read

Why Llama 3’s Open‑Source Release Could Redefine Large‑Model Scaling and Synthetic Data

Xiaohe Frontend Team

Apr 21, 2024 · Artificial Intelligence

What’s New in Generative AI? VASA‑1, Llama‑3, Stable Diffusion 3 & More

The article reviews the latest breakthroughs in generative AI, including Microsoft’s VASA‑1 video synthesis model, Meta’s open‑source Llama‑3 large language model, Stability AI’s Stable Diffusion 3 API, Adobe’s integration of third‑party AI video tools into Premiere Pro, and a free image‑style‑recreation platform from Freepik, highlighting their technical details and potential applications.

AI toolsdiffusion modelsgenerative AI

0 likes · 13 min read

What’s New in Generative AI? VASA‑1, Llama‑3, Stable Diffusion 3 & More

AntTech

Apr 19, 2024 · Artificial Intelligence

OneKE: Open-Source Bilingual Knowledge Extraction Framework for Large Language Models

OneKE, an open‑source bilingual (Chinese‑English) knowledge extraction framework jointly developed by Ant Group and Zhejiang University, enables efficient extraction of entities, relations, and events to build domain knowledge graphs that enhance large language models’ reasoning, reduce hallucinations, and support applications in medical, financial, and governmental sectors.

Artificial IntelligenceKnowledge Graphsbilingual

0 likes · 5 min read

OneKE: Open-Source Bilingual Knowledge Extraction Framework for Large Language Models

DevOps

Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringRAGenterprise AI

0 likes · 20 min read

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

AntTech

Apr 17, 2024 · Artificial Intelligence

LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs

LLMRG introduces a novel framework that leverages large language models to construct personalized reasoning graphs, integrating chain reasoning, self‑verification, divergent extension, and knowledge‑base self‑improvement, thereby enhancing recommendation accuracy, interpretability, and performance across multiple benchmark datasets without additional user or item information.

AIInterpretabilityRecommendation Systems

0 likes · 9 min read

LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs

360 Tech Engineering

Apr 15, 2024 · Artificial Intelligence

Fine‑Tuning Large Language Models: A Practical Guide Using Qwen‑14B on the 360AI Platform

This article explains the concept, motivations, and step‑by‑step workflow for fine‑tuning large language models—specifically Qwen‑14B—covering data preparation, training commands with DeepSpeed, hyper‑parameter settings, evaluation, and deployment via FastChat, all illustrated with code snippets and configuration details.

AIDeepSpeedFastChat

0 likes · 10 min read

Fine‑Tuning Large Language Models: A Practical Guide Using Qwen‑14B on the 360AI Platform

Architects' Tech Alliance

Apr 13, 2024 · Industry Insights

Why AI Servers Are Poised for Explosive Growth: Trends, Architecture, and Demand Forecast

The article analyzes how the surge in AIGC and large language models is reshaping the AI server market, detailing hardware composition, the rise of heterogeneous computing, GPU advantages, demand calculations for models like GPT‑3, and the competitive landscape driving rapid industry growth.

AI serversAIGCGPU computing

0 likes · 16 min read

Why AI Servers Are Poised for Explosive Growth: Trends, Architecture, and Demand Forecast

DataFunSummit

Apr 13, 2024 · Artificial Intelligence

Understanding and Mitigating Hallucinations in Large Language Model Industry Q&A with Knowledge Graphs

This article examines why large language models often produce hallucinations in industry question‑answering, defines the phenomenon, explores its data and training origins, proposes evaluation metrics, and presents practical strategies—including high‑quality fine‑tuning data, honest refusal mechanisms, advanced decoding methods, and external knowledge‑graph augmentation—to reduce hallucinations and improve reliability.

AI evaluationhallucinationknowledge graph

0 likes · 21 min read

Understanding and Mitigating Hallucinations in Large Language Model Industry Q&A with Knowledge Graphs

NewBeeNLP

Apr 13, 2024 · Artificial Intelligence

How a Multimodal ‘Joke‑King’ Model Beats GPT‑4 at Humor Generation

A research team from Sun Yat‑sen University, Sea AI Lab and Harvard built a multimodal large model that learns to generate creative jokes and memes by training on the Oogiri‑GO dataset, introducing a Leap‑of‑Thought (LoT) paradigm and CLoT fine‑tuning, which outperforms GPT‑4 and other state‑of‑the‑art models in humor tasks.

CLoTLeap-of-ThoughtOogiri-GO dataset

0 likes · 9 min read

How a Multimodal ‘Joke‑King’ Model Beats GPT‑4 at Humor Generation

Data Thinking Notes

Apr 11, 2024 · Artificial Intelligence

How Financial Institutions Are Building Their Own Large Language Models

This article explores how the finance sector is creating specialized large language models—covering the shift from generic to domain‑specific models, training innovations, evaluation methods, and real‑world applications such as marketing, customer service, risk control, and operational analytics.

Applicationsfinance AIlarge language models

0 likes · 16 min read

How Financial Institutions Are Building Their Own Large Language Models

Cloud Native Technology Community

Apr 11, 2024 · Cloud Native

Why Kubernetes Is the Ideal Platform for Deploying Large Language Models

Deploying large language models demands massive compute, flexible scaling, and robust resource management, and this article explains how Kubernetes’s auto‑scaling, portability, cloud‑native features, observability tools, and multi‑tenant isolation make it the optimal platform for training, serving, and iterating LLM workloads.

Cloud NativeKubernetesResource Management

0 likes · 17 min read

Why Kubernetes Is the Ideal Platform for Deploying Large Language Models

DataFunSummit

Apr 9, 2024 · Artificial Intelligence

Knowledge Map for Large Model Application Development

This article outlines a comprehensive knowledge map for building large‑model applications, detailing a four‑layer technical architecture, development lifecycle, core elements such as prompt engineering and fine‑tuning, evaluation methods, and real‑world case studies across various AI use cases.

AI application developmentlarge language modelsmodel fine-tuning

0 likes · 12 min read

Knowledge Map for Large Model Application Development

NewBeeNLP

Apr 8, 2024 · Artificial Intelligence

What Will Recommendation Systems Look Like in 2026? Emerging Trends and Challenges

This article analyzes the current bottlenecks of conventional recommendation systems and outlines ten forward‑looking research directions for 2026, including retention improvement, user growth, content ecosystem, multi‑objective Pareto optimization, long‑term value estimation, site‑wide optimization, interactive recommendation, personalized modeling, decision‑theoretic framing, and the integration of large language models via the OneRec framework.

User Retentioninteractive recommendationlarge language models

0 likes · 18 min read

What Will Recommendation Systems Look Like in 2026? Emerging Trends and Challenges

NewBeeNLP

Apr 7, 2024 · Artificial Intelligence

Can Large Language Models Learn Recommendation Knowledge? A NL‑Simulated Auxiliary Task

This article reviews a recent study that bridges the knowledge gap between large language models and recommendation systems by generating natural‑language auxiliary tasks, fine‑tuning the models, and achieving notable performance gains on Amazon domain benchmarks.

AI researchfine-tuningknowledge injection

0 likes · 4 min read

Can Large Language Models Learn Recommendation Knowledge? A NL‑Simulated Auxiliary Task

DataFunTalk

Apr 4, 2024 · Artificial Intelligence

Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework

This article reviews the challenges of textual‑only large language model interaction, introduces benchmark environments such as AFL World and ScienceWorld, compares baseline reinforcement‑learning approaches, and presents SwiftSage—a hybrid system that combines a fast T5‑based small model with a powerful LLM for planning and grounding, demonstrating superior performance, efficiency, and cost‑effectiveness while outlining current limitations and future research directions.

AISwiftSageinteractive agents

0 likes · 22 min read

Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework

DataFunTalk

Apr 3, 2024 · Artificial Intelligence

Future Directions of Recommendation Systems: Retention, User Growth, Content Ecosystem, Multi‑Objective Optimization, and Large‑Model Fusion

This presentation outlines the current bottlenecks of conventional recommendation pipelines and proposes a 2026 roadmap that includes retention improvement, user‑growth strategies, content‑ecosystem metrics, Pareto‑optimal multi‑objective optimization, long‑term value modeling, site‑wide spatial optimization, interactive recommendation, personalized modeling, and the integration of large‑model fusion through the OneRec framework.

Recommendation SystemsUser Retentioninteractive recommendation

0 likes · 18 min read

Future Directions of Recommendation Systems: Retention, User Growth, Content Ecosystem, Multi‑Objective Optimization, and Large‑Model Fusion

DataFunTalk

Apr 2, 2024 · Artificial Intelligence

User Portrait Algorithms: From Ontology‑Based Methods to Deep Learning and Future Directions

This article provides a comprehensive overview of user portrait algorithms, covering their historical development, ontology‑based traditional approaches, deep‑learning enhancements, representation‑learning techniques such as lookalike, active‑learning driven iteration, and the integration of large‑model world knowledge, while also discussing current challenges and future research directions.

Recommendation Systemsactive learningdeep learning

0 likes · 26 min read

User Portrait Algorithms: From Ontology‑Based Methods to Deep Learning and Future Directions

DataFunTalk

Apr 1, 2024 · Artificial Intelligence

Large Model Applications in the Financial Sector: Practices, Knowledge Graphs, Ethics, and Ecosystem

This article presents a comprehensive overview of how large AI models are being applied in finance, covering development trends, practical use cases, knowledge‑graph integration, safety mechanisms, ethical considerations, and the evolving ecosystem of model‑centric solutions.

AI applicationsfinance AIknowledge graph

0 likes · 24 min read

Large Model Applications in the Financial Sector: Practices, Knowledge Graphs, Ethics, and Ecosystem

DataFunSummit

Mar 31, 2024 · Artificial Intelligence

Challenges and Techniques in Distributed Training of Large Language Models

This article reviews the rapid development of large language models since 2019, outlines the historical background, identifies key challenges such as massive compute demand, memory constraints, and system complexity, and then details distributed training technologies—including data parallelism, pipeline parallelism, and advanced optimization strategies—while also discussing future research directions and answering common questions.

AI infrastructureData ParallelismDeepSpeed

0 likes · 23 min read

Challenges and Techniques in Distributed Training of Large Language Models

DaTaobao Tech

Mar 29, 2024 · Artificial Intelligence

Text-to-SQL with Large Language Models: DIN-SQL Approach

The DIN‑SQL approach enhances Text‑to‑SQL performance by using large language models in a decomposed in‑context learning framework with schema linking, query classification, SQL generation, and self‑correction modules, achieving state‑of‑the‑art 85.3% execution accuracy on the Spider benchmark by breaking complex queries into manageable sub‑tasks.

AI researchData AnalysisDatabase Querying

0 likes · 34 min read

Text-to-SQL with Large Language Models: DIN-SQL Approach

Sohu Tech Products

Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI accelerationNVIDIANeMo Framework

0 likes · 16 min read

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

Alibaba Cloud Big Data AI Platform

Mar 26, 2024 · Artificial Intelligence

MoE LLMs: How Alibaba Cloud & NVIDIA Megatron-Core Accelerate Training

This article reviews the evolution of Mixture-of-Experts (MoE) models, details Alibaba Cloud’s collaboration with NVIDIA’s Megatron-Core to build a high-performance MoE framework, and presents extensive training optimizations, benchmark results, conversion tools, and best-practice guidelines for large-scale LLM development and deployment.

Alibaba CloudMegatron-CoreMoE

0 likes · 18 min read

MoE LLMs: How Alibaba Cloud & NVIDIA Megatron-Core Accelerate Training

NewBeeNLP

Mar 21, 2024 · Artificial Intelligence

Mastering Large Language Model Training: Key Challenges and Optimization Strategies

This article examines the resource and efficiency challenges of scaling large language model training, explains data, model, pipeline, and tensor parallelism, and provides practical I/O, communication, and stability optimization techniques—including high‑availability storage, RDMA networking, NCCL tuning, and fault‑tolerant recovery—to improve throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization

0 likes · 15 min read

Mastering Large Language Model Training: Key Challenges and Optimization Strategies

TAL Education Technology

Mar 20, 2024 · Artificial Intelligence

Understanding AI: From Brain Differences to Data Science Practices and Large Model Applications

This article explains why current AI cannot achieve self‑awareness, outlines data‑science steps for large models—including preprocessing, exploratory analysis, modeling, and evaluation—then surveys general and vertical applications of large language models and details a complete machine‑learning workflow with transformer fine‑tuning techniques.

AIApplicationsMachine Learning

0 likes · 14 min read

Understanding AI: From Brain Differences to Data Science Practices and Large Model Applications