Tagged articles
1067 articles
Page 10 of 11
DataFunTalk
DataFunTalk
Mar 20, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Large Language Model Training

The article outlines the resource and efficiency challenges of scaling large language models, explains data and model parallelism strategies, and details practical I/O, communication, and stability optimizations—including high‑availability storage, RDMA networking, and fault‑tolerance measures—to improve training throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization
0 likes · 13 min read
Challenges and Optimization Techniques for Large Language Model Training
DataFunTalk
DataFunTalk
Mar 17, 2024 · Artificial Intelligence

Leveraging Large Language Models to Enhance Comprehensive Graph Learning Capabilities

In this talk, researcher Jiang Zhuoren from Zhejiang University reviews the current state of large language models applied to graph learning, discusses their roles across various graph scenarios, and outlines promising research directions for unified cross‑domain graph learning.

Artificial Intelligencecross-domain learninggraph learning
0 likes · 3 min read
Leveraging Large Language Models to Enhance Comprehensive Graph Learning Capabilities
Model Perspective
Model Perspective
Mar 16, 2024 · Artificial Intelligence

What Watching a TV Drama Reveals About AI Model Training and Learning Strategies

The article draws parallels between expert viewers dissecting the drama "The Legend of Zhen Huan," efficient paper‑reading techniques, and the active‑prediction plus contrast‑learning approach that underpins modern AI model training, highlighting how proactive thinking boosts both personal and machine learning outcomes.

AI trainingPredictionactive learning
0 likes · 8 min read
What Watching a TV Drama Reveals About AI Model Training and Learning Strategies
DataFunSummit
DataFunSummit
Mar 14, 2024 · Artificial Intelligence

Multi‑Level Efficiency Challenges and Emerging Paradigms for Large AI Models

The article examines how large AI models are moving toward a unified, low‑knowledge‑density paradigm that raises computational efficiency challenges across model, algorithm, framework, and infrastructure layers, while also highlighting NVIDIA's GTC 2024 China AI Day sessions that showcase practical solutions and upcoming training opportunities.

AI conferencesAI infrastructureNVIDIA GTC
0 likes · 10 min read
Multi‑Level Efficiency Challenges and Emerging Paradigms for Large AI Models
21CTO
21CTO
Mar 12, 2024 · Artificial Intelligence

How Google’s ‘Social Learning’ AI Framework Boosts Privacy‑Safe Model Training

Google’s newly unveiled “Social Learning” AI framework lets large models teach each other via natural language, improving task performance while avoiding direct use of sensitive data, and uses teacher‑student interactions, synthetic data, and instruction generation to enhance privacy‑preserving model training.

AIlarge language modelsprivacy
0 likes · 4 min read
How Google’s ‘Social Learning’ AI Framework Boosts Privacy‑Safe Model Training
DataFunTalk
DataFunTalk
Mar 10, 2024 · Artificial Intelligence

Aligning Graph Models with Large Language Models for Open-Task Scenarios

This talk presents GraphTranslator, a framework that bridges pretrained graph models and large language models to enable unified handling of both predefined and open-ended graph analysis tasks by translating node representations into language tokens and training an alignment producer for node‑text pairs.

AI researchgraph neural networkslarge language models
0 likes · 3 min read
Aligning Graph Models with Large Language Models for Open-Task Scenarios
NewBeeNLP
NewBeeNLP
Mar 10, 2024 · Industry Insights

What WWW'24 Papers Reveal About LLMs in Search & Recommendation

This overview summarizes six WWW 2024 industry papers that apply large language models to e‑commerce search, personalized query suggestion, article recommendation, collaborative filtering, and lifelong sequential behavior understanding, highlighting their methods, experimental results, deployment status, and emerging trends in LLM‑driven search and recommendation.

LLMSearchWWW2024
0 likes · 16 min read
What WWW'24 Papers Reveal About LLMs in Search & Recommendation
DataFunTalk
DataFunTalk
Mar 7, 2024 · Artificial Intelligence

Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework and Benchmark Analysis

This article reviews recent advances in using large language models for interactive embodied agents, introduces the SwiftSage dual‑model framework that combines a fast T5‑based small model with a powerful LLM for planning, evaluates it on benchmarks such as AFL World and ScienceWorld, and discusses efficiency, cost‑effectiveness, limitations, and future research directions.

AISwiftSageinteractive agents
0 likes · 23 min read
Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework and Benchmark Analysis
Model Perspective
Model Perspective
Mar 6, 2024 · Fundamentals

Why Managing a City Is Like Designing a Spaceship: Exploring Complex Systems

An insightful look at how both spacecraft design and city governance exemplify complex systems, distinguishing closed versus open systems, outlining characteristics of complex and mega-complex systems, and linking these concepts to system engineering pioneers like Qian Xuesen and modern large language models.

Qian Xuesenlarge language modelsopen vs closed systems
0 likes · 9 min read
Why Managing a City Is Like Designing a Spaceship: Exploring Complex Systems
DataFunSummit
DataFunSummit
Mar 6, 2024 · Artificial Intelligence

Document Intelligence: Background, Technology, Large Models, and Enterprise Applications

This article presents a comprehensive overview of document intelligence, covering its background, technical evolution, large‑model advancements, and practical enterprise digital transformation use cases, with a focus on multimodal processing, unified document representation, and industry‑specific applications such as legal contract automation.

Document IntelligenceEnterprise Automationlarge language models
0 likes · 14 min read
Document Intelligence: Background, Technology, Large Models, and Enterprise Applications
Efficient Ops
Efficient Ops
Feb 27, 2024 · Artificial Intelligence

Can Large Language Models Truly Elevate Software Engineering? Insights and Roadmap

This article reviews the 2023 surge of large language models in software engineering, evaluates their current code generation, testing, and knowledge‑query capabilities, highlights persistent challenges in design and maintenance, and proposes concrete recommendations for advancing toward higher‑level intelligent development.

code generationdigital twinsgenerative AI
0 likes · 21 min read
Can Large Language Models Truly Elevate Software Engineering? Insights and Roadmap
NewBeeNLP
NewBeeNLP
Feb 17, 2024 · Artificial Intelligence

How Sora Highlights the Next Leap Toward AGI and Shifts AI Competition

The article analyzes OpenAI's Sora video model, arguing that its integration of large‑language‑model reasoning with diffusion techniques marks a major step toward true world understanding, reshapes creative workflows, widens the AI talent gap, and accelerates the path to artificial general intelligence.

AGIAI trendsSora
0 likes · 7 min read
How Sora Highlights the Next Leap Toward AGI and Shifts AI Competition
NewBeeNLP
NewBeeNLP
Feb 11, 2024 · Industry Insights

What 2023 Taught Us About LLMs and AI‑Guided Optimization

The author reviews a year of rapid progress in large language models, highlighting breakthrough papers such as Positional Interpolation, StreamingLLM, Deja Vu, and RLCD, and discusses how AI‑guided optimization techniques like SurCo, LANCER, and GenCo are reshaping research and industry applications.

LLMTransformersai-optimization
0 likes · 13 min read
What 2023 Taught Us About LLMs and AI‑Guided Optimization
DataFunTalk
DataFunTalk
Feb 10, 2024 · Artificial Intelligence

Mitigating Hallucinations in Large Language Model Applications with Knowledge Graphs

This article examines the challenges of using large language models for industry Q&A, defines hallucination phenomena, evaluates their causes and impact, and proposes a set of strategies—including high‑quality fine‑tuning data, honest alignment, advanced decoding, and external knowledge‑graph augmentation—to reduce hallucinations and improve answer reliability.

hallucinationknowledge graphlarge language models
0 likes · 21 min read
Mitigating Hallucinations in Large Language Model Applications with Knowledge Graphs
Cloud Native Technology Community
Cloud Native Technology Community
Feb 8, 2024 · Artificial Intelligence

How Retrieval‑Augmented Generation Boosts LLM Accuracy and Trust

Retrieval‑augmented generation (RAG) enhances large language models by fetching up‑to‑date, authoritative information from external sources, addressing hallucinations, outdated knowledge, and lack of citations, while offering cost‑effective implementation, improved relevance, user trust, and greater developer control through vector databases, semantic search, and prompt engineering.

AIRAGlarge language models
0 likes · 10 min read
How Retrieval‑Augmented Generation Boosts LLM Accuracy and Trust
DataFunSummit
DataFunSummit
Feb 5, 2024 · Artificial Intelligence

Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models

Ant Group shares its comprehensive knowledge graph initiatives, detailing the fundamentals, construction pipeline, fusion techniques, cognitive representations, diverse business applications, and the emerging synergy between knowledge graphs and large language models, illustrating how graph-based AI enhances accuracy, interpretability, and downstream services.

Artificial IntelligenceGraph FusionSemantic Modeling
0 likes · 14 min read
Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models
MaGe Linux Operations
MaGe Linux Operations
Jan 31, 2024 · Artificial Intelligence

Does Gemini Pro Really Outperform GPT‑4? A Deep Comparative Review

This article critically examines Google’s Gemini Pro against OpenAI’s GPT‑4 across reasoning, vision, token limits, benchmark data, and real‑world tasks, revealing where Gemini excels, where it falls short, and what to expect from the upcoming Gemini Ultra.

AI model comparisonGPT-4Gemini Pro
0 likes · 13 min read
Does Gemini Pro Really Outperform GPT‑4? A Deep Comparative Review
DataFunTalk
DataFunTalk
Jan 31, 2024 · Artificial Intelligence

Industry Trends and Challenges of Large Language Models in Enterprise Applications (2023 Review)

The article reviews the rapid development of large language models in enterprise settings, covering internal collaboration tools, AI assistants for development and marketing, multimodal generation, inference speed bottlenecks, resource constraints, and future directions such as open‑source models and academic‑industry cooperation.

AI assistantsAI in marketingInference Performance
0 likes · 8 min read
Industry Trends and Challenges of Large Language Models in Enterprise Applications (2023 Review)
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 29, 2024 · Artificial Intelligence

Unlocking Sparse MoE Large Model Training with Megatron-Core on Alibaba Cloud

This article explains how Alibaba Cloud's PAI platform and NVIDIA's Megatron-Core enable efficient training of sparse Mixture-of-Experts (MoE) large language models, covering algorithm basics, the Megatron-Core MoE framework, weight conversion pipelines, and performance results on Mixtral‑8x7B.

Megatron-CoreMixture of ExpertsModel Parallelism
0 likes · 18 min read
Unlocking Sparse MoE Large Model Training with Megatron-Core on Alibaba Cloud
ZhongAn Tech Team
ZhongAn Tech Team
Jan 22, 2024 · Artificial Intelligence

Weekly Tech Overview: Major Industry Updates and AI Insights

This weekly tech overview summarizes major industry developments, including Huawei's HarmonyOS NEXT release, SenseTime's open‑source large language model InternLM2, the Apple‑Epic App Store dispute resolution, Xiaomi's 5G satellite terminal approval, Microsoft overtaking Apple in market value, and recent AI energy consumption concerns.

AIHarmonyOSIndustry Updates
0 likes · 10 min read
Weekly Tech Overview: Major Industry Updates and AI Insights
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jan 20, 2024 · Artificial Intelligence

Decoding Xiaohongshu’s Recommendation System: How Ordinary Users Gain Visibility

Xiaohongshu’s recommendation system uses large‑scale multimodal embeddings, dual‑tower and graph models, and diversity techniques like DPP and SSD to quickly surface high‑quality user‑generated content, enabling ordinary users to gain visibility while balancing personalization, exploration, and efficient LLM‑augmented pipelines.

Xiaohongshucold startcontent diversity
0 likes · 15 min read
Decoding Xiaohongshu’s Recommendation System: How Ordinary Users Gain Visibility
Cognitive Technology Team
Cognitive Technology Team
Jan 17, 2024 · Artificial Intelligence

Redis Founder antirez Reflects on Large Language Models in 2024

In his first 2024 blog post, Redis founder antirez shares a programmer's perspective on large language models, sharply critiques Google's search engine, evaluates current AIGC as both foolish and historically knowledgeable, and argues that generative AI mainly amplifies the abilities of already strong developers.

AI CommentaryRedislarge language models
0 likes · 2 min read
Redis Founder antirez Reflects on Large Language Models in 2024
21CTO
21CTO
Jan 14, 2024 · Artificial Intelligence

Can Large Language Models Really Boost Programming Productivity? Insights from Redis Founder

The article reflects on the Redis founder's 2024 blog about large language models, examining their strengths and limits in software development, illustrating how they can accelerate coding for experienced programmers while highlighting challenges in system programming and the need for careful prompt engineering.

AI programminglarge language modelsproductivity
0 likes · 19 min read
Can Large Language Models Really Boost Programming Productivity? Insights from Redis Founder
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jan 3, 2024 · Artificial Intelligence

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Ghost Attention, RLHF Results, and Safety Evaluation

This article summarizes the Llama 2 series, describing the Ghost Attention technique for maintaining system‑message consistency across multi‑turn dialogs, presenting RLHF and human evaluation results, and discussing extensive safety pre‑training, benchmark assessments, and model release details.

AI evaluationGhost AttentionLlama-2
0 likes · 20 min read
Llama 2: Open Foundation and Fine‑Tuned Chat Models – Ghost Attention, RLHF Results, and Safety Evaluation
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Dec 29, 2023 · Information Security

OPPO Releases White Paper on Mobile Application Trustworthy Technology at CAICT ICT+ Deep Observation Conference

At the CAICT ICT+ Deep Observation Conference, OPPO unveiled a white paper on mobile application trustworthy technology, analyzing lifecycle security risks, policy and patent developments, and the role of large‑model AI in intelligent terminals, while urging standardized security practices and accelerated AI‑driven vulnerability detection tools.

CAICTIntelligent TerminalsMobile Application Security
0 likes · 4 min read
OPPO Releases White Paper on Mobile Application Trustworthy Technology at CAICT ICT+ Deep Observation Conference
OPPO Amber Lab
OPPO Amber Lab
Dec 29, 2023 · Information Security

Large Models Transform Mobile App Security – Key Takeaways from OPPO’s White Paper

The 2024 China Academy of ICT deep‑observation summit in Shanghai unveiled OPPO’s new white paper on trustworthy mobile application technology, highlighting how large language models enhance smart terminal security, outlining industry trends, and outlining future directions for secure, intelligent mobile ecosystems.

OPPOSoftware Securitylarge language models
0 likes · 6 min read
Large Models Transform Mobile App Security – Key Takeaways from OPPO’s White Paper
DataFunTalk
DataFunTalk
Dec 25, 2023 · Artificial Intelligence

Tool Learning with Foundation Models: Frameworks, Datasets, and Open‑Source Toolkits

This article reviews the emerging field of tool learning for large foundation models, outlining its background, categorization, core framework components, training strategies, and applications such as WebCPM, BMTools, and ToolBench, while highlighting recent research results and open‑source resources.

AI toolsfoundation modelslarge language models
0 likes · 21 min read
Tool Learning with Foundation Models: Frameworks, Datasets, and Open‑Source Toolkits
Java High-Performance Architecture
Java High-Performance Architecture
Dec 22, 2023 · Artificial Intelligence

Is Google Gemini Echoing Baidu? A Deep Dive into Model Contamination

The article investigates recent tests showing that Google Gemini sometimes claims to be Baidu's AI, reproduces Baidu‑related responses, and appears to have its Chinese and English corpora contaminated with competitor data, highlighting the challenges of data provenance in large language models.

AI model contaminationAI testingBaidu Wenxin
0 likes · 6 min read
Is Google Gemini Echoing Baidu? A Deep Dive into Model Contamination
DataFunTalk
DataFunTalk
Dec 21, 2023 · Artificial Intelligence

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023

At EMNLP 2023, the joint WeChat AI and Peking University paper 'Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning' won the Best Long Paper award, revealing that label tokens act as anchors driving information aggregation in shallow layers and prediction flow in deep layers, and proposing methods to improve and diagnose in‑context learning.

AI researchIn-Context LearningInformation Flow
0 likes · 13 min read
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023
DataFunTalk
DataFunTalk
Dec 19, 2023 · Artificial Intelligence

Enterprise Large‑Model Deployment and Data Governance: Insights from Deepexi’s President

The article examines how enterprises can adopt domain‑specific large models by balancing demand‑side cost‑reduction needs with supply‑side mature training techniques, discusses team composition, fine‑tuning methods, data governance for unstructured data, and outlines Deepexi’s product ecosystem designed to improve efficiency, performance, and user experience.

AI deploymentcost economicsenterprise AI
0 likes · 13 min read
Enterprise Large‑Model Deployment and Data Governance: Insights from Deepexi’s President
21CTO
21CTO
Dec 17, 2023 · Artificial Intelligence

Why AI‑Native Apps Matter: Insights from Baidu, ByteDance Ban, and New PHP Server

The article examines Baidu CEO Li Yanhong’s call to focus on AI‑native applications, reports ByteDance’s suspension by OpenAI for misusing GPT, outlines Google’s phased removal of third‑party cookies, and announces the release of the Go‑based PHP server FrankenPHP 1.0.

AI-native applicationsPHP serverlarge language models
0 likes · 7 min read
Why AI‑Native Apps Matter: Insights from Baidu, ByteDance Ban, and New PHP Server
DataFunSummit
DataFunSummit
Dec 14, 2023 · Artificial Intelligence

Enterprise Large‑Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics

The article examines how enterprises can adopt domain‑specific large language models by addressing data governance, model fine‑tuning techniques, dataset balance, and product architecture to achieve cost‑effective, high‑performance AI solutions across various business scenarios.

Model Fine‑tuningcost efficiencylarge language models
0 likes · 14 min read
Enterprise Large‑Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Dec 14, 2023 · Artificial Intelligence

Unlocking LLaMA: Key Innovations, Architecture Insights, and MindSpore Inference Guide

This article reviews the LLaMA large‑language‑model series, covering its background, architectural innovations such as Add&Norm, SwiGLU, and RoPE, a known reversal‑curse bug, and provides step‑by‑step MindSpore Transformers code for model configuration, inference, and pipeline usage while previewing the upcoming LLaMA‑2 session.

AILLaMAMindSpore
0 likes · 6 min read
Unlocking LLaMA: Key Innovations, Architecture Insights, and MindSpore Inference Guide
DataFunTalk
DataFunTalk
Dec 12, 2023 · Artificial Intelligence

Challenges and Considerations of Recommendation Systems: Evaluation, Data Leakage, and the Role of Large Models

This article examines recommendation system problem definitions, differences between academia and industry, offline evaluation pitfalls and data leakage issues, data construction challenges with datasets like MovieLens, and evaluates whether large language models can serve as effective solutions for modern recommendation tasks.

Machine LearningRecommendation Systemsdata leakage
0 likes · 20 min read
Challenges and Considerations of Recommendation Systems: Evaluation, Data Leakage, and the Role of Large Models
21CTO
21CTO
Dec 7, 2023 · Artificial Intelligence

Google Gemini vs GPT‑4: Can the New AI Model Outperform ChatGPT?

Google's Gemini AI suite, unveiled in December, brings three model sizes—Nano, Pro, and Ultra—to power Bard and other services, claims superior performance over GPT‑4 across most benchmarks, and introduces multimodal capabilities that signal a major shift in the AI landscape.

AI language modelGPT-4 comparisonGoogle Gemini
0 likes · 6 min read
Google Gemini vs GPT‑4: Can the New AI Model Outperform ChatGPT?
JD Tech
JD Tech
Nov 30, 2023 · Artificial Intelligence

Understanding ChatGPT: Mechanisms, Attention, Emergence, and the Chinese Room

This article examines the principles behind ChatGPT, detailing its continuation-based operation, the role of attention mechanisms and transformer architecture, the scaling of neural networks that leads to emergent abilities, and interprets these phenomena through the lenses of compression theory and the Chinese Room thought experiment.

ChatGPTattention mechanismcompression
0 likes · 27 min read
Understanding ChatGPT: Mechanisms, Attention, Emergence, and the Chinese Room
AntTech
AntTech
Nov 24, 2023 · Artificial Intelligence

Code Model Evaluation Framework and the CodeFuseEval Benchmark Overview

This article presents a comprehensive overview of code large‑model evaluation, describing the need for multi‑dimensional benchmarks, the CodeFuseEval benchmark suite, dataset construction, evaluation methods, framework architecture, result visualisation, and future directions for enterprise‑grade code generation models.

AICodeFuseEvalbenchmark
0 likes · 12 min read
Code Model Evaluation Framework and the CodeFuseEval Benchmark Overview
Ant R&D Efficiency
Ant R&D Efficiency
Nov 24, 2023 · Artificial Intelligence

CodeFuseEval: An Enterprise‑Level Multi‑Task Benchmark for Evaluating Code Large Models

CodeFuseEval is an enterprise‑grade, multi‑task benchmark that evaluates code‑generation large models across six languages and thousands of real‑world tasks using both objective metrics (pass@k, BLEU, CodeBLEU) and expert human review, with an open‑source framework, continuous dataset expansion, and a focus on correctness, efficiency, robustness, and service‑level quality.

AIbenchmarkcode generation
0 likes · 12 min read
CodeFuseEval: An Enterprise‑Level Multi‑Task Benchmark for Evaluating Code Large Models
DataFunTalk
DataFunTalk
Nov 21, 2023 · Artificial Intelligence

Improving Efficiency of Large-Scale Distributed Training for Large Language Models

Recent advances in large language models have dramatically increased model size and training data, leading to soaring computational costs; this article examines the scaling trends, hardware utilization challenges, distributed training techniques, and ethical considerations, highlighting methods to improve efficiency, reduce costs, and mitigate environmental impact.

AI ethicsEfficiencycompute optimization
0 likes · 29 min read
Improving Efficiency of Large-Scale Distributed Training for Large Language Models
Baobao Algorithm Notes
Baobao Algorithm Notes
Nov 21, 2023 · Artificial Intelligence

How Much Data Do You Need for a 10B LLM? Decoding Scaling Laws

This article explains how scaling laws can answer common LLM development questions—such as the data required for a 10B model, the model size achievable with 1 TB of data, and the optimal compute‑data‑model trade‑off for a fixed GPU budget—by presenting core formulas, practical derivations, and insights from OpenAI, DeepMind and Google.

Compute EfficiencyData RequirementsModel Size
0 likes · 12 min read
How Much Data Do You Need for a 10B LLM? Decoding Scaling Laws
360 Smart Cloud
360 Smart Cloud
Nov 20, 2023 · Artificial Intelligence

Overview of Recent Open‑Source AI Models and Tools (November 2023)

This article summarizes a collection of newly released open‑source AI projects covering natural‑language processing, multimodal processing, intelligent agents, recommendation systems, and model training acceleration, providing brief descriptions, key capabilities, and links to their repositories.

AIMultimodalRecommendation Systems
0 likes · 9 min read
Overview of Recent Open‑Source AI Models and Tools (November 2023)
Ximalaya Technology Team
Ximalaya Technology Team
Nov 16, 2023 · Artificial Intelligence

How AI Agents Turn One-Line Prompts Into Fully Functional Apps in Minutes

ChatDev, an AI‑driven software development platform, claims to create complete applications from a single prompt in about three minutes and at a cost of roughly two yuan, leveraging a multi‑agent workflow, a custom 100‑billion‑parameter model, and open‑source frameworks to dramatically cut development time and expense.

AI AgentsChatDevindustry analysis
0 likes · 13 min read
How AI Agents Turn One-Line Prompts Into Fully Functional Apps in Minutes
Architect
Architect
Nov 8, 2023 · Artificial Intelligence

AI Agents Unleashed: From Assistants API to Multi‑Agent Frameworks

The article dissects the rise of AI agents—from OpenAI's Assistants API and multimodal perception‑brain‑action pipelines to retrieval‑augmented generation, tool‑use strategies, single‑ and multi‑agent deployments, and emerging frameworks like AutoGen—while highlighting concrete examples, benchmark results, and current limitations.

AI AgentsAssistants APIEmbodied AI
0 likes · 38 min read
AI Agents Unleashed: From Assistants API to Multi‑Agent Frameworks
Tencent Cloud Developer
Tencent Cloud Developer
Nov 8, 2023 · Artificial Intelligence

Comprehensive Overview of AI Agents: Concepts, Technical Frameworks, and Applications

The article surveys modern AI agents—software entities powered by large language models that perceive multimodal inputs, reason via brain modules, act through tools or embodied actions, employ retrieval‑augmented generation and chain‑of‑thought planning, and can operate singly (e.g., AutoGPT) or collaboratively via frameworks like Microsoft’s AutoGen—while highlighting current challenges such as controllability, memory limits, parallelism, and reliability.

AI AgentsAgent ArchitectureAutoGen
0 likes · 34 min read
Comprehensive Overview of AI Agents: Concepts, Technical Frameworks, and Applications
DataFunSummit
DataFunSummit
Nov 5, 2023 · Artificial Intelligence

Enhancing Recommendation Models with Scaling Law via HCNet and MemoNet: A Memory‑Based Feature‑Combination Approach

This article presents a memory‑driven architecture (HCNet and MemoNet) that equips recommendation models with scaling‑law characteristics by storing and retrieving arbitrary feature‑combination embeddings, evaluates multi‑hash codebooks, memory‑restoring strategies, key‑feature selection, and demonstrates significant offline and online performance gains.

Scaling Lawfeature interactionlarge language models
0 likes · 15 min read
Enhancing Recommendation Models with Scaling Law via HCNet and MemoNet: A Memory‑Based Feature‑Combination Approach
Model Perspective
Model Perspective
Nov 2, 2023 · Artificial Intelligence

Why Mathematical Modelers Must Embrace LLMs and Forget Outdated Skills

The article explains how rapid advances in data and large language models force mathematical modelers to continuously update their models and skills, discard obsolete knowledge, and adopt lifelong learning to stay effective in a fast‑changing AI‑driven environment.

Artificial Intelligencecontinuous learningdata-science
0 likes · 6 min read
Why Mathematical Modelers Must Embrace LLMs and Forget Outdated Skills
Baidu Geek Talk
Baidu Geek Talk
Nov 2, 2023 · Artificial Intelligence

AI-Powered Code Defect Detection: Leveraging Code Knowledge Graphs and Large Language Models

The paper presents an AI‑driven static analysis framework that builds code knowledge graphs to extract relevant slices and leverages large language models for multilingual defect prediction, achieving up to 80% F1, detecting 662 defects across 1,100 C++ modules with a 26.9% recall gain over traditional rule‑based scanners.

BERTSoftware qualitycode defect detection
0 likes · 9 min read
AI-Powered Code Defect Detection: Leveraging Code Knowledge Graphs and Large Language Models
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 1, 2023 · Databases

How BES Powers Large-Scale Vector Search for AI Applications

This article explains the principles of vector databases, outlines the engineering practices of Baidu Intelligent Cloud BES for large‑scale vector retrieval, discusses optimization techniques such as HNSW, IVF and filter integration, and presents real‑world AI use cases and future development directions.

AIBESElasticsearch
0 likes · 16 min read
How BES Powers Large-Scale Vector Search for AI Applications
DataFunSummit
DataFunSummit
Oct 30, 2023 · Artificial Intelligence

Exploring General AI, Large Language Models, Knowledge Graphs, and Reinforcement Learning – Insights from DataFun

This article presents a comprehensive overview of DaGuan Data's explorations in general artificial intelligence, large language models, knowledge graphs, reinforcement learning, compute and data requirements, and the emerging concept of Human‑Centric AGI, supplemented by a detailed Q&A session.

AGIArtificial IntelligenceKnowledge Graphs
0 likes · 18 min read
Exploring General AI, Large Language Models, Knowledge Graphs, and Reinforcement Learning – Insights from DataFun
DataFunSummit
DataFunSummit
Oct 27, 2023 · Artificial Intelligence

ChatGPT Technology, Domesticization Attempts, and Open‑Source Large Models

This article reviews the evolution and challenges of ChatGPT technology, describes the authors' efforts to localize and commercialize the model for the Chinese market, and introduces their open‑source Chinese large‑model initiative, including training methods, performance gaps, and future improvement directions.

ChatGPTChinese NLPModel Localization
0 likes · 11 min read
ChatGPT Technology, Domesticization Attempts, and Open‑Source Large Models
Baidu Tech Salon
Baidu Tech Salon
Oct 25, 2023 · Artificial Intelligence

Intelligent Question Answering Technology in Baidu Search: Development, Modeling, and Retrieval‑Enhanced Generation

The article surveys Baidu Search’s intelligent question‑answering system, tracing its evolution from feature‑engineered retrieval to large pre‑trained and generative models, and detailing hierarchical readers, multi‑teacher distillation, retrieval‑enhanced generation, and instruction decomposition as key techniques for delivering fast, accurate, citation‑rich answers.

Baidu SearchKnowledge DistillationRetrieval-Augmented Generation
0 likes · 18 min read
Intelligent Question Answering Technology in Baidu Search: Development, Modeling, and Retrieval‑Enhanced Generation
Baidu Geek Talk
Baidu Geek Talk
Oct 25, 2023 · Artificial Intelligence

How Baidu Search Is Transforming Machine Question Answering with Large‑Scale AI Models

This article reviews the evolution of machine question answering, from early feature‑engineered systems to modern large‑language‑model‑driven retrieval‑augmented generation, outlines Baidu Search’s current Retriever‑Reader architecture, discusses challenges such as semantic complexity, latency and answer quality, and presents solutions including hierarchical DocMRC modeling, multi‑teacher knowledge distillation, and instruction decomposition for efficient, high‑quality answers.

BaiduKnowledge DistillationRetrieval-Augmented Generation
0 likes · 18 min read
How Baidu Search Is Transforming Machine Question Answering with Large‑Scale AI Models
DataFunTalk
DataFunTalk
Oct 25, 2023 · Artificial Intelligence

Applying Large Language Models to Wireless Network Intelligent Operations: Opportunities, Challenges, and Platform Construction

This article examines how large language model technology can be leveraged for intelligent operation of wireless communication networks, analyzing its advantages, current challenges, platform architecture, experimental validation, and future research directions within the telecom industry.

AIintelligent operationknowledge graph
0 likes · 17 min read
Applying Large Language Models to Wireless Network Intelligent Operations: Opportunities, Challenges, and Platform Construction
Zuoyebang Tech Team
Zuoyebang Tech Team
Oct 19, 2023 · Artificial Intelligence

How AI and Big Data Are Transforming Education: Insights from Zuoyebang’s Chief Scientist

At the GET2023 Education Technology Conference, Zuoyebang’s chief scientist Song Yang detailed how AI, large language models, big data, and smart hardware are reshaping learning experiences across subjects, from math problem generation to interactive programming assistants, and outlined the company’s vision for AI‑driven education.

AI in EducationEducational Technologylarge language models
0 likes · 12 min read
How AI and Big Data Are Transforming Education: Insights from Zuoyebang’s Chief Scientist
Alimama Tech
Alimama Tech
Oct 18, 2023 · Artificial Intelligence

Technical Challenges and Directions for Large‑Model Applications in E‑commerce

Taobao Group’s ten large‑model challenges target e‑commerce AI by demanding domain‑specific pre‑training, multi‑step reasoning, extended context handling, factual reliability, intelligent tool orchestration, robust retrieval integration, fuzzy‑intent tool selection, scalable multi‑objective RLHF, improved query rewriting, and knowledge‑driven recommendation.

RLHFe-commerceknowledge hallucination
0 likes · 16 min read
Technical Challenges and Directions for Large‑Model Applications in E‑commerce
DaTaobao Tech
DaTaobao Tech
Oct 18, 2023 · Artificial Intelligence

Large Model Application Challenges for E-commerce

Taobao Group’s ten large‑model e‑commerce challenges call for researchers to build domain‑specific data pipelines, mitigate forgetting, balance expertise with generality, enable multi‑step reasoning, handle long contexts, reduce hallucinations, integrate tool use, improve fuzzy intent detection, apply multi‑objective RLHF, and generate cognitively novel recommendations.

RLHFknowledge hallucinationlarge language models
0 likes · 14 min read
Large Model Application Challenges for E-commerce
Baidu Geek Talk
Baidu Geek Talk
Oct 16, 2023 · Industry Insights

What Is AI‑Native Thinking and Why It Will Shape the Next Wave of Applications

The article explores the concept of AI‑native thinking, outlines the mindset and conditions needed for AI‑native applications, showcases examples such as Baidu Wenku and a legal‑assistant hackathon project, and discusses platform support, technical foundations, and emerging opportunities in the large‑model era.

AI-nativeBaiduIndustry Insight
0 likes · 14 min read
What Is AI‑Native Thinking and Why It Will Shape the Next Wave of Applications
Baidu Geek Talk
Baidu Geek Talk
Oct 11, 2023 · Artificial Intelligence

How Baidu’s Qianfan 2.0 Supercharges Large‑Model Development and Deployment

The article reviews Baidu Cloud’s Qianfan 2.0 platform, detailing its expanded model catalog, dataset library, Chinese‑language enhancements, compression and speed gains, robust AI infrastructure, application templates, and end‑to‑end data‑labeling pipeline that together lower cost and accelerate large‑model adoption across industries.

AI PlatformCloud AIModel Deployment
0 likes · 14 min read
How Baidu’s Qianfan 2.0 Supercharges Large‑Model Development and Deployment
JD Cloud Developers
JD Cloud Developers
Oct 10, 2023 · Artificial Intelligence

Do Large Language Models Have a Mind? Attention, Emergence & Compression Explained

This article examines whether ChatGPT and other large language models exhibit true Theory of Mind, detailing the role of attention mechanisms, neural network architecture, emergent abilities, the Chinese‑room argument, and how compression of massive textual data underlies their apparent intelligence.

Theory of Mindattention mechanismcompression
0 likes · 30 min read
Do Large Language Models Have a Mind? Attention, Emergence & Compression Explained
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 9, 2023 · Artificial Intelligence

Demystifying RLHF and PPO for Large Language Models: Theory and Practice

This article explains why Reinforcement Learning from Human Feedback (RLHF) is crucial for LLM intelligence, outlines the three-stage training pipeline, details InstructGPT's reward model and PPO optimization, and provides a practical guide to implementing RLHF with deep‑learning frameworks.

Artificial IntelligencePPORLHF
0 likes · 17 min read
Demystifying RLHF and PPO for Large Language Models: Theory and Practice
DataFunSummit
DataFunSummit
Sep 30, 2023 · Artificial Intelligence

Causal Inference from the Perspective of Large Models

This presentation by senior AI architect He Gang explores how large language models and LLM‑powered agents can enhance causal inference tasks, detailing model‑assisted analysis, agent‑based inference methods, and multi‑agent simulations to advance causal research.

AILLM agentslarge language models
0 likes · 2 min read
Causal Inference from the Perspective of Large Models
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Sep 22, 2023 · Artificial Intelligence

Understanding Large Language Models and Prompt Engineering: A Practical Guide

This article provides an introductory overview of large language models (LLMs), compares popular models, explains their underlying principles, and offers practical guidance on prompt engineering, model evaluation, usage tips, and safety considerations, helping readers effectively select and apply LLMs in various scenarios.

AILLMlarge language models
0 likes · 44 min read
Understanding Large Language Models and Prompt Engineering: A Practical Guide
Tencent Tech
Tencent Tech
Sep 20, 2023 · Artificial Intelligence

Why Do Large Language Models Hallucinate and How to Reduce It?

The article explains why large language models generate hallucinations—due to data errors, training conflicts, and inference uncertainty—and outlines data‑cleaning, model‑level feedback, knowledge augmentation, constraint techniques, and post‑processing methods such as the “Truth‑seeking” algorithm to mitigate the issue.

AI safetyKnowledge retrievaldata quality
0 likes · 8 min read
Why Do Large Language Models Hallucinate and How to Reduce It?
DataFunSummit
DataFunSummit
Sep 19, 2023 · Artificial Intelligence

Advances in Information Extraction: From PLM to LLM Paradigms at Alibaba DAMO Academy

This article reviews Alibaba DAMO Academy's research on information extraction, covering background concepts, PLM-era extraction paradigms, few‑shot extraction techniques, and the emerging LLM‑era approaches, while also sharing practical insights, benchmark results, and future directions.

Alibaba DAMORetrieval-Augmented Generationfew-shot learning
0 likes · 24 min read
Advances in Information Extraction: From PLM to LLM Paradigms at Alibaba DAMO Academy
Ximalaya Technology Team
Ximalaya Technology Team
Sep 18, 2023 · Artificial Intelligence

Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts

The article surveys the rise of LLM‑powered AI agents, defining them as LLM + memory + planning + tool use, contrasting fully autonomous agents with human‑guided autopilot/copilot variants, outlining their benefits, risks such as hallucinations and unsafe actions, and urging modular frameworks and oversight for reliable enterprise deployment.

AI AgentsAgent FrameworkArtificial Intelligence
0 likes · 27 min read
Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts
AntTech
AntTech
Sep 12, 2023 · Artificial Intelligence

Ensuring Trustworthy and Secure AI: Insights from the 2023 Pujiang Innovation Forum

The 2023 Pujiang Innovation Forum highlighted the rapid rise of generative AI, its associated security and privacy risks, and presented Ant Group's multi‑stage, multi‑layered approach—including data, training, and inference controls and three core defense technologies—to achieve safe, reliable, and open knowledge sharing in the era of large language models.

Information Securityknowledge sharinglarge language models
0 likes · 10 min read
Ensuring Trustworthy and Secure AI: Insights from the 2023 Pujiang Innovation Forum
DaTaobao Tech
DaTaobao Tech
Sep 11, 2023 · Artificial Intelligence

Large Language Model Upgrade Paths and Architecture Selection

This article analyzes upgrade paths of major LLMs—ChatGLM, LLaMA, Baichuan—detailing performance, context length, and architectural changes, then examines essential capabilities, data cleaning, tokenizer and attention design, and offers practical guidance for balanced scaling and efficient model construction.

BaichuanChatGLMLLM architecture
0 likes · 32 min read
Large Language Model Upgrade Paths and Architecture Selection
DataFunSummit
DataFunSummit
Sep 9, 2023 · Artificial Intelligence

Evolution of AIGC Technology and Its Applications in Life Sciences

This article reviews the development of AIGC and generative AI technologies—including image, text, and molecular generation—explains key model advances such as diffusion and large language models, discusses their impact on drug discovery, and outlines current challenges, opportunities, and future directions.

AI in Life SciencesAIGCdrug discovery
0 likes · 14 min read
Evolution of AIGC Technology and Its Applications in Life Sciences
DataFunTalk
DataFunTalk
Sep 8, 2023 · Artificial Intelligence

Knowledge Processing in the Era of Large Models: New Opportunities and New Challenges

This article examines how large language models and knowledge graphs complement each other, discussing their respective strengths, integration techniques such as prompt engineering and knowledge editing, and outlining future research directions for building large knowledge models that combine linguistic understanding with structured knowledge representation.

AIKnowledge Graphsknowledge representation
0 likes · 27 min read
Knowledge Processing in the Era of Large Models: New Opportunities and New Challenges
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 28, 2023 · Artificial Intelligence

AI-Driven Application Engineering: From Prompt Engineering to Autonomous Agents

This article examines how the rapid rise of generative AI reshapes application engineering by outlining AI's core characteristics, the challenges developers face, the evolution of prompt and chain-of-thought techniques, the emergence of agents and tool integration, and the future direction toward AI‑centric computing architectures.

AIagentsapplication architecture
0 likes · 20 min read
AI-Driven Application Engineering: From Prompt Engineering to Autonomous Agents
FunTester
FunTester
Aug 22, 2023 · Artificial Intelligence

The Current State and Future Outlook of AI‑Driven Software Testing

The article examines how large‑language models, test‑case generation technologies, and model‑driven testing are reshaping software testing, discusses the challenges of applying AI to testing, and outlines future directions and skill sets for professionals seeking to leverage AI in quality assurance.

AIKnowledge Graphslarge language models
0 likes · 14 min read
The Current State and Future Outlook of AI‑Driven Software Testing
DataFunTalk
DataFunTalk
Aug 21, 2023 · Artificial Intelligence

Can We Build Large-Scale Models for Recommendation Systems?

In this talk, Zhang Pengtao, a Sina Weibo technical expert with a Ph.D. in computer applications, explores how the strong memory capabilities of NLP large language models inspire the design of independent memory mechanisms for recommendation systems, covering model concepts, HCNet & MemoNet, experimental results, and practical takeaways for enhancing recommendation model performance.

AIMemory MechanismsRecommendation Systems
0 likes · 2 min read
Can We Build Large-Scale Models for Recommendation Systems?
DataFunTalk
DataFunTalk
Aug 19, 2023 · Artificial Intelligence

Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions

This article details how Zhihu's internal Bridge platform integrates large language models for business analysis, knowledge taxonomy, natural‑language‑to‑filter conversion, and ad‑hoc data queries, describing the workflow, technical hurdles, iterative improvements, and future directions.

AI for business analyticsknowledge taxonomylarge language models
0 likes · 12 min read
Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions
DataFunTalk
DataFunTalk
Aug 16, 2023 · Artificial Intelligence

Data Engineering, Automated Evaluation, and Knowledge Graph Integration in Large Model Development

This article presents a comprehensive overview of data engineering practices, pre‑training data composition, automated model evaluation techniques, and the synergistic use of knowledge graphs within large‑scale AI model research, highlighting pipelines, quality criteria, and practical case studies.

automation evaluationdata engineeringknowledge graph
0 likes · 29 min read
Data Engineering, Automated Evaluation, and Knowledge Graph Integration in Large Model Development
Bilibili Tech
Bilibili Tech
Aug 15, 2023 · Backend Development

Bilibili Customer Service System Architecture and Implementation

The article explains Bilibili's self‑developed customer‑service platform, describing its modular architecture, core workflows, and implementation of features such as intelligent QA with Faiss vector search, Redis‑based seat scheduling, a robust workstation, permission control, and exploration of large language models, highlighting improvements in interception rate, satisfaction, and handling time.

Backend DevelopmentCustomer Service SystemFaiss vector search
0 likes · 20 min read
Bilibili Customer Service System Architecture and Implementation
DataFunSummit
DataFunSummit
Aug 14, 2023 · Artificial Intelligence

State of GPT: A Programmer’s Guide to Large Language Model Fundamentals, Training, and Applications

This article provides programmers with a comprehensive overview of large language models—including their evolution, core concepts, data pipelines, model architectures, training techniques such as 3D parallelism, supervised fine‑tuning, RLHF, open‑source recipes, and emerging application ecosystems—while also highlighting current challenges and future directions.

Fine‑tuningLLM applicationsRLHF
0 likes · 43 min read
State of GPT: A Programmer’s Guide to Large Language Model Fundamentals, Training, and Applications
php Courses
php Courses
Aug 14, 2023 · Artificial Intelligence

Guide to the Five Most Powerful Large Language Models and How to Choose Them

This article explains the fundamentals of modern large language models, outlines the top five most powerful LLMs—including GPT‑4, Claude 2, Llama 2, Orca, and Cohere—and provides practical guidance on selecting and applying them across business and development use cases.

AI applicationsClaude 2GPT-4
0 likes · 9 min read
Guide to the Five Most Powerful Large Language Models and How to Choose Them
DataFunTalk
DataFunTalk
Aug 13, 2023 · Artificial Intelligence

Applying Large Language Models to Search Advertising Satisfaction: From DNN to ERNIE and Prompt Learning

The article details how Baidu's Fengchao team leverages large language models, including a transition from DNN embeddings to ERNIE, introduces multi‑level tokenization and discrete core‑word inputs, and applies prompt learning and AIGC techniques to improve search advertising satisfaction and industry‑specific relevance modeling.

AIGCBaidularge language models
0 likes · 22 min read
Applying Large Language Models to Search Advertising Satisfaction: From DNN to ERNIE and Prompt Learning
DataFunTalk
DataFunTalk
Aug 9, 2023 · Artificial Intelligence

Key Technologies for Domain‑Specific Large Models: Insights from the World AI Conference

This report, based on Professor Xiao Yanghua’s presentation at the World AI Conference, examines why vertical domains need general large models, outlines their key capabilities such as open‑world understanding, combinatorial innovation, evaluation, complex instruction execution, task planning, and symbolic reasoning, and discusses current limitations and optimization strategies for domain‑specific deployment.

AI evaluationVertical AIlarge language models
0 likes · 17 min read
Key Technologies for Domain‑Specific Large Models: Insights from the World AI Conference
Efficient Ops
Efficient Ops
Aug 8, 2023 · Artificial Intelligence

Rethinking Software Development in the Age of Large Language Models

The article examines fundamental challenges of applying large language models to software engineering—such as scale limits, lack of abstract reasoning, hidden tacit knowledge, and maintenance difficulties—and proposes practical recommendations for integrating AI with disciplined development practices.

AI integrationdevelopment automationknowledge management
0 likes · 7 min read
Rethinking Software Development in the Age of Large Language Models
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Aug 8, 2023 · Artificial Intelligence

Unlocking LMOps: How Enterprises Can Master Large Model Operations

This article explains the evolution from traditional machine learning to the current large‑model era, introduces LMOps concepts and key technologies, compares them with MLOps, and showcases Baidu Cloud's Qianfan platform as a practical solution for building, deploying, and managing large language models in industry.

AI OperationsBaidu CloudLMOps
0 likes · 22 min read
Unlocking LMOps: How Enterprises Can Master Large Model Operations
DataFunTalk
DataFunTalk
Jul 27, 2023 · Artificial Intelligence

Applying AIGC in E‑commerce: Product Copy and Image Generation with Large Language Models

This article shares recent AIGC practices in e‑commerce, detailing product copy generation using GPT‑based models, image creation with Stable Diffusion, the evolution of large language models, technical solutions, experimental results, and future opportunities for AI‑driven automation in online retail.

AIGCe-commerceimage generation
0 likes · 18 min read
Applying AIGC in E‑commerce: Product Copy and Image Generation with Large Language Models
Baidu Geek Talk
Baidu Geek Talk
Jul 26, 2023 · Artificial Intelligence

Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect

Baidu’s chief architect Li Shuanglong outlined how AIGC, driven by advanced large‑language and multimodal models, is already powering commercial tools such as automated copywriting, 2D digital‑human video creation and lead‑generation chatbots, while emphasizing future progress in engineering scalability, algorithmic fidelity, data quality, and scenario‑focused applications.

AI commercializationAI researchAIGC
0 likes · 8 min read
Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 24, 2023 · Artificial Intelligence

Comprehensive Survey of Large Language Models: History, Key Technologies, Resources, and Future Directions

This article provides a detailed overview of large language models (LLMs), tracing their evolution from statistical and neural language models to modern pre‑trained transformers, discussing scaling, training, adaptation, utilization, evaluation methods, available resources, and outlining current challenges and future research directions.

Model ScalingPre‑trainingevaluation
0 likes · 26 min read
Comprehensive Survey of Large Language Models: History, Key Technologies, Resources, and Future Directions
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Artificial IntelligenceFew-shotLLM
0 likes · 33 min read
Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples
ZhongAn Tech Team
ZhongAn Tech Team
Jul 14, 2023 · Artificial Intelligence

Exploring AIGC Applications in Insurance: Insights from ZhongAn Insurance CTO Jiang Jiyun

The interview with ZhongAn Insurance CTO Jiang Jiyun discusses how the company leverages AIGC technologies such as large language models, embeddings, and prompt engineering to enhance marketing, intelligent customer service, and data security, while highlighting practical challenges and best practices for AI adoption in the insurance sector.

AIGCData SecurityEmbedding
0 likes · 15 min read
Exploring AIGC Applications in Insurance: Insights from ZhongAn Insurance CTO Jiang Jiyun
21CTO
21CTO
Jul 8, 2023 · Artificial Intelligence

Unlocking LangChain: Build End-to-End LLM Apps with Chains, Agents, and Memory

This article introduces LangChain—a modular framework for constructing large‑language‑model applications—covering its core components, asynchronous support, prompt engineering, memory handling, chain and agent workflows, token considerations, embedding techniques, and a step‑by‑step Python example that culminates in a Gradio‑based conversational chatbot.

AI developmentEmbeddingLangChain
0 likes · 20 min read
Unlocking LangChain: Build End-to-End LLM Apps with Chains, Agents, and Memory
DeWu Technology
DeWu Technology
Jul 5, 2023 · Artificial Intelligence

Fine-tuning Large Language Models with LoRA/QLoRA and Deploying via GPTQ Quantization on KubeAI

The article explains how LoRA and its 4‑bit QLoRA extension dramatically reduce trainable parameters and GPU memory for fine‑tuning large language models, while GPTQ post‑training quantization compresses weights for cheap inference, and shows how KubeAI integrates these techniques into a one‑click workflow for 7 B, 13 B, and 33 B models from data upload to API deployment.

GPTQKubeAILoRA
0 likes · 13 min read
Fine-tuning Large Language Models with LoRA/QLoRA and Deploying via GPTQ Quantization on KubeAI
DataFunSummit
DataFunSummit
Jun 30, 2023 · Artificial Intelligence

Roundtable on Large‑Model‑Based Recommendation Systems: Opportunities, Challenges, and Future Directions

In this expert roundtable, leading researchers and engineers discuss the current state of recommendation systems, how large language models can reshape the field, the technical and practical challenges involved, and practical advice for practitioners looking to adopt AI‑driven personalization solutions.

AIIndustry InsightsRecommendation Systems
0 likes · 36 min read
Roundtable on Large‑Model‑Based Recommendation Systems: Opportunities, Challenges, and Future Directions
DataFunSummit
DataFunSummit
Jun 28, 2023 · Artificial Intelligence

OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights

This article presents OPPO Research Institute's recent advances in large‑model AI, detailing the CHAOS pretrained model that topped the CLUE leaderboard, the knowledge‑enhanced training pipeline, and the GammaE model for multi‑hop reasoning over knowledge graphs, together with experimental results and practical training tips.

AI researchGammaEchaos
0 likes · 20 min read
OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights
Programmer DD
Programmer DD
Jun 20, 2023 · Artificial Intelligence

Yann LeCun: Today's AI Still Below Dog Level – Inside Meta’s Voicebox, MusicGen & I‑JEPA

Meta’s chief AI scientist Yann LeCun warned that current large language models still fall short of human and even dog intelligence, citing their lack of real‑world understanding, while Meta unveiled three new generative AI models—Voicebox for speech, MusicGen for music, and I‑JEPA for image reasoning—showcasing both progress and remaining limitations.

Artificial Intelligencecomputer visiongenerative AI
0 likes · 7 min read
Yann LeCun: Today's AI Still Below Dog Level – Inside Meta’s Voicebox, MusicGen & I‑JEPA
DataFunTalk
DataFunTalk
Jun 20, 2023 · Artificial Intelligence

How Recommendation Systems Work and Their Integration with ChatGPT

This article explains the fundamentals of recommendation systems, their digital representation, how ChatGPT and large language models are applied to enhance recommendation performance, and highlights emerging trends such as conversational recommendation and a recommended book on the subject.

AIChatGPTConversational AI
0 likes · 8 min read
How Recommendation Systems Work and Their Integration with ChatGPT
DataFunSummit
DataFunSummit
Jun 14, 2023 · Artificial Intelligence

DataFun Summit 2023: Large Language Models and AIGC Conference

DataFun will host the DataFun Summit 2023 on June 17‑18, featuring three chairs and eight presenters who will discuss core topics such as large language model research, multimodal generation, reinforcement learning, tool learning, distributed training, and industry applications, with free registration via QR code.

AI ConferenceAIGCMachine Learning
0 likes · 42 min read
DataFun Summit 2023: Large Language Models and AIGC Conference
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 14, 2023 · Artificial Intelligence

ChatGPT Practice Applications and Large Model Technology Insights from the Juejin Offline Salon

The article recaps a Beijing offline salon where experts and open‑source contributors discussed ChatGPT desktop applications, the development and deployment of ChatGPT‑Next‑Web, large‑language‑model challenges, the VisualGLM multimodal model, and product design considerations, providing technical insights and community perspectives on AI advancements.

AIChatGPTOpen Source
0 likes · 9 min read
ChatGPT Practice Applications and Large Model Technology Insights from the Juejin Offline Salon