Tagged articles

1067 articles

Mar 20, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Large Language Model Training

The article outlines the resource and efficiency challenges of scaling large language models, explains data and model parallelism strategies, and details practical I/O, communication, and stability optimizations—including high‑availability storage, RDMA networking, and fault‑tolerance measures—to improve training throughput and reliability.

AI Engineering · I/O optimization · communication optimization

0 likes · 13 min read

DataFunTalk

Mar 17, 2024 · Artificial Intelligence

Leveraging Large Language Models to Enhance Comprehensive Graph Learning Capabilities

In this talk, researcher Jiang Zhuoren from Zhejiang University reviews the current state of large language models applied to graph learning, discusses their roles across various graph scenarios, and outlines promising research directions for unified cross‑domain graph learning.

Artificial Intelligence · cross-domain learning · graph learning

0 likes · 3 min read

Model Perspective

Mar 16, 2024 · Artificial Intelligence

What Watching a TV Drama Reveals About AI Model Training and Learning Strategies

The article draws parallels between expert viewers dissecting the drama "The Legend of Zhen Huan," efficient paper‑reading techniques, and the active‑prediction plus contrast‑learning approach that underpins modern AI model training, highlighting how proactive thinking boosts both personal and machine learning outcomes.

AI training · Prediction · active learning

0 likes · 8 min read

DataFunSummit

Mar 14, 2024 · Artificial Intelligence

Multi‑Level Efficiency Challenges and Emerging Paradigms for Large AI Models

The article examines how large AI models are moving toward a unified, low‑knowledge‑density paradigm that raises computational efficiency challenges across model, algorithm, framework, and infrastructure layers, while also highlighting NVIDIA's GTC 2024 China AI Day sessions that showcase practical solutions and upcoming training opportunities.

AI conferences · AI infrastructure · NVIDIA GTC

0 likes · 10 min read

21CTO

Mar 12, 2024 · Artificial Intelligence

How Google’s ‘Social Learning’ AI Framework Boosts Privacy‑Safe Model Training

Google’s newly unveiled “Social Learning” AI framework lets large models teach each other via natural language, improving task performance while avoiding direct use of sensitive data, and uses teacher‑student interactions, synthetic data, and instruction generation to enhance privacy‑preserving model training.

AI · large language models · privacy

0 likes · 4 min read

DataFunTalk

Mar 10, 2024 · Artificial Intelligence

Aligning Graph Models with Large Language Models for Open-Task Scenarios

This talk presents GraphTranslator, a framework that bridges pretrained graph models and large language models to enable unified handling of both predefined and open-ended graph analysis tasks by translating node representations into language tokens and training an alignment producer for node‑text pairs.

AI research · graph neural networks · large language models

0 likes · 3 min read

NewBeeNLP

Mar 10, 2024 · Industry Insights

What WWW'24 Papers Reveal About LLMs in Search & Recommendation

This overview summarizes six WWW 2024 industry papers that apply large language models to e‑commerce search, personalized query suggestion, article recommendation, collaborative filtering, and lifelong sequential behavior understanding, highlighting their methods, experimental results, deployment status, and emerging trends in LLM‑driven search and recommendation.

LLM · Search · WWW2024

0 likes · 16 min read

DataFunTalk

Mar 7, 2024 · Artificial Intelligence

Enhancing Interactive Agents with Large Language Models: The SwiftSage Framework and Benchmark Analysis

This article reviews recent advances in using large language models for interactive embodied agents, introduces the SwiftSage dual‑model framework that combines a fast T5‑based small model with a powerful LLM for planning, evaluates it on benchmarks such as ALFWorld and ScienceWorld, and discusses efficiency, cost‑effectiveness, limitations, and future research directions.

AI · SwiftSage · interactive agents

0 likes · 23 min read

Rare Earth Juejin Tech Community

Mar 7, 2024 · Artificial Intelligence

Anthropic Announces Claude 3 Model Family: Opus, Sonnet, and Haiku

Anthropic has launched the Claude 3 family of large language models (Opus, Sonnet, and Haiku), which offer different balances of intelligence, speed, and cost, with enhanced reasoning, multilingual, and vision capabilities, fewer refusals, and improved safety; the API is now available in 159 countries.

AI safety · Anthropic · Claude 3

0 likes · 11 min read

Model Perspective

Mar 6, 2024 · Fundamentals

Why Managing a City Is Like Designing a Spaceship: Exploring Complex Systems

An insightful look at how both spacecraft design and city governance exemplify complex systems, distinguishing closed versus open systems, outlining characteristics of complex and mega-complex systems, and linking these concepts to system engineering pioneers like Qian Xuesen and modern large language models.

Qian Xuesen · large language models · open vs closed systems

0 likes · 9 min read

DataFunSummit

Mar 6, 2024 · Artificial Intelligence

Document Intelligence: Background, Technology, Large Models, and Enterprise Applications

This article presents a comprehensive overview of document intelligence, covering its background, technical evolution, large‑model advancements, and practical enterprise digital transformation use cases, with a focus on multimodal processing, unified document representation, and industry‑specific applications such as legal contract automation.

Document Intelligence · Enterprise Automation · large language models

0 likes · 14 min read

Efficient Ops

Feb 27, 2024 · Artificial Intelligence

Can Large Language Models Truly Elevate Software Engineering? Insights and Roadmap

This article reviews the 2023 surge of large language models in software engineering, evaluates their current code generation, testing, and knowledge‑query capabilities, highlights persistent challenges in design and maintenance, and proposes concrete recommendations for advancing toward higher‑level intelligent development.

code generation · digital twins · generative AI

0 likes · 21 min read

NewBeeNLP

Feb 17, 2024 · Artificial Intelligence

How Sora Highlights the Next Leap Toward AGI and Shifts AI Competition

The article analyzes OpenAI's Sora video model, arguing that its integration of large‑language‑model reasoning with diffusion techniques marks a major step toward true world understanding, reshapes creative workflows, widens the AI talent gap, and accelerates the path to artificial general intelligence.

AGI · AI trends · Sora

0 likes · 7 min read

NewBeeNLP

Feb 11, 2024 · Industry Insights

What 2023 Taught Us About LLMs and AI‑Guided Optimization

The author reviews a year of rapid progress in large language models, highlighting breakthrough papers such as Positional Interpolation, StreamingLLM, Deja Vu, and RLCD, and discusses how AI‑guided optimization techniques like SurCo, LANCER, and GenCo are reshaping research and industry applications.

LLM · Transformers · ai-optimization

0 likes · 13 min read

DataFunTalk

Feb 10, 2024 · Artificial Intelligence

Mitigating Hallucinations in Large Language Model Applications with Knowledge Graphs

This article examines the challenges of using large language models for industry Q&A, defines hallucination phenomena, evaluates their causes and impact, and proposes a set of strategies—including high‑quality fine‑tuning data, honest alignment, advanced decoding, and external knowledge‑graph augmentation—to reduce hallucinations and improve answer reliability.

hallucination · knowledge graph · large language models

0 likes · 21 min read

Cloud Native Technology Community

Feb 8, 2024 · Artificial Intelligence

How Retrieval‑Augmented Generation Boosts LLM Accuracy and Trust

Retrieval‑augmented generation (RAG) enhances large language models by fetching up‑to‑date, authoritative information from external sources, addressing hallucinations, outdated knowledge, and lack of citations, while offering cost‑effective implementation, improved relevance, user trust, and greater developer control through vector databases, semantic search, and prompt engineering.

AI · RAG · large language models

0 likes · 10 min read

DataFunSummit

Feb 5, 2024 · Artificial Intelligence

Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models

Ant Group shares its comprehensive knowledge graph initiatives, detailing the fundamentals, construction pipeline, fusion techniques, cognitive representations, diverse business applications, and the emerging synergy between knowledge graphs and large language models, illustrating how graph-based AI enhances accuracy, interpretability, and downstream services.

Artificial Intelligence · Graph Fusion · Semantic Modeling

0 likes · 14 min read

MaGe Linux Operations

Jan 31, 2024 · Artificial Intelligence

Does Gemini Pro Really Outperform GPT‑4? A Deep Comparative Review

This article critically examines Google’s Gemini Pro against OpenAI’s GPT‑4 across reasoning, vision, token limits, benchmark data, and real‑world tasks, revealing where Gemini excels, where it falls short, and what to expect from the upcoming Gemini Ultra.

AI model comparison · GPT-4 · Gemini Pro

0 likes · 13 min read

DataFunTalk

Jan 31, 2024 · Artificial Intelligence

Industry Trends and Challenges of Large Language Models in Enterprise Applications (2023 Review)

The article reviews the rapid development of large language models in enterprise settings, covering internal collaboration tools, AI assistants for development and marketing, multimodal generation, inference speed bottlenecks, resource constraints, and future directions such as open‑source models and academic‑industry cooperation.

AI assistants · AI in marketing · Inference Performance

0 likes · 8 min read

Alibaba Cloud Big Data AI Platform

Jan 29, 2024 · Artificial Intelligence

Unlocking Sparse MoE Large Model Training with Megatron-Core on Alibaba Cloud

This article explains how Alibaba Cloud's PAI platform and NVIDIA's Megatron-Core enable efficient training of sparse Mixture-of-Experts (MoE) large language models, covering algorithm basics, the Megatron-Core MoE framework, weight conversion pipelines, and performance results on Mixtral‑8x7B.

Megatron-Core · Mixture of Experts · Model Parallelism

0 likes · 18 min read

ZhongAn Tech Team

Jan 22, 2024 · Artificial Intelligence

Weekly Tech Overview: Major Industry Updates and AI Insights

This weekly tech overview summarizes major industry developments, including Huawei's HarmonyOS NEXT release, SenseTime's open‑source large language model InternLM2, the Apple‑Epic App Store dispute resolution, Xiaomi's 5G satellite terminal approval, Microsoft overtaking Apple in market value, and recent AI energy consumption concerns.

AI · HarmonyOS · Industry Updates

0 likes · 10 min read

Xiaohongshu Tech REDtech

Jan 20, 2024 · Artificial Intelligence

Decoding Xiaohongshu’s Recommendation System: How Ordinary Users Gain Visibility

Xiaohongshu’s recommendation system uses large‑scale multimodal embeddings, dual‑tower and graph models, and diversity techniques like DPP and SSD to quickly surface high‑quality user‑generated content, enabling ordinary users to gain visibility while balancing personalization, exploration, and efficient LLM‑augmented pipelines.

Xiaohongshu · cold start · content diversity

0 likes · 15 min read

Cognitive Technology Team

Jan 17, 2024 · Artificial Intelligence

Redis Founder antirez Reflects on Large Language Models in 2024

In his first 2024 blog post, Redis founder antirez shares a programmer's perspective on large language models, sharply critiques Google's search engine, evaluates current AIGC as both foolish and historically knowledgeable, and argues that generative AI mainly amplifies the abilities of already strong developers.

AI Commentary · Redis · large language models

0 likes · 2 min read

21CTO

Jan 14, 2024 · Artificial Intelligence

Can Large Language Models Really Boost Programming Productivity? Insights from Redis Founder

The article reflects on the Redis founder's 2024 blog about large language models, examining their strengths and limits in software development, illustrating how they can accelerate coding for experienced programmers while highlighting challenges in system programming and the need for careful prompt engineering.

AI programming · large language models · productivity

0 likes · 19 min read

Rare Earth Juejin Tech Community

Jan 3, 2024 · Artificial Intelligence

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Ghost Attention, RLHF Results, and Safety Evaluation

This article summarizes the Llama 2 series, describing the Ghost Attention technique for maintaining system‑message consistency across multi‑turn dialogs, presenting RLHF and human evaluation results, and discussing extensive safety pre‑training, benchmark assessments, and model release details.

AI evaluation · Ghost Attention · Llama-2

0 likes · 20 min read

OPPO Kernel Craftsman

Dec 29, 2023 · Information Security

OPPO Releases White Paper on Mobile Application Trustworthy Technology at CAICT ICT+ Deep Observation Conference

At the CAICT ICT+ Deep Observation Conference, OPPO unveiled a white paper on mobile application trustworthy technology, analyzing lifecycle security risks, policy and patent developments, and the role of large‑model AI in intelligent terminals, while urging standardized security practices and accelerated AI‑driven vulnerability detection tools.

CAICT · Intelligent Terminals · Mobile Application Security

0 likes · 4 min read

OPPO Amber Lab

Dec 29, 2023 · Information Security

Large Models Transform Mobile App Security – Key Takeaways from OPPO’s White Paper

The 2024 China Academy of ICT deep‑observation summit in Shanghai unveiled OPPO’s new white paper on trustworthy mobile application technology, highlighting how large language models enhance smart terminal security, surveying industry trends, and outlining future directions for secure, intelligent mobile ecosystems.

OPPO · Software Security · large language models

0 likes · 6 min read

DataFunTalk

Dec 25, 2023 · Artificial Intelligence

Tool Learning with Foundation Models: Frameworks, Datasets, and Open‑Source Toolkits

This article reviews the emerging field of tool learning for large foundation models, outlining its background, categorization, core framework components, training strategies, and applications such as WebCPM, BMTools, and ToolBench, while highlighting recent research results and open‑source resources.

AI tools · foundation models · large language models

0 likes · 21 min read

Java High-Performance Architecture

Dec 22, 2023 · Artificial Intelligence

Is Google Gemini Echoing Baidu? A Deep Dive into Model Contamination

The article investigates recent tests showing that Google Gemini sometimes claims to be Baidu's AI, reproduces Baidu‑related responses, and appears to have its Chinese and English corpora contaminated with competitor data, highlighting the challenges of data provenance in large language models.

AI model contamination · AI testing · Baidu Wenxin

0 likes · 6 min read

DataFunTalk

Dec 21, 2023 · Artificial Intelligence

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023

At EMNLP 2023, the joint WeChat AI and Peking University paper 'Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning' won the Best Long Paper award, revealing that label tokens act as anchors driving information aggregation in shallow layers and prediction flow in deep layers, and proposing methods to improve and diagnose in‑context learning.

AI research · In-Context Learning · Information Flow

0 likes · 13 min read

DataFunTalk

Dec 19, 2023 · Artificial Intelligence

Enterprise Large‑Model Deployment and Data Governance: Insights from Deepexi’s President

The article examines how enterprises can adopt domain‑specific large models by balancing demand‑side cost‑reduction needs with supply‑side mature training techniques, discusses team composition, fine‑tuning methods, data governance for unstructured data, and outlines Deepexi’s product ecosystem designed to improve efficiency, performance, and user experience.

AI deployment · cost economics · enterprise AI

0 likes · 13 min read

21CTO

Dec 17, 2023 · Artificial Intelligence

Why AI‑Native Apps Matter: Insights from Baidu, ByteDance Ban, and New PHP Server

The article examines Baidu CEO Li Yanhong’s call to focus on AI‑native applications, reports ByteDance’s suspension by OpenAI for misusing GPT, outlines Google’s phased removal of third‑party cookies, and announces the release of the Go‑based PHP server FrankenPHP 1.0.

AI-native applications · PHP server · large language models

0 likes · 7 min read

DataFunSummit

Dec 14, 2023 · Artificial Intelligence

Enterprise Large‑Model Deployment: Data Governance, Fine‑Tuning Strategies, and Cost Economics

The article examines how enterprises can adopt domain‑specific large language models by addressing data governance, model fine‑tuning techniques, dataset balance, and product architecture to achieve cost‑effective, high‑performance AI solutions across various business scenarios.

Model Fine‑tuning · cost efficiency · large language models

0 likes · 14 min read

Huawei Cloud Developer Alliance

Dec 14, 2023 · Artificial Intelligence

Unlocking LLaMA: Key Innovations, Architecture Insights, and MindSpore Inference Guide

This article reviews the LLaMA large‑language‑model series, covering its background, architectural innovations such as Add&Norm, SwiGLU, and RoPE, a known reversal‑curse bug, and provides step‑by‑step MindSpore Transformers code for model configuration, inference, and pipeline usage while previewing the upcoming LLaMA‑2 session.

AI · LLaMA · MindSpore

0 likes · 6 min read

DataFunTalk

Dec 12, 2023 · Artificial Intelligence

Challenges and Considerations of Recommendation Systems: Evaluation, Data Leakage, and the Role of Large Models

This article examines recommendation system problem definitions, differences between academia and industry, offline evaluation pitfalls and data leakage issues, data construction challenges with datasets like MovieLens, and evaluates whether large language models can serve as effective solutions for modern recommendation tasks.

Machine Learning · Recommendation Systems · data leakage

0 likes · 20 min read

21CTO

Dec 7, 2023 · Artificial Intelligence

Google Gemini vs GPT‑4: Can the New AI Model Outperform ChatGPT?

Google's Gemini AI suite, unveiled in December, brings three model sizes—Nano, Pro, and Ultra—to power Bard and other services, claims superior performance over GPT‑4 across most benchmarks, and introduces multimodal capabilities that signal a major shift in the AI landscape.

AI language model · GPT-4 comparison · Google Gemini

0 likes · 6 min read

JD Tech

Nov 30, 2023 · Artificial Intelligence

Understanding ChatGPT: Mechanisms, Attention, Emergence, and the Chinese Room

This article examines the principles behind ChatGPT, detailing its continuation-based operation, the role of attention mechanisms and transformer architecture, the scaling of neural networks that leads to emergent abilities, and interprets these phenomena through the lenses of compression theory and the Chinese Room thought experiment.

ChatGPT · attention mechanism · compression

0 likes · 27 min read

AntTech

Nov 24, 2023 · Artificial Intelligence

Code Model Evaluation Framework and the CodeFuseEval Benchmark Overview

This article presents a comprehensive overview of code large‑model evaluation, describing the need for multi‑dimensional benchmarks, the CodeFuseEval benchmark suite, dataset construction, evaluation methods, framework architecture, result visualisation, and future directions for enterprise‑grade code generation models.

AI · CodeFuseEval · benchmark

0 likes · 12 min read

Ant R&D Efficiency

Nov 24, 2023 · Artificial Intelligence

CodeFuseEval: An Enterprise‑Level Multi‑Task Benchmark for Evaluating Code Large Models

CodeFuseEval is an enterprise‑grade, multi‑task benchmark that evaluates code‑generation large models across six languages and thousands of real‑world tasks using both objective metrics (pass@k, BLEU, CodeBLEU) and expert human review, with an open‑source framework, continuous dataset expansion, and a focus on correctness, efficiency, robustness, and service‑level quality.

AI · benchmark · code generation

0 likes · 12 min read

DataFunTalk

Nov 21, 2023 · Artificial Intelligence

Improving Efficiency of Large-Scale Distributed Training for Large Language Models

Recent advances in large language models have dramatically increased model size and training data, leading to soaring computational costs; this article examines the scaling trends, hardware utilization challenges, distributed training techniques, and ethical considerations, highlighting methods to improve efficiency, reduce costs, and mitigate environmental impact.

AI ethics · Efficiency · compute optimization

0 likes · 29 min read

Baobao Algorithm Notes

Nov 21, 2023 · Artificial Intelligence

How Much Data Do You Need for a 10B LLM? Decoding Scaling Laws

This article explains how scaling laws can answer common LLM development questions—such as the data required for a 10B model, the model size achievable with 1 TB of data, and the optimal compute‑data‑model trade‑off for a fixed GPU budget—by presenting core formulas, practical derivations, and insights from OpenAI, DeepMind and Google.

Compute Efficiency · Data Requirements · Model Size

0 likes · 12 min read

360 Smart Cloud

Nov 20, 2023 · Artificial Intelligence

Overview of Recent Open‑Source AI Models and Tools (November 2023)

This article summarizes a collection of newly released open‑source AI projects covering natural‑language processing, multimodal processing, intelligent agents, recommendation systems, and model training acceleration, providing brief descriptions, key capabilities, and links to their repositories.

AI · Multimodal · Recommendation Systems

0 likes · 9 min read

Ximalaya Technology Team

Nov 16, 2023 · Artificial Intelligence

How AI Agents Turn One-Line Prompts Into Fully Functional Apps in Minutes

ChatDev, an AI‑driven software development platform, claims to create complete applications from a single prompt in about three minutes and at a cost of roughly two yuan, leveraging a multi‑agent workflow, a custom 100‑billion‑parameter model, and open‑source frameworks to dramatically cut development time and expense.

AI Agents · ChatDev · industry analysis

0 likes · 13 min read

Architect

Nov 8, 2023 · Artificial Intelligence

AI Agents Unleashed: From Assistants API to Multi‑Agent Frameworks

The article dissects the rise of AI agents—from OpenAI's Assistants API and multimodal perception‑brain‑action pipelines to retrieval‑augmented generation, tool‑use strategies, single‑ and multi‑agent deployments, and emerging frameworks like AutoGen—while highlighting concrete examples, benchmark results, and current limitations.

AI Agents · Assistants API · Embodied AI

0 likes · 38 min read

Tencent Cloud Developer

Nov 8, 2023 · Artificial Intelligence

Comprehensive Overview of AI Agents: Concepts, Technical Frameworks, and Applications

The article surveys modern AI agents—software entities powered by large language models that perceive multimodal inputs, reason via brain modules, act through tools or embodied actions, employ retrieval‑augmented generation and chain‑of‑thought planning, and can operate singly (e.g., AutoGPT) or collaboratively via frameworks like Microsoft’s AutoGen—while highlighting current challenges such as controllability, memory limits, parallelism, and reliability.

AI Agents · Agent Architecture · AutoGen

0 likes · 34 min read

DataFunSummit

Nov 5, 2023 · Artificial Intelligence

Enhancing Recommendation Models with Scaling Law via HCNet and MemoNet: A Memory‑Based Feature‑Combination Approach

This article presents a memory‑driven architecture (HCNet and MemoNet) that equips recommendation models with scaling‑law characteristics by storing and retrieving arbitrary feature‑combination embeddings, evaluates multi‑hash codebooks, memory‑restoring strategies, key‑feature selection, and demonstrates significant offline and online performance gains.

Scaling Law · feature interaction · large language models

0 likes · 15 min read

Model Perspective

Nov 2, 2023 · Artificial Intelligence

Why Mathematical Modelers Must Embrace LLMs and Forget Outdated Skills

The article explains how rapid advances in data and large language models force mathematical modelers to continuously update their models and skills, discard obsolete knowledge, and adopt lifelong learning to stay effective in a fast‑changing AI‑driven environment.

Artificial Intelligence · continuous learning · data-science

0 likes · 6 min read

Baidu Geek Talk

Nov 2, 2023 · Artificial Intelligence

AI-Powered Code Defect Detection: Leveraging Code Knowledge Graphs and Large Language Models

The paper presents an AI‑driven static analysis framework that builds code knowledge graphs to extract relevant slices and leverages large language models for multilingual defect prediction, achieving up to 80% F1, detecting 662 defects across 1,100 C++ modules with a 26.9% recall gain over traditional rule‑based scanners.

BERT · Software quality · code defect detection

0 likes · 9 min read

Baidu Intelligent Cloud Tech Hub

Nov 1, 2023 · Databases

How BES Powers Large-Scale Vector Search for AI Applications

This article explains the principles of vector databases, outlines the engineering practices of Baidu Intelligent Cloud BES for large‑scale vector retrieval, discusses optimization techniques such as HNSW, IVF and filter integration, and presents real‑world AI use cases and future development directions.

AI · BES · Elasticsearch

0 likes · 16 min read

DataFunSummit

Oct 30, 2023 · Artificial Intelligence

Exploring General AI, Large Language Models, Knowledge Graphs, and Reinforcement Learning – Insights from DataFun

This article presents a comprehensive overview of DaGuan Data's explorations in general artificial intelligence, large language models, knowledge graphs, reinforcement learning, compute and data requirements, and the emerging concept of Human‑Centric AGI, supplemented by a detailed Q&A session.

AGI · Artificial Intelligence · Knowledge Graphs

0 likes · 18 min read

DataFunSummit

Oct 27, 2023 · Artificial Intelligence

ChatGPT Technology, Localization Attempts, and Open‑Source Large Models

This article reviews the evolution and challenges of ChatGPT technology, describes the authors' efforts to localize and commercialize the model for the Chinese market, and introduces their open‑source Chinese large‑model initiative, including training methods, performance gaps, and future improvement directions.

ChatGPT · Chinese NLP · Model Localization

0 likes · 11 min read

Baidu Tech Salon

Oct 25, 2023 · Artificial Intelligence

Intelligent Question Answering Technology in Baidu Search: Development, Modeling, and Retrieval‑Enhanced Generation

The article surveys Baidu Search’s intelligent question‑answering system, tracing its evolution from feature‑engineered retrieval to large pre‑trained and generative models, and detailing hierarchical readers, multi‑teacher distillation, retrieval‑enhanced generation, and instruction decomposition as key techniques for delivering fast, accurate, citation‑rich answers.

Baidu Search · Knowledge Distillation · Retrieval-Augmented Generation

0 likes · 18 min read

Baidu Geek Talk

Oct 25, 2023 · Artificial Intelligence

How Baidu Search Is Transforming Machine Question Answering with Large‑Scale AI Models

This article reviews the evolution of machine question answering, from early feature‑engineered systems to modern large‑language‑model‑driven retrieval‑augmented generation, outlines Baidu Search’s current Retriever‑Reader architecture, discusses challenges such as semantic complexity, latency and answer quality, and presents solutions including hierarchical DocMRC modeling, multi‑teacher knowledge distillation, and instruction decomposition for efficient, high‑quality answers.

Baidu · Knowledge Distillation · Retrieval-Augmented Generation

0 likes · 18 min read

DataFunTalk

Oct 25, 2023 · Artificial Intelligence

Applying Large Language Models to Wireless Network Intelligent Operations: Opportunities, Challenges, and Platform Construction

This article examines how large language model technology can be leveraged for intelligent operation of wireless communication networks, analyzing its advantages, current challenges, platform architecture, experimental validation, and future research directions within the telecom industry.

AI · intelligent operation · knowledge graph

0 likes · 17 min read

AI Large Model Application Practice

Oct 23, 2023 · Artificial Intelligence

Unlocking GPT‑4V: A Concise Guide to Multimodal Capabilities and Prompt Techniques

This article summarizes the GPT‑4V research paper, detailing its visual input modes, effective prompting strategies, diverse multimodal abilities, high‑value application scenarios, and ways to enhance the model with classic LLM techniques while noting current limitations.

AI applications · GPT-4V · large language models

0 likes · 17 min read

Zuoyebang Tech Team

Oct 19, 2023 · Artificial Intelligence

How AI and Big Data Are Transforming Education: Insights from Zuoyebang’s Chief Scientist

At the GET2023 Education Technology Conference, Zuoyebang’s chief scientist Song Yang detailed how AI, large language models, big data, and smart hardware are reshaping learning experiences across subjects, from math problem generation to interactive programming assistants, and outlined the company’s vision for AI‑driven education.

AI in Education · Educational Technology · large language models

0 likes · 12 min read

Alimama Tech

Oct 18, 2023 · Artificial Intelligence

Technical Challenges and Directions for Large‑Model Applications in E‑commerce

Taobao Group’s ten large‑model challenges target e‑commerce AI by demanding domain‑specific pre‑training, multi‑step reasoning, extended context handling, factual reliability, intelligent tool orchestration, robust retrieval integration, fuzzy‑intent tool selection, scalable multi‑objective RLHF, improved query rewriting, and knowledge‑driven recommendation.

RLHF · e-commerce · knowledge hallucination

0 likes · 16 min read

DaTaobao Tech

Oct 18, 2023 · Artificial Intelligence

Large Model Application Challenges for E-commerce

Taobao Group’s ten large‑model e‑commerce challenges call for researchers to build domain‑specific data pipelines, mitigate forgetting, balance expertise with generality, enable multi‑step reasoning, handle long contexts, reduce hallucinations, integrate tool use, improve fuzzy intent detection, apply multi‑objective RLHF, and generate cognitively novel recommendations.

RLHF · knowledge hallucination · large language models

0 likes · 14 min read

Baidu Geek Talk

Oct 16, 2023 · Industry Insights

What Is AI‑Native Thinking and Why It Will Shape the Next Wave of Applications

The article explores the concept of AI‑native thinking, outlines the mindset and conditions needed for AI‑native applications, showcases examples such as Baidu Wenku and a legal‑assistant hackathon project, and discusses platform support, technical foundations, and emerging opportunities in the large‑model era.

AI-native · Baidu · Industry Insight

0 likes · 14 min read

Baidu Geek Talk

Oct 11, 2023 · Artificial Intelligence

How Baidu’s Qianfan 2.0 Supercharges Large‑Model Development and Deployment

The article reviews Baidu Cloud’s Qianfan 2.0 platform, detailing its expanded model catalog, dataset library, Chinese‑language enhancements, compression and speed gains, robust AI infrastructure, application templates, and end‑to‑end data‑labeling pipeline that together lower cost and accelerate large‑model adoption across industries.

AI Platform · Cloud AI · Model Deployment

0 likes · 14 min read

JD Cloud Developers

Oct 10, 2023 · Artificial Intelligence

Do Large Language Models Have a Mind? Attention, Emergence & Compression Explained

This article examines whether ChatGPT and other large language models exhibit true Theory of Mind, detailing the role of attention mechanisms, neural network architecture, emergent abilities, the Chinese‑room argument, and how compression of massive textual data underlies their apparent intelligence.

Theory of Mind · attention mechanism · compression

0 likes · 30 min read

Baobao Algorithm Notes

Oct 9, 2023 · Artificial Intelligence

Demystifying RLHF and PPO for Large Language Models: Theory and Practice

This article explains why Reinforcement Learning from Human Feedback (RLHF) is crucial for LLM intelligence, outlines the three-stage training pipeline, details InstructGPT's reward model and PPO optimization, and provides a practical guide to implementing RLHF with deep‑learning frameworks.

Artificial Intelligence · PPO · RLHF

0 likes · 17 min read

DataFunSummit

Sep 30, 2023 · Artificial Intelligence

Causal Inference from the Perspective of Large Models

This presentation by senior AI architect He Gang explores how large language models and LLM‑powered agents can enhance causal inference tasks, detailing model‑assisted analysis, agent‑based inference methods, and multi‑agent simulations to advance causal research.

AI · LLM agents · large language models

0 likes · 2 min read

NetEase LeiHuo Testing Center

Sep 22, 2023 · Artificial Intelligence

Understanding Large Language Models and Prompt Engineering: A Practical Guide

This article provides an introductory overview of large language models (LLMs), compares popular models, explains their underlying principles, and offers practical guidance on prompt engineering, model evaluation, usage tips, and safety considerations, helping readers effectively select and apply LLMs in various scenarios.

AI · LLM · large language models

0 likes · 44 min read

Tencent Tech

Sep 20, 2023 · Artificial Intelligence

Why Do Large Language Models Hallucinate and How to Reduce It?

The article explains why large language models generate hallucinations—due to data errors, training conflicts, and inference uncertainty—and outlines data‑cleaning, model‑level feedback, knowledge augmentation, constraint techniques, and post‑processing methods such as the “Truth‑seeking” algorithm to mitigate the issue.

AI safety · Knowledge retrieval · data quality

0 likes · 8 min read

DataFunSummit

Sep 19, 2023 · Artificial Intelligence

Advances in Information Extraction: From PLM to LLM Paradigms at Alibaba DAMO Academy

This article reviews Alibaba DAMO Academy's research on information extraction, covering background concepts, PLM-era extraction paradigms, few‑shot extraction techniques, and the emerging LLM‑era approaches, while also sharing practical insights, benchmark results, and future directions.

Alibaba DAMO · Retrieval-Augmented Generation · few-shot learning

0 likes · 24 min read

Ximalaya Technology Team

Sep 18, 2023 · Artificial Intelligence

Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts

The article surveys the rise of LLM‑powered AI agents, defining them as LLM + memory + planning + tool use, contrasting fully autonomous agents with human‑guided autopilot/copilot variants, outlining their benefits, risks such as hallucinations and unsafe actions, and urging modular frameworks and oversight for reliable enterprise deployment.

AI Agents · Agent Framework · Artificial Intelligence

0 likes · 27 min read

AntTech

Sep 12, 2023 · Artificial Intelligence

Ensuring Trustworthy and Secure AI: Insights from the 2023 Pujiang Innovation Forum

The 2023 Pujiang Innovation Forum highlighted the rapid rise of generative AI, its associated security and privacy risks, and presented Ant Group's multi‑stage, multi‑layered approach—including data, training, and inference controls and three core defense technologies—to achieve safe, reliable, and open knowledge sharing in the era of large language models.

Information Security · knowledge sharing · large language models

0 likes · 10 min read

DaTaobao Tech

Sep 11, 2023 · Artificial Intelligence

Large Language Model Upgrade Paths and Architecture Selection

This article analyzes upgrade paths of major LLMs—ChatGLM, LLaMA, Baichuan—detailing performance, context length, and architectural changes, then examines essential capabilities, data cleaning, tokenizer and attention design, and offers practical guidance for balanced scaling and efficient model construction.

Baichuan · ChatGLM · LLM architecture

0 likes · 32 min read

DataFunSummit

Sep 9, 2023 · Artificial Intelligence

Evolution of AIGC Technology and Its Applications in Life Sciences

This article reviews the development of AIGC and generative AI technologies—including image, text, and molecular generation—explains key model advances such as diffusion and large language models, discusses their impact on drug discovery, and outlines current challenges, opportunities, and future directions.

AI in Life Sciences · AIGC · drug discovery

0 likes · 14 min read

DataFunTalk

Sep 8, 2023 · Artificial Intelligence

Knowledge Processing in the Era of Large Models: New Opportunities and New Challenges

This article examines how large language models and knowledge graphs complement each other, discussing their respective strengths, integration techniques such as prompt engineering and knowledge editing, and outlining future research directions for building large knowledge models that combine linguistic understanding with structured knowledge representation.

AI · Knowledge Graphs · knowledge representation

0 likes · 27 min read

Continuous Delivery 2.0

Sep 7, 2023 · Artificial Intelligence

Google’s Internal Memo: “We Have No Moat, Neither Does OpenAI” – The Rise of Open‑Source AI

A leaked internal Google document titled “We Have No Moat, Neither Does OpenAI” argues that both companies are losing the AI arms race to rapidly advancing open‑source models, which achieve comparable performance at a fraction of the cost, prompting a strategic rethink for Google.

AI · Google · LoRA

0 likes · 16 min read

Alibaba Cloud Developer

Aug 28, 2023 · Artificial Intelligence

AI-Driven Application Engineering: From Prompt Engineering to Autonomous Agents

This article examines how the rapid rise of generative AI reshapes application engineering by outlining AI's core characteristics, the challenges developers face, the evolution of prompt and chain-of-thought techniques, the emergence of agents and tool integration, and the future direction toward AI‑centric computing architectures.

AI · agents · application architecture

0 likes · 20 min read

FunTester

Aug 22, 2023 · Artificial Intelligence

The Current State and Future Outlook of AI‑Driven Software Testing

The article examines how large‑language models, test‑case generation technologies, and model‑driven testing are reshaping software testing, discusses the challenges of applying AI to testing, and outlines future directions and skill sets for professionals seeking to leverage AI in quality assurance.

AI · Knowledge Graphs · large language models

0 likes · 14 min read

DataFunTalk

Aug 21, 2023 · Artificial Intelligence

Can We Build Large-Scale Models for Recommendation Systems?

In this talk, Zhang Pengtao, a Sina Weibo technical expert with a Ph.D. in computer applications, explores how the strong memory capabilities of NLP large language models inspire the design of independent memory mechanisms for recommendation systems, covering model concepts, HCNet & MemoNet, experimental results, and practical takeaways for enhancing recommendation model performance.

AI · Memory Mechanisms · Recommendation Systems

0 likes · 2 min read

DataFunTalk

Aug 19, 2023 · Artificial Intelligence

Applying Large Language Models to Zhihu's Bridge Platform: Use Cases, Challenges, and Solutions

This article details how Zhihu's internal Bridge platform integrates large language models for business analysis, knowledge taxonomy, natural‑language‑to‑filter conversion, and ad‑hoc data queries, describing the workflow, technical hurdles, iterative improvements, and future directions.

AI for business analytics · knowledge taxonomy · large language models

0 likes · 12 min read

DataFunTalk

Aug 16, 2023 · Artificial Intelligence

Data Engineering, Automated Evaluation, and Knowledge Graph Integration in Large Model Development

This article presents a comprehensive overview of data engineering practices, pre‑training data composition, automated model evaluation techniques, and the synergistic use of knowledge graphs within large‑scale AI model research, highlighting pipelines, quality criteria, and practical case studies.

automation evaluation · data engineering · knowledge graph

0 likes · 29 min read

Bilibili Tech

Aug 15, 2023 · Backend Development

Bilibili Customer Service System Architecture and Implementation

The article explains Bilibili's self‑developed customer‑service platform, describing its modular architecture, core workflows, and implementation of features such as intelligent QA with Faiss vector search, Redis‑based seat scheduling, a robust workstation, permission control, and exploration of large language models, highlighting improvements in interception rate, satisfaction, and handling time.

Backend Development · Customer Service System · Faiss vector search

0 likes · 20 min read

DataFunSummit

Aug 14, 2023 · Artificial Intelligence

State of GPT: A Programmer’s Guide to Large Language Model Fundamentals, Training, and Applications

This article provides programmers with a comprehensive overview of large language models—including their evolution, core concepts, data pipelines, model architectures, training techniques such as 3D parallelism, supervised fine‑tuning, RLHF, open‑source recipes, and emerging application ecosystems—while also highlighting current challenges and future directions.

Fine‑tuning · LLM applications · RLHF

0 likes · 43 min read

php Courses

Aug 14, 2023 · Artificial Intelligence

Guide to the Five Most Powerful Large Language Models and How to Choose Them

This article explains the fundamentals of modern large language models, outlines the top five most powerful LLMs—including GPT‑4, Claude 2, Llama 2, Orca, and Cohere—and provides practical guidance on selecting and applying them across business and development use cases.

AI applications · Claude 2 · GPT-4

0 likes · 9 min read

DataFunTalk

Aug 13, 2023 · Artificial Intelligence

Applying Large Language Models to Search Advertising Satisfaction: From DNN to ERNIE and Prompt Learning

The article details how Baidu's Fengchao team leverages large language models, including a transition from DNN embeddings to ERNIE, introduces multi‑level tokenization and discrete core‑word inputs, and applies prompt learning and AIGC techniques to improve search advertising satisfaction and industry‑specific relevance modeling.

AIGC · Baidu · large language models

0 likes · 22 min read

DataFunTalk

Aug 9, 2023 · Artificial Intelligence

Key Technologies for Domain‑Specific Large Models: Insights from the World AI Conference

This report, based on Professor Xiao Yanghua’s presentation at the World AI Conference, examines why vertical domains need general large models, outlines their key capabilities such as open‑world understanding, combinatorial innovation, evaluation, complex instruction execution, task planning, and symbolic reasoning, and discusses current limitations and optimization strategies for domain‑specific deployment.

AI evaluation · Vertical AI · large language models

0 likes · 17 min read

Efficient Ops

Aug 8, 2023 · Artificial Intelligence

Rethinking Software Development in the Age of Large Language Models

The article examines fundamental challenges of applying large language models to software engineering—such as scale limits, lack of abstract reasoning, hidden tacit knowledge, and maintenance difficulties—and proposes practical recommendations for integrating AI with disciplined development practices.

AI integration · development automation · knowledge management

0 likes · 7 min read

Baidu Intelligent Cloud Tech Hub

Aug 8, 2023 · Artificial Intelligence

Unlocking LMOps: How Enterprises Can Master Large Model Operations

This article explains the evolution from traditional machine learning to the current large‑model era, introduces LMOps concepts and key technologies, compares them with MLOps, and showcases Baidu Cloud's Qianfan platform as a practical solution for building, deploying, and managing large language models in industry.

AI Operations · Baidu Cloud · LMOps

0 likes · 22 min read

DataFunTalk

Jul 27, 2023 · Artificial Intelligence

Applying AIGC in E‑commerce: Product Copy and Image Generation with Large Language Models

This article shares recent AIGC practices in e‑commerce, detailing product copy generation using GPT‑based models, image creation with Stable Diffusion, the evolution of large language models, technical solutions, experimental results, and future opportunities for AI‑driven automation in online retail.

AIGC · e-commerce · image generation

0 likes · 18 min read

Baidu Geek Talk

Jul 26, 2023 · Artificial Intelligence

Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect

Baidu’s chief architect Li Shuanglong outlined how AIGC, driven by advanced large‑language and multimodal models, is already powering commercial tools such as automated copywriting, 2D digital‑human video creation and lead‑generation chatbots, while emphasizing future progress in engineering scalability, algorithmic fidelity, data quality, and scenario‑focused applications.

AI commercialization · AI research · AIGC

0 likes · 8 min read

Rare Earth Juejin Tech Community

Jul 24, 2023 · Artificial Intelligence

Comprehensive Survey of Large Language Models: History, Key Technologies, Resources, and Future Directions

This article provides a detailed overview of large language models (LLMs), tracing their evolution from statistical and neural language models to modern pre‑trained transformers, discussing scaling, training, adaptation, utilization, evaluation methods, available resources, and outlining current challenges and future research directions.

Model Scaling · Pre‑training · evaluation

0 likes · 26 min read

Alibaba Cloud Developer

Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Artificial Intelligence · Few-shot · LLM

0 likes · 33 min read

Baidu Intelligent Cloud Tech Hub

Jul 17, 2023 · Artificial Intelligence

How Vector Retrieval Powers Large Language Models: Techniques and Practices

This article explains the fundamentals of vector retrieval, its role in enhancing large language models through embedding and prompt engineering, and details the algorithms, system architecture, and Baidu's engineering practices for building high‑performance vector databases.

AI · Embedding · Vector Database

0 likes · 14 min read

ZhongAn Tech Team

Jul 14, 2023 · Artificial Intelligence

Exploring AIGC Applications in Insurance: Insights from ZhongAn Insurance CTO Jiang Jiyun

The interview with ZhongAn Insurance CTO Jiang Jiyun discusses how the company leverages AIGC technologies such as large language models, embeddings, and prompt engineering to enhance marketing, intelligent customer service, and data security, while highlighting practical challenges and best practices for AI adoption in the insurance sector.

AIGC · Data Security · Embedding

0 likes · 15 min read

21CTO

Jul 8, 2023 · Artificial Intelligence

Unlocking LangChain: Build End-to-End LLM Apps with Chains, Agents, and Memory

This article introduces LangChain—a modular framework for constructing large‑language‑model applications—covering its core components, asynchronous support, prompt engineering, memory handling, chain and agent workflows, token considerations, embedding techniques, and a step‑by‑step Python example that culminates in a Gradio‑based conversational chatbot.

AI development · Embedding · LangChain

0 likes · 20 min read

DeWu Technology

Jul 5, 2023 · Artificial Intelligence

Fine-tuning Large Language Models with LoRA/QLoRA and Deploying via GPTQ Quantization on KubeAI

The article explains how LoRA and its 4‑bit QLoRA extension dramatically reduce trainable parameters and GPU memory for fine‑tuning large language models, while GPTQ post‑training quantization compresses weights for cheap inference, and shows how KubeAI integrates these techniques into a one‑click workflow for 7 B, 13 B, and 33 B models from data upload to API deployment.

GPTQ · KubeAI · LoRA

0 likes · 13 min read

Network Intelligence Research Center (NIRC)

Jul 1, 2023 · Artificial Intelligence

Prompting Large Language Models for Knowledge‑Based Visual Question Answering: The Prophet Framework

This article analyzes the Prophet framework, which leverages a traditional VQA model to generate answer candidates and in‑context examples that prompt GPT‑3, achieving state‑of‑the‑art performance on the challenging OK‑VQA and A‑OKVQA benchmarks.

GPT-3 · MCAN · Prophet

0 likes · 9 min read

DataFunSummit

Jun 30, 2023 · Artificial Intelligence

Roundtable on Large‑Model‑Based Recommendation Systems: Opportunities, Challenges, and Future Directions

In this expert roundtable, leading researchers and engineers discuss the current state of recommendation systems, how large language models can reshape the field, the technical and practical challenges involved, and practical advice for practitioners looking to adopt AI‑driven personalization solutions.

AI · Industry Insights · Recommendation Systems

0 likes · 36 min read

DataFunSummit

Jun 28, 2023 · Artificial Intelligence

OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights

This article presents OPPO Research Institute's recent advances in large‑model AI, detailing the CHAOS pretrained model that topped the CLUE leaderboard, the knowledge‑enhanced training pipeline, and the GammaE model for multi‑hop reasoning over knowledge graphs, together with experimental results and practical training tips.

AI research · GammaE · chaos

0 likes · 20 min read

政采云技术

Jun 28, 2023 · Artificial Intelligence

An Overview of ChatGPT: Architecture, Training Process, Advantages, Risks, and Practical Team Deployment

This article explains what GPT is, how it is trained, its strengths and limitations, the various risks it poses, and provides practical guidance on safely adopting large language models like ChatGPT within development teams, including code‑level analysis examples.

AI risks · ChatGPT · Team Adoption

0 likes · 13 min read

Programmer DD

Jun 20, 2023 · Artificial Intelligence

Yann LeCun: Today's AI Still Below Dog Level – Inside Meta’s Voicebox, MusicGen & I‑JEPA

Meta’s chief AI scientist Yann LeCun warned that current large language models still fall short of human and even dog intelligence, citing their lack of real‑world understanding, while Meta unveiled three new generative AI models—Voicebox for speech, MusicGen for music, and I‑JEPA for image reasoning—showcasing both progress and remaining limitations.

Artificial Intelligence · computer vision · generative AI

0 likes · 7 min read

DataFunTalk

Jun 20, 2023 · Artificial Intelligence

How Recommendation Systems Work and Their Integration with ChatGPT

This article explains the fundamentals of recommendation systems, their digital representation, how ChatGPT and large language models are applied to enhance recommendation performance, and highlights emerging trends such as conversational recommendation and a recommended book on the subject.

AI · ChatGPT · Conversational AI

0 likes · 8 min read

DataFunSummit

Jun 14, 2023 · Artificial Intelligence

DataFun Summit 2023: Large Language Models and AIGC Conference

DataFun will host the DataFun Summit 2023 on June 17‑18, featuring three chairs and eight presenters who will discuss core topics such as large language model research, multimodal generation, reinforcement learning, tool learning, distributed training, and industry applications, with free registration via QR code.

AI Conference · AIGC · Machine Learning

0 likes · 42 min read

Rare Earth Juejin Tech Community

Jun 14, 2023 · Artificial Intelligence

ChatGPT Practice Applications and Large Model Technology Insights from the Juejin Offline Salon

The article recaps a Beijing offline salon where experts and open‑source contributors discussed ChatGPT desktop applications, the development and deployment of ChatGPT‑Next‑Web, large‑language‑model challenges, the VisualGLM multimodal model, and product design considerations, providing technical insights and community perspectives on AI advancements.

AI · ChatGPT · Open Source

0 likes · 9 min read
