Tagged articles
674 articles
Page 6 of 7
JD Tech Talk
JD Tech Talk
Aug 19, 2024 · Artificial Intelligence

AI‑Driven Automated Question Generation for Aviation Maintenance Training

The article describes how JD Aviation’s maintenance department uses a vector‑based knowledge base and large‑language‑model services to automatically generate, evaluate, and maintain training exam questions, addressing the rapid growth of manuals, frequent updates, and the heavy manual workload of traditional test creation.

AIKnowledge BaseLarge Language Model
0 likes · 12 min read
AI‑Driven Automated Question Generation for Aviation Maintenance Training
21CTO
21CTO
Aug 17, 2024 · Artificial Intelligence

Understanding Large Language Models: Training, Uses, and a Llama 3 Code Demo

This article explains what large language models (LLMs) are, how they are trained, their diverse applications across industries, the challenges they face, and provides a practical Python example using Replicate to run Meta's Llama 3‑70b‑instruct model.

AILLMLarge Language Model
0 likes · 11 min read
Understanding Large Language Models: Training, Uses, and a Llama 3 Code Demo
Meituan Technology Team
Meituan Technology Team
Aug 8, 2024 · Artificial Intelligence

BlackPearl Team Wins All Three Tracks of KDD 2024 OAG‑Challenge Cup with Large‑Model Solutions

The BlackPearl team from Meituan’s Dazhong Dianping division swept all three KDD 2024 OAG‑Challenge Cup tracks—WhoIsWho, PST, and AQA—by deploying innovative large‑model techniques such as iterative text clustering, graft‑learning‑enhanced BERT RAG pipelines, and a Boosting LLM‑for‑Vector search, and have released the code publicly on GitHub.

Academic DisambiguationKDD CupLarge Language Model
0 likes · 4 min read
BlackPearl Team Wins All Three Tracks of KDD 2024 OAG‑Challenge Cup with Large‑Model Solutions
58 Tech
58 Tech
Aug 7, 2024 · Artificial Intelligence

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

In this article, 58.com AI Lab senior director Zhan Kunlin explains how the company built a multi‑layer AI platform, created a vertical large‑language model called LingXi, and developed an AI Agent system with RAG capabilities to accelerate practical AI applications across various business scenarios.

AI PlatformAI agentsLarge Language Model
0 likes · 10 min read
Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions
NewBeeNLP
NewBeeNLP
Aug 5, 2024 · Industry Insights

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

This article details Alibaba Cloud's end‑to‑end architecture for search and advertising recommendation, covering the data platform, AI services, feature‑store design, training and inference optimizations, and the integration of large language models for new recommendation scenarios.

AI PlatformAlibaba CloudBig Data
0 likes · 17 min read
How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs
Java Tech Enthusiast
Java Tech Enthusiast
Aug 1, 2024 · Artificial Intelligence

Apple Intelligence: Inside the New Apple Foundation Model

Apple Intelligence, an on‑device AI suite debuting with iOS 18.1 beta, centers on the Apple Foundation Model—a 3‑billion‑parameter on‑device LLM (and a larger undisclosed cloud version) trained on TPUs with novel RL algorithms and mixed‑precision quantization, delivering Siri, writing assistance, photo search, and benchmark performance that surpasses GPT‑4, though currently limited to paid developers.

AIApple IntelligenceLarge Language Model
0 likes · 11 min read
Apple Intelligence: Inside the New Apple Foundation Model
DataFunTalk
DataFunTalk
Aug 1, 2024 · Artificial Intelligence

Ant Group's Time Series AI Practices: AntFlux Engine and Real‑World Applications

This article presents Ant Group's comprehensive time‑series AI solutions, detailing the AntFlux platform, the evolution from statistical to deep and large‑scale models—including Time‑LLM, iTransformer, and SLOTH—and illustrating how these technologies empower business insight, forecasting, decision‑making, and green computing across diverse scenarios.

AntFluxLarge Language ModelTime-series
0 likes · 17 min read
Ant Group's Time Series AI Practices: AntFlux Engine and Real‑World Applications
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 31, 2024 · Artificial Intelligence

What Makes Mistral’s 7B, Mixtral, and Large 2 Models Stand Out? A Deep Technical Dive

This article compiles key technical details of the Mistral model family—including Mistral 7B, Mixtral 8×7B, Mixtral 8×22B, Mistral Nemo, and Mistral Large 2—covering their architectural innovations such as sliding‑window attention, grouped‑query attention, mixture‑of‑experts design, scaling parameters, performance benchmarks, quantization requirements, and practical deployment commands.

Grouped Query AttentionLarge Language ModelMistral
0 likes · 17 min read
What Makes Mistral’s 7B, Mixtral, and Large 2 Models Stand Out? A Deep Technical Dive
DataFunTalk
DataFunTalk
Jul 26, 2024 · Artificial Intelligence

Llama 3: Open‑source Large Language Model Technical Report and Evaluation

This comprehensive technical report details the development, architecture, training methodology, extensive benchmark evaluations, safety measures, and inference optimizations of Meta's open‑source Llama 3 large language model series, covering models up to 405 billion parameters and supporting multilingual, multimodal, and tool‑use capabilities.

AILLaMALarge Language Model
0 likes · 115 min read
Llama 3: Open‑source Large Language Model Technical Report and Evaluation
Data Thinking Notes
Data Thinking Notes
Jul 25, 2024 · Information Security

How Large Language Models Transform Data Security Compliance Management

This article explains how a leading insurance technology group leverages large language models to streamline data security compliance, detailing the evolution of data management, key governance challenges, multimodal AI architecture, and practical workflows for policy enforcement, risk monitoring, and asset management.

AIComplianceData Security
0 likes · 10 min read
How Large Language Models Transform Data Security Compliance Management
NewBeeNLP
NewBeeNLP
Jul 25, 2024 · Artificial Intelligence

Llama 3.1 Unveiled: How the New Open‑Source Giant Matches GPT‑4o and Claude 3.5

Meta has officially released Llama 3.1, a 405‑billion‑parameter open‑source model that matches or surpasses GPT‑4o and Claude 3.5 on over 150 benchmarks, expands context to 128 K tokens, supports eight languages, and is accompanied by a detailed 100‑page paper describing its data, training stack, architecture, quantization, safety measures, and ecosystem support.

AI safetyLarge Language ModelLlama 3.1
0 likes · 15 min read
Llama 3.1 Unveiled: How the New Open‑Source Giant Matches GPT‑4o and Claude 3.5
Kuaishou Tech
Kuaishou Tech
Jul 17, 2024 · Artificial Intelligence

Key Technical Innovations in Kuaishou’s “Kuaiyi” Large Model and Its Real-World Applications

The article details Kuaishou’s development of the 175B “Kuaiyi” multimodal large model, presenting eight novel technical innovations—from Temporal Scaling Law and MiLe Loss to MoE‑enhanced reward modeling—and describes how these advances enable high‑performance AI services such as the AI Xiao Kuai chatbot across diverse real‑world scenarios.

AI applicationsLarge Language ModelScaling Law
0 likes · 12 min read
Key Technical Innovations in Kuaishou’s “Kuaiyi” Large Model and Its Real-World Applications
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 17, 2024 · Artificial Intelligence

How Alibaba Cloud Built Service‑Domain AI Agents: Design, Practice, and Results

This article explains how Alibaba Cloud designed and deployed large‑language‑model agents for its service domain, covering background, ideal LLM deployment, the shift from explanation to problem solving, the agent framework, practical implementation, automation trade‑offs, training, evaluation, and real‑world impact.

AI agentAlibaba CloudCustomer Service Automation
0 likes · 20 min read
How Alibaba Cloud Built Service‑Domain AI Agents: Design, Practice, and Results
DataFunSummit
DataFunSummit
Jul 16, 2024 · Artificial Intelligence

Knowledge Graph Construction, Reasoning, and QA for Intelligent Hypertension Diagnosis

This article presents a comprehensive exploration of knowledge‑graph‑based modeling, neural‑symbolic multi‑hop reasoning, and large‑model‑driven question answering applied to precise medication decision‑making in hypertension, detailing system architecture, experimental evaluations, real‑world deployments, and future research directions.

Large Language ModelReasoninghypertension
0 likes · 26 min read
Knowledge Graph Construction, Reasoning, and QA for Intelligent Hypertension Diagnosis
NewBeeNLP
NewBeeNLP
Jul 16, 2024 · Artificial Intelligence

Can Item Language Models Bridge LLMs and Collaborative Filtering for Conversational Recommendation?

This paper identifies three challenges of applying large language models to recommendation systems and proposes an Item Language Model that combines an item encoder with a frozen LLM, demonstrating through extensive experiments that language‑item alignment and interaction knowledge significantly improve conversational recommendation performance.

Large Language ModelQ-Formercollaborative filtering
0 likes · 10 min read
Can Item Language Models Bridge LLMs and Collaborative Filtering for Conversational Recommendation?
Baidu Geek Talk
Baidu Geek Talk
Jul 15, 2024 · Industry Insights

How AI Is Revolutionizing Physical Network Fault Localization

This article explains how Baidu Cloud evolved from manual and integrated network fault detection to AI-driven localization using large language models, detailing structured prompting, multi‑agent workflows, and real‑world comparisons that demonstrate improved accuracy and faster mitigation.

AIFault LocalizationInfrastructure
0 likes · 14 min read
How AI Is Revolutionizing Physical Network Fault Localization
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 10, 2024 · Artificial Intelligence

How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision

This article explains how Baidu Cloud evolved its physical network fault‑location workflow—from manual analysis and integrated multi‑signal algorithms to AI‑driven reasoning with large language models—highlighting structured prompting, multi‑agent collaboration, and measurable improvements in accuracy and automation.

AIFault LocalizationLarge Language Model
0 likes · 15 min read
How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision
Baidu Tech Salon
Baidu Tech Salon
Jul 9, 2024 · Artificial Intelligence

AI-Powered Job Matching Application Using ERNIE SDK

The AI‑powered job‑matching application built with Baidu’s ERNIE SDK, created by PaddlePaddle expert Gao Fuzhi, intelligently parses a candidate’s resume, matches them to suitable positions, supplies detailed salary, location and benefit data, analyzes job requirements, and offers personalized skill and interview guidance, aiming to improve recruitment efficiency for both seekers and employers.

AIERNIE SDKLarge Language Model
0 likes · 8 min read
AI-Powered Job Matching Application Using ERNIE SDK
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 6, 2024 · Artificial Intelligence

ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4

This article provides a comprehensive technical overview of the ChatGLM series—from the original ChatGLM‑6B model and its GLM‑based pre‑training framework to the enhancements in ChatGLM‑2, the architectural parity of ChatGLM‑3, and the advanced capabilities of the latest ChatGLM‑4, covering model structure, position encoding, attention mechanisms, multi‑task pretraining, and tool integration.

AIChatGLMGLM
0 likes · 25 min read
ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4
ByteDance SYS Tech
ByteDance SYS Tech
Jun 30, 2024 · Operations

How Large‑Model AI Is Transforming Intelligent Operations (AIOps)

This article explores the latest concepts, planning roadmap, and practical applications of large‑model AI in intelligent operations, detailing AIOps use cases, system‑level automation, multi‑agent architectures, and how a dedicated platform accelerates deployment and efficiency across data‑center environments.

AI agentsIntelligent OperationsLarge Language Model
0 likes · 18 min read
How Large‑Model AI Is Transforming Intelligent Operations (AIOps)
JD Cloud Developers
JD Cloud Developers
Jun 25, 2024 · Artificial Intelligence

Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics

This article explains the fundamental architecture of large language models, from the dual file nature of parameters and code, through neural network basics, perceptrons, and weight training, to the Transformer’s tokenization, positional encoding, self‑attention, and inference processes, illustrated with diagrams and examples.

Large Language ModelNeural NetworkSelf-Attention
0 likes · 22 min read
Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics
JD Tech Talk
JD Tech Talk
Jun 21, 2024 · Artificial Intelligence

Multilingual Support System Using Large Language Models: Architecture, Workflow, and Implementation Plan

This document outlines a comprehensive plan to enhance international logistics systems with real‑time multilingual support using large language models, detailing goals, architecture, automated translation, user‑driven term management, approval workflows, cloud deployment, and expected efficiency and quality improvements.

Large Language Modelmultilingualterm management
0 likes · 14 min read
Multilingual Support System Using Large Language Models: Architecture, Workflow, and Implementation Plan
Architecture Digest
Architecture Digest
Jun 21, 2024 · Artificial Intelligence

Getting Started with Spring Cloud Alibaba AI: Integrating Tongyi Large Models in Spring Boot

This article introduces Spring Cloud Alibaba AI, explains its relationship to Spring AI, and provides a step‑by‑step tutorial—including Maven setup, dependency configuration, code examples, and sample calls—to integrate Alibaba's Tongyi large‑model services for text QA, image generation, and speech synthesis in a Java Spring Boot application.

AI integrationAlibaba CloudJava
0 likes · 11 min read
Getting Started with Spring Cloud Alibaba AI: Integrating Tongyi Large Models in Spring Boot
AntTech
AntTech
Jun 20, 2024 · Artificial Intelligence

Predicting Football Match Outcomes with Graph Neural Networks and Large Language Models: The “Smart Guess Football” Project

During the 2024 European Championship, TuGraph engineers built an interactive system called “Smart Guess Football” that combines graph computing, graph neural networks, transformers and large language models to model player relationships and predict match outcomes, achieving up to 71% accuracy on limited test matches.

AIGraph Neural NetworkLarge Language Model
0 likes · 7 min read
Predicting Football Match Outcomes with Graph Neural Networks and Large Language Models: The “Smart Guess Football” Project
NewBeeNLP
NewBeeNLP
Jun 18, 2024 · Artificial Intelligence

How Shopee Builds an E‑Commerce Knowledge Graph and Leverages Large Models

This article presents Shopee's comprehensive approach to constructing an e‑commerce knowledge graph, detailing the challenges of heterogeneous data, multi‑language handling, entity disambiguation, and the integration of deep learning and large language models to improve product matching, recommendation, and operational efficiency.

AILarge Language ModelMultimodal
0 likes · 22 min read
How Shopee Builds an E‑Commerce Knowledge Graph and Leverages Large Models
Bilibili Tech
Bilibili Tech
Jun 14, 2024 · Artificial Intelligence

Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments

The report presents the open‑source Index‑1.9B family—base, pure, chat, and character variants—detailing benchmark results, pre‑training optimizations such as a normalized LM‑Head and deeper‑slim architectures, the importance of modest instruction data, alignment via SFT/DPO, role‑play enhancements with RAG, and acknowledges remaining safety and factual limitations.

Instruction TuningLLMLarge Language Model
0 likes · 15 min read
Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments
JD Tech Talk
JD Tech Talk
Jun 6, 2024 · Artificial Intelligence

AI‑Powered Code Review Integrated into CI Pipelines for Faster, Higher‑Quality Development

This article analyses the drawbacks of manual code review, explains why they arise, and presents a practical solution that embeds a large‑language‑model‑based AI reviewer into a CI/CD pipeline, detailing configuration steps, script examples, and the resulting efficiency and quality gains.

AI code reviewLarge Language ModelSoftware quality
0 likes · 8 min read
AI‑Powered Code Review Integrated into CI Pipelines for Faster, Higher‑Quality Development
JD Cloud Developers
JD Cloud Developers
Jun 6, 2024 · Artificial Intelligence

Boost Code Review Efficiency with AI-Powered CI Integration

This guide explains how embedding a large‑language‑model AI into a CI pipeline can automate code reviews, cut review time, improve consistency and accuracy, and ultimately raise development efficiency and code quality while reducing manual effort and communication overhead.

AICode ReviewJava
0 likes · 9 min read
Boost Code Review Efficiency with AI-Powered CI Integration
Baidu Tech Salon
Baidu Tech Salon
May 30, 2024 · Artificial Intelligence

How AI Code Assistant Baidu Comate Boosted Medical Imaging Processing by 9×

A graduate student’s lab cut the time to process 150 GB of medical imaging data from one week for three people to two days for one person by using Baidu Comate’s AI‑driven code generation, annotation, and private‑knowledge enhancement features, achieving over nine‑fold productivity gains.

AI code assistantBaidu ComateLarge Language Model
0 likes · 8 min read
How AI Code Assistant Baidu Comate Boosted Medical Imaging Processing by 9×
Baidu Geek Talk
Baidu Geek Talk
May 29, 2024 · Artificial Intelligence

How Baidu’s AI Code Assistant Boosted R&D Efficiency by Over 11% in Marketing Platforms

The article analyzes how Baidu's marketing service team leveraged the Wenxin large model and the Baidu Comate AI code assistant to accelerate product reconstruction, achieve AI‑native development, and quantify a daily engineering productivity gain of roughly 11.2% through reduced coding time and automated deployment workflows.

AI code assistantAI-native developmentBaidu Comate
0 likes · 13 min read
How Baidu’s AI Code Assistant Boosted R&D Efficiency by Over 11% in Marketing Platforms
21CTO
21CTO
May 28, 2024 · Artificial Intelligence

When Google’s AI Overview Hallucinates: Surprising Misanswers and What They Reveal

Google’s AI Overview, unveiled at I/O 2024, replaces traditional search results with AI‑generated summaries, but real‑world usage shows bizarre hallucinations—from claiming the internet is 100% true to recommending eating stones—highlighting the lingering challenges of large language models.

AI HallucinationAI OverviewGoogle AI
0 likes · 7 min read
When Google’s AI Overview Hallucinates: Surprising Misanswers and What They Reveal
JD Retail Technology
JD Retail Technology
May 27, 2024 · Artificial Intelligence

Automating Test Case Generation with Large Language Models and LangChain

This article describes how large language models and the LangChain framework can be combined with PDF parsing, text chunking, memory management, and a vector database to automatically generate software test cases, achieving significant efficiency gains while outlining implementation details, results, and future challenges.

AILangChainLarge Language Model
0 likes · 10 min read
Automating Test Case Generation with Large Language Models and LangChain
21CTO
21CTO
May 23, 2024 · Artificial Intelligence

How xAI’s Grok 1.5V Adds Multimodal Image Input for Developers

xAI’s Grok 1.5V is set to support multimodal image input, allowing developers to upload pictures and receive text‑based answers via the Python SDK, marking a major upgrade that narrows the gap with leading models like GPT‑4 and signals a new frontier for AI chatbots.

AI chatbotsGrokLarge Language Model
0 likes · 4 min read
How xAI’s Grok 1.5V Adds Multimodal Image Input for Developers
Baidu Tech Salon
Baidu Tech Salon
May 22, 2024 · Industry Insights

How Baidu’s AI‑Powered Code Assistant Boosts R&D Efficiency by Over 11 %

The article examines Baidu Marketing Service's AI‑native transformation using the Wenxin large model and Baidu Comate, detailing how real‑time code recommendations, open‑platform integration, and generative AI dramatically improve developer productivity, reduce coding time, and increase marketing ROI.

AIAI-native developmentBaidu Comate
0 likes · 11 min read
How Baidu’s AI‑Powered Code Assistant Boosts R&D Efficiency by Over 11 %
360 Tech Engineering
360 Tech Engineering
May 17, 2024 · Artificial Intelligence

360VL: An Open‑Source Multimodal Large Language Model Based on Llama‑3‑70B

The article introduces 360VL, an open‑source multimodal large language model built on Llama‑3‑70B, describes its novel C‑abs bridge architecture for high‑resolution visual understanding, outlines the two‑stage training with bilingual data, and presents benchmark results showing superior performance over prior LMMs.

AI researchLarge Language ModelLlama3
0 likes · 8 min read
360VL: An Open‑Source Multimodal Large Language Model Based on Llama‑3‑70B
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 15, 2024 · Artificial Intelligence

OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users

OpenAI introduced GPT‑4o, a free, omni‑capable multimodal model that processes text, audio, and images together, delivers near‑human response latency, showcases impressive live demos, and will soon be available via a discounted API, marking a significant step forward in end‑to‑end AI research.

AI researchGPT-4oLarge Language Model
0 likes · 7 min read
OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users
CSS Magic
CSS Magic
May 13, 2024 · Artificial Intelligence

DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices

The article evaluates DeepSeek v2, a 100‑billion‑parameter MoE model, highlighting its near‑GPT‑4 benchmark performance, OpenAI‑compatible API, 32k‑token context, exceptionally low pricing, a custom token‑utilization metric, and the practical drawbacks observed during hands‑on testing.

API compatibilityDeepSeekLarge Language Model
0 likes · 9 min read
DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices
Baobao Algorithm Notes
Baobao Algorithm Notes
May 9, 2024 · Artificial Intelligence

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

This article provides an in‑depth technical analysis of Deepseek‑V2, covering its 236B parameter size, Multi‑Head Latent Attention optimization that reduces KV‑cache memory, architectural details, training pipelines, infrastructure choices, and performance results on benchmarks such as MMLU and instruction following.

AI ArchitectureDeepSeekLarge Language Model
0 likes · 17 min read
Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance
Baidu Tech Salon
Baidu Tech Salon
May 8, 2024 · Artificial Intelligence

Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform

Sugar BI, Baidu’s AI‑driven next‑generation business intelligence platform, evolves from the 2016 ShowX system into a zero‑code, multi‑source analytics suite that integrates over 30 data connectors, advanced semantic modeling, and the Wenxin‑powered Sugar Bot, which transforms natural‑language queries into optimized visualizations via intelligent chart recommendation, positioning it as a leading AI‑augmented BI solution.

AIData visualizationIntelligent Chart Recommendation
0 likes · 19 min read
Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform
Baidu Geek Talk
Baidu Geek Talk
May 8, 2024 · Artificial Intelligence

Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform

Sugar BI, evolving from the internal ShowX platform to versions 2.0‑4.0, now offers a zero‑code, drag‑and‑drop visual editor, support for over 30 data sources, AI‑powered automatic analysis and the Sugar Bot Q&A module that transforms multi‑day data tasks into minutes, delivering containerized SaaS BI with intelligent chart recommendation and rapid, code‑free decision‑making for enterprises.

AIAnalyticsBI
0 likes · 19 min read
Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform
Baobao Algorithm Notes
Baobao Algorithm Notes
May 6, 2024 · Artificial Intelligence

DeepSeek-V2: 236B MoE LLM Delivers Higher Performance While Cutting Training Cost by 42%

DeepSeek‑V2 is a 236‑billion‑parameter mixture‑of‑experts language model that reduces training cost by 42.5 %, cuts KV‑cache usage by 93.3 %, and boosts generation throughput 5.76×, while achieving state‑of‑the‑art scores on benchmarks such as MMLU, C‑Eval, BBH, HumanEval, and GSM8K for both base and chat variants.

AIDeepSeek-V2Large Language Model
0 likes · 11 min read
DeepSeek-V2: 236B MoE LLM Delivers Higher Performance While Cutting Training Cost by 42%
IT Services Circle
IT Services Circle
May 1, 2024 · Artificial Intelligence

Summary of Andrew Ng’s AI Agent Talk: Models, Workflows, and Design Patterns

The article summarizes Andrew Ng’s presentation on AI agents, contrasting traditional single‑prompt large‑model usage with iterative agent‑based workflows, reporting experimental accuracy gains, and outlining four agent design patterns—reflection, tool use, planning, and multi‑agent collaboration—while discussing practical trade‑offs such as latency and token speed.

AI agentDesign PatternsLarge Language Model
0 likes · 7 min read
Summary of Andrew Ng’s AI Agent Talk: Models, Workflows, and Design Patterns
Baidu Geek Talk
Baidu Geek Talk
Apr 22, 2024 · Artificial Intelligence

Designing Effective Prompts for Large Language Models: Structure, Code Examples, and Regex Extraction

The article presents a systematic prompt template—comprising Instruction, Input Data, Context, and Output Indicator—demonstrates code examples for single‑ and multi‑task formatting, shows how clear markers enable regex extraction, and introduces Baidu’s PaddlePaddle Star River Community to simplify building reliable LLM‑driven applications.

AICode ExampleLarge Language Model
0 likes · 13 min read
Designing Effective Prompts for Large Language Models: Structure, Code Examples, and Regex Extraction
DataFunTalk
DataFunTalk
Apr 21, 2024 · Artificial Intelligence

Guidelines for Building Domain-Specific Large Models: Dataset Construction, Training Methods, Evaluation, and Hardware Benchmarking

This article presents a comprehensive guide on constructing domain-specific large language models, covering the differences from general models, how to build high‑quality domain datasets, selecting appropriate training methods, designing validation sets, evaluating model capabilities, and benchmarking domestic hardware performance.

AIDataset ConstructionLarge Language Model
0 likes · 20 min read
Guidelines for Building Domain-Specific Large Models: Dataset Construction, Training Methods, Evaluation, and Hardware Benchmarking
21CTO
21CTO
Apr 20, 2024 · Artificial Intelligence

What Developers Need to Know About Meta’s New Open‑Source Llama 3 Model

Meta’s newly open‑source Llama 3 model pushes the frontier of large language models with a larger context window, Mixture‑of‑Experts architecture, multilingual support, and multimodal capabilities, while facing challenges in transparency, bias, and computational resources, and offering diverse applications from NLU to code generation.

AILarge Language ModelLlama3
0 likes · 10 min read
What Developers Need to Know About Meta’s New Open‑Source Llama 3 Model
New Oriental Technology
New Oriental Technology
Apr 19, 2024 · Artificial Intelligence

Effective Prompt Engineering for Large Language Models

This article explains how large language models work, why well‑crafted prompts are essential, and presents practical strategies—such as clarity, conciseness, focus, role‑setting, delimiters, few‑shot examples, and step‑by‑step instructions—to help users obtain accurate and relevant responses from AI systems.

AILLM strategiesLarge Language Model
0 likes · 12 min read
Effective Prompt Engineering for Large Language Models
AntTech
AntTech
Apr 19, 2024 · Artificial Intelligence

AgentUniverse: An Enterprise‑Grade Multi‑Agent Framework for Complex Financial Analysis

The article introduces AgentUniverse, a large‑model multi‑agent framework that orchestrates specialized agents through a PEER collaboration pattern to overcome LLM limitations in complex financial tasks, demonstrates its architecture, workflow, experimental superiority on benchmarks, and provides open‑source installation details.

AIAgent FrameworkFinancial Analysis
0 likes · 10 min read
AgentUniverse: An Enterprise‑Grade Multi‑Agent Framework for Complex Financial Analysis
NewBeeNLP
NewBeeNLP
Apr 19, 2024 · Artificial Intelligence

Llama 3 Unveiled: 8B & 70B Models Set New SOTA Across Benchmarks

Meta announced the open‑source Llama 3 series (8B and 70B parameters), detailing its decoder‑only Transformer architecture, 15 T‑token multilingual training data, superior benchmark scores over competitors, a limited 8K context window, and upcoming cloud and web‑based deployments.

Large Language ModelLlama 3Meta AI
0 likes · 7 min read
Llama 3 Unveiled: 8B & 70B Models Set New SOTA Across Benchmarks
DataFunSummit
DataFunSummit
Apr 16, 2024 · Artificial Intelligence

Intelligent Risk Control: Definitions, Expert Systems, Algorithmic Systems, and Emerging AI Techniques

This article explains intelligent risk control as a synergy of expert experience and algorithmic decision‑making, outlines its definition, expert human systems, digital algorithmic systems, and explores advanced AI methods such as reinforcement learning, large language models with knowledge graphs, adversarial learning, graph neural networks, and a practical supply‑chain case study.

Graph Neural NetworkLarge Language Modeladversarial learning
0 likes · 11 min read
Intelligent Risk Control: Definitions, Expert Systems, Algorithmic Systems, and Emerging AI Techniques
CSS Magic
CSS Magic
Apr 12, 2024 · Artificial Intelligence

Answering Common Kimi API Questions and Exploring AI App Development

This article addresses frequent Kimi API queries, explains the API's purpose, available endpoints, model specifications, token‑based pricing, differences from the web assistant, response variability, JSON output workarounds, and shares upcoming roadmap items for developers building AI applications.

AI developmentChat CompletionJSON output
0 likes · 10 min read
Answering Common Kimi API Questions and Exploring AI App Development
21CTO
21CTO
Apr 11, 2024 · Artificial Intelligence

Google Unveils CodeGemma: New AI Models for Code Generation & Reasoning

Google has introduced the CodeGemma series, expanding its Gemma AI models with new variants optimized for code generation and reasoning, featuring 2B‑7B parameter models trained on 500 billion tokens, delivering full‑code block generation, strong benchmark results, and availability on Kaggle, Hugging Face, and Vertex AI.

AIGoogleLarge Language Model
0 likes · 4 min read
Google Unveils CodeGemma: New AI Models for Code Generation & Reasoning
DataFunSummit
DataFunSummit
Apr 10, 2024 · Artificial Intelligence

Large Language Model Inference Overview and Performance Optimizations

This article presents a comprehensive overview of large language model inference, describing the prefill and decoding stages, key performance metrics such as throughput, latency and QPS, and detailing a series of system-level optimizations—including pipeline parallelism, dynamic batching, KV‑cache quantization, and hardware considerations—to significantly improve inference efficiency on modern GPUs.

GPULarge Language ModelLatency
0 likes · 23 min read
Large Language Model Inference Overview and Performance Optimizations
21CTO
21CTO
Apr 8, 2024 · Artificial Intelligence

How Naver’s HyperCLOVA X Advances Multilingual AI for Asian Languages

Naver’s newly unveiled HyperCLOVA X large‑language model, detailed in an arXiv technical report, claims superior cross‑lingual reasoning for Asian languages, especially Korean, by pre‑training on a data mix of Korean, multilingual text and code, achieving state‑of‑the‑art translation and multilingual capabilities.

AI researchHyperCLOVA XKorean NLP
0 likes · 4 min read
How Naver’s HyperCLOVA X Advances Multilingual AI for Asian Languages
21CTO
21CTO
Mar 29, 2024 · Artificial Intelligence

Why Databricks’ Open‑Source DBRX LLM Is Outpacing GPT‑3.5 and Llama 2

Databricks unveiled the open‑source DBRX large language model, which leverages a mixed‑expert architecture to deliver faster, more cost‑effective inference and beats leading open‑source and proprietary models like Llama 2, Mixtral‑8x7B, and GPT‑3.5 on multiple benchmarks.

AIDBRXDatabricks
0 likes · 7 min read
Why Databricks’ Open‑Source DBRX LLM Is Outpacing GPT‑3.5 and Llama 2
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 28, 2024 · Artificial Intelligence

How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters

Qwen1.5‑MoE‑A2.7B is a 2.7 billion‑parameter Mixture‑of‑Experts model that delivers performance comparable to leading 7 billion‑parameter LLMs while cutting training cost by 75% and boosting inference speed by 1.74×, and the article details its architecture, benchmarks, efficiency analysis, and deployment steps.

Large Language ModelMoEModel Benchmark
0 likes · 13 min read
How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Mar 22, 2024 · Artificial Intelligence

InternLM Model Fine-Tuning Tutorial with XTuner: Chat Format and Practical Implementation Guide

This tutorial walks through fine‑tuning Shanghai AI Lab’s open‑source InternLM models with XTuner, explaining chat‑format conventions, loading and inference (including multimodal InternLM‑XComposer), dataset preparation, configuration sections, DeepSpeed acceleration, and memory‑efficient QLoRA details for 7‑B‑parameter chat models.

Chat FormatDeepSpeedInternLM
0 likes · 22 min read
InternLM Model Fine-Tuning Tutorial with XTuner: Chat Format and Practical Implementation Guide
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Mar 20, 2024 · Artificial Intelligence

Elon Musk’s xAI Open‑Sources Grok‑1: A 314‑Billion‑Parameter MoE Large Language Model

Elon Musk’s xAI has open‑sourced Grok‑1, a 314‑billion‑parameter mixture‑of‑experts language model built with Rust and JAX, released under an Apache‑2.0 license, and the announcement includes detailed architecture specs, hardware requirements, and the broader context of Musk’s rivalry with OpenAI.

AIGrok-1Large Language Model
0 likes · 6 min read
Elon Musk’s xAI Open‑Sources Grok‑1: A 314‑Billion‑Parameter MoE Large Language Model
Open Source Tech Hub
Open Source Tech Hub
Mar 17, 2024 · Artificial Intelligence

What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon

Elon Musk announced the open‑source release of Grok, xAI’s new large‑language‑model chatbot, while recalling his lawsuit against OpenAI; the article explains Grok’s rapid development, links to the GitHub repository, summarizes the seminal “Grokking” research paper that describes a sudden generalization breakthrough in neural networks, and provides reference links.

AI researchGrokLarge Language Model
0 likes · 3 min read
What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon
CSS Magic
CSS Magic
Mar 13, 2024 · Artificial Intelligence

How Moonshot’s Kimi Model Beats Big‑Tech LLMs with 200k‑Token Context

The author tests Moonshot’s Kimi API, revealing its 200 k‑character context window, superior token‑to‑character ratio compared with GPT‑3.5 and Gemini, and performance that, while slower than GPT‑3.5 Turbo, rivals GPT‑4 Turbo, all while offering OpenAI‑compatible endpoints and free credit for developers.

API compatibilityKimiLarge Language Model
0 likes · 8 min read
How Moonshot’s Kimi Model Beats Big‑Tech LLMs with 200k‑Token Context
DataFunSummit
DataFunSummit
Mar 11, 2024 · Artificial Intelligence

The Synergy of Large Language Models and Knowledge Graphs: Current Status and Future Directions

This article examines how large language models enhance human‑machine interaction and can be combined with knowledge graphs to improve factual Q&A, task‑oriented services, and structured decision‑making, while highlighting ongoing challenges and the enduring role of knowledge graphs in structured domains.

AILarge Language Modeldialogue system
0 likes · 4 min read
The Synergy of Large Language Models and Knowledge Graphs: Current Status and Future Directions
DataFunTalk
DataFunTalk
Mar 7, 2024 · Artificial Intelligence

Integrating Large Language Models with Knowledge Graphs: Current Status and Future Directions

Large language models enhance human‑machine interaction and natural language understanding, but knowledge graphs remain essential for structured, low‑cost decision making, factual retrieval, and domains like finance; combining both can improve conversational systems, while ongoing challenges in knowledge graph construction persist, as highlighted for the upcoming DataFunSummit2024.

Conversational AILarge Language ModelStructured Data
0 likes · 5 min read
Integrating Large Language Models with Knowledge Graphs: Current Status and Future Directions
DevOps
DevOps
Mar 5, 2024 · Artificial Intelligence

Understanding GPT‑4, ChatGPT, and the Foundations of Large Language Models

This article explains the fundamentals of AI, machine learning, deep learning, and natural language processing, describes how Transformer architectures and attention mechanisms power large language models such as GPT‑4 and ChatGPT, and walks through tokenization, prediction, and practical development with Python.

Artificial IntelligenceChatGPTGPT-4
0 likes · 16 min read
Understanding GPT‑4, ChatGPT, and the Foundations of Large Language Models
21CTO
21CTO
Feb 27, 2024 · Artificial Intelligence

Mistral Large: The Open‑Source LLM Challenging GPT‑4 on Azure

Mistral AI, a Paris‑based startup, unveiled Mistral Large—an open‑source, multilingual LLM rivaling GPT‑4 with a 32k token context window, advanced code and math abilities, and native Azure AI integration, marking a major milestone in European AI development.

Azure AILarge Language ModelMistral AI
0 likes · 6 min read
Mistral Large: The Open‑Source LLM Challenging GPT‑4 on Azure
Architects' Tech Alliance
Architects' Tech Alliance
Feb 25, 2024 · Artificial Intelligence

How Sora Redefined Video Generation: Breakthroughs and Industry Impact

The article provides an in‑depth technical analysis of OpenAI's Sora, highlighting its 60‑second 1080p video generation capability, the novel patches‑vectorization and transformer training pipeline that leverages GPT‑generated prompts for multimodal alignment, and its potential to become a universal video‑generation base model that could reshape the AI industry.

AGILarge Language ModelSora
0 likes · 6 min read
How Sora Redefined Video Generation: Breakthroughs and Industry Impact
Programmer DD
Programmer DD
Feb 22, 2024 · Artificial Intelligence

Google Unveils Gemma: Open‑Source LLM Matching Gemini’s Power

Google has launched Gemma, an open‑source large language model available in 2B and 7B parameter versions, built on the same technology as Gemini, outperforming many existing models and capable of running on ordinary laptops, with a detailed technical report and quick‑start guide provided online.

AIGemmaGoogle
0 likes · 3 min read
Google Unveils Gemma: Open‑Source LLM Matching Gemini’s Power
DataFunSummit
DataFunSummit
Feb 21, 2024 · Artificial Intelligence

Applying Knowledge Graphs to E‑commerce AIGC: From Domain to General KG and Large Language Models

This article presents a comprehensive overview of how knowledge graphs are integrated into e‑commerce AIGC pipelines, covering domain‑specific and generic KG‑driven text generation, model architecture, controllable generation techniques, experimental results, and future directions for large language models in commercial settings.

AIAIGCLarge Language Model
0 likes · 23 min read
Applying Knowledge Graphs to E‑commerce AIGC: From Domain to General KG and Large Language Models
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 18, 2024 · Artificial Intelligence

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Overview and Technical Details

The article provides a comprehensive overview of Meta’s Llama 2 series, detailing model sizes, pre‑training data, architectural enhancements, supervised fine‑tuning, RLHF procedures, safety evaluations, reward‑model training, and iterative improvements, highlighting its open‑source release and comparative performance.

AI safetyLarge Language ModelLlama2
0 likes · 27 min read
Llama 2: Open Foundation and Fine‑Tuned Chat Models – Overview and Technical Details
DataFunSummit
DataFunSummit
Feb 12, 2024 · Artificial Intelligence

Ant Group's Time Series AI Practices and the AntFlux Intelligent Engine

This article presents Ant Group's comprehensive time‑series AI solutions, covering the business value of temporal data, the evolution of statistical and deep learning models, large‑scale time‑series platforms such as AntFlux, and real‑world applications ranging from financial forecasting to green computing.

AIAntFluxLarge Language Model
0 likes · 17 min read
Ant Group's Time Series AI Practices and the AntFlux Intelligent Engine
Baidu Geek Talk
Baidu Geek Talk
Feb 7, 2024 · Artificial Intelligence

Design and Implementation of a Knowledge-Base Intelligent Q&A System for Database Operations Using Large Models

The paper details Baidu Intelligent Cloud’s design and deployment of a domain‑specific knowledge‑base Q&A system for database operations, combining prompt‑engineered LLMs with hybrid vector‑search using LangChain, BES vector store, and custom ingestion, addressing recall, token limits, and hallucination challenges across dashboard and IM bot interfaces.

AIDatabase operationsKnowledge Base
0 likes · 16 min read
Design and Implementation of a Knowledge-Base Intelligent Q&A System for Database Operations Using Large Models
Architect
Architect
Jan 27, 2024 · Industry Insights

How We Built a Scalable Smart Customer Service System for an Activity Platform

This article details the end‑to‑end design, implementation, and operational results of a smart customer‑service platform that automates FAQ capture, leverages both Elasticsearch and LLM‑based models, and provides a low‑code, multi‑team backend for rapid issue resolution.

ElasticsearchLarge Language ModelMicroservices
0 likes · 13 min read
How We Built a Scalable Smart Customer Service System for an Activity Platform
Baidu Geek Talk
Baidu Geek Talk
Jan 24, 2024 · Artificial Intelligence

Building AI‑Native Applications with Baidu Cloud AppBuilder

Sun Ke’s keynote at the 2023 Baidu Cloud Intelligence Conference explains how AI‑native development has shifted from model selection to building practical applications, and introduces Baidu Cloud AppBuilder—a three‑layer, low‑code‑and‑code platform that provides multimodal, LLM, and infrastructure services, enabling rapid prototyping of solutions such as automated resume screening and interview preparation.

AIAppBuilderLarge Language Model
0 likes · 12 min read
Building AI‑Native Applications with Baidu Cloud AppBuilder
JD Tech
JD Tech
Jan 24, 2024 · Artificial Intelligence

JD Retail Technology 2023 Highlights: AI‑Driven Supply Chain, Large Language Models, Edge AI, Data Security, and 3D Modeling Innovations

In 2023 JD Retail’s technology team delivered a suite of AI‑powered innovations—including end‑to‑end inventory management, explainable AI for supply chain, privacy‑preserving advertising models, a ReAct‑SFT‑RAG large language model framework, edge AI inference, secure data‑safe‑house infrastructure, and high‑quality 3D modeling pipelines—demonstrating broad academic and industrial impact across multiple domains.

3D modelingAIGCArtificial Intelligence
0 likes · 19 min read
JD Retail Technology 2023 Highlights: AI‑Driven Supply Chain, Large Language Models, Edge AI, Data Security, and 3D Modeling Innovations
360 Quality & Efficiency
360 Quality & Efficiency
Jan 19, 2024 · Artificial Intelligence

Using Large Language Models to Rapidly Build Simple Frontend and Backend Test Tools

This article explains how to quickly create simple web‑based and backend test tools for internal use by leveraging a large language model to generate annotated HTML, CSS, JavaScript and minimal Flask code, outlining prompt design, tool requirements, and deployment tips to boost testing efficiency.

AI Code GenerationBackend DevelopmentLarge Language Model
0 likes · 8 min read
Using Large Language Models to Rapidly Build Simple Frontend and Backend Test Tools
DataFunTalk
DataFunTalk
Jan 16, 2024 · Artificial Intelligence

Applying Knowledge Graphs to E‑commerce AIGC: From Domain‑Specific to General Knowledge Graphs and LLM Integration

This article presents a comprehensive overview of how knowledge graphs are leveraged in e‑commerce AIGC pipelines, detailing domain‑specific and general graph‑based text generation, model architecture, controllable generation techniques, experimental results, and future directions for large language model integration.

AIGCLarge Language ModelText Generation
0 likes · 22 min read
Applying Knowledge Graphs to E‑commerce AIGC: From Domain‑Specific to General Knowledge Graphs and LLM Integration
DataFunSummit
DataFunSummit
Jan 10, 2024 · Artificial Intelligence

Baidu Commercial Multimodal Understanding and AIGC Innovation Practices

This article presents Baidu's commercial multimodal understanding and AIGC innovations, detailing rich‑media multimodal perception, a unified large‑scale representation framework, scenario‑specific fine‑tuning, and practical applications such as marketing copy, digital‑human video, and poster generation.

AIGCAdvertisingBaidu
0 likes · 12 min read
Baidu Commercial Multimodal Understanding and AIGC Innovation Practices
DataFunSummit
DataFunSummit
Jan 8, 2024 · Artificial Intelligence

Enterprise Knowledge Recommendation System at Alibaba: Architecture, Challenges, and Large Model Applications

This article presents Alibaba's enterprise knowledge recommendation system, detailing its role in digital transformation, the challenges of long‑document recommendation, the multi‑layer architecture spanning feature, engine, ranking, and functional layers, various recall strategies, progressive ranking models, and the integration and evaluation of large language models for improved recommendation performance.

AIAlibabaLarge Language Model
0 likes · 23 min read
Enterprise Knowledge Recommendation System at Alibaba: Architecture, Challenges, and Large Model Applications
Architecture & Thinking
Architecture & Thinking
Jan 8, 2024 · Artificial Intelligence

How Baidu Comate Supercharges Coding: A Practical AI Assistant Guide

This article introduces Baidu Comate, an AI-powered coding assistant built on the Wenxin model, explains how to install it, demonstrates its real-time code completion, comment generation, test creation, and optimization features across multiple languages and IDEs, and highlights its benefits for developers.

AI coding assistantGoLarge Language Model
0 likes · 10 min read
How Baidu Comate Supercharges Coding: A Practical AI Assistant Guide
21CTO
21CTO
Dec 31, 2023 · Artificial Intelligence

2023’s Leading Open-Source LLMs: LLaMA, Pythia, MPT, Falcon, BLOOM, Mistral

Since ChatGPT’s debut, interest in large language models has surged, prompting the AI community to explore open‑source alternatives such as LLaMA, Pythia, MPT, Falcon, BLOOM, and Mistral, which together illustrate the rapid diversification and growing competitiveness of open‑source LLMs in 2023.

2023AILarge Language Model
0 likes · 9 min read
2023’s Leading Open-Source LLMs: LLaMA, Pythia, MPT, Falcon, BLOOM, Mistral
DataFunTalk
DataFunTalk
Dec 29, 2023 · Artificial Intelligence

Enterprise Knowledge Assistant: Leveraging Vector Databases and Large Language Models

This article explores the emerging enterprise knowledge assistant paradigm in the era of large models, detailing traditional knowledge management challenges, solution architecture using vector databases and LLMs, core technologies such as ETL pipelines, reranking, secure fine‑tuning, and future prospects for intelligent enterprise applications.

LLM fine-tuningLarge Language ModelVector Database
0 likes · 11 min read
Enterprise Knowledge Assistant: Leveraging Vector Databases and Large Language Models
21CTO
21CTO
Dec 18, 2023 · Artificial Intelligence

Why Did Google’s Gemini‑Pro Claim to Be Baidu’s Model in Chinese Chats?

A recent test on Google Vertex AI showed Gemini‑Pro introducing itself as Baidu’s Wenxin model during Chinese conversations, sparking debate about model attribution, pricing, developer tools, and the broader competition among major AI platforms.

AI PlatformsGemini ProGoogle AI
0 likes · 5 min read
Why Did Google’s Gemini‑Pro Claim to Be Baidu’s Model in Chinese Chats?
CSS Magic
CSS Magic
Dec 15, 2023 · Artificial Intelligence

Google Gemini Free API Launch: A Deep Dive for Developers

Google has opened its Gemini Pro large‑language model via a completely free API with a 60‑calls‑per‑minute limit, offering an online playground, straightforward key registration, efficient token usage, and streaming output, while noting it remains a technical preview rather than a consumer‑ready service.

AIAPI UsageFree API
0 likes · 3 min read
Google Gemini Free API Launch: A Deep Dive for Developers
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 9, 2023 · Artificial Intelligence

Google Unveils Gemini: A New Multimodal Large Model Family (Ultra, Pro, Nano)

Google announced Gemini, a suite of multimodal large language models—including Ultra, Pro, and Nano—that achieve state‑of‑the‑art results on dozens of benchmarks, support native multimodal pre‑training, and are being integrated across Google products such as Bard, Search, and upcoming Pixel devices.

Artificial IntelligenceGeminiGoogle AI
0 likes · 7 min read
Google Unveils Gemini: A New Multimodal Large Model Family (Ultra, Pro, Nano)
Tencent Cloud Developer
Tencent Cloud Developer
Dec 7, 2023 · Artificial Intelligence

Student Score Ranking and Distribution Analysis Using Python and Tencent Hunyuan Model

Using Tencent's Hunyuan model, the tutorial walks through a Python workflow that scrapes a student‑score table from a web page, saves it as CSV and Excel, cleans missing values, computes total and average scores, and visualizes their distributions with matplotlib, illustrating how LLMs can accelerate data‑analysis coding while still needing human verification.

Data AnalysisData visualizationLarge Language Model
0 likes · 8 min read
Student Score Ranking and Distribution Analysis Using Python and Tencent Hunyuan Model
AntTech
AntTech
Dec 2, 2023 · Artificial Intelligence

TechTalk AI Sharing Season: OpenKG Enters Ant Group – Knowledge Graphs and Large Language Models Empower General AI

The TechTalk AI Sharing Season event on November 28 brought together nearly thirty experts from academia and industry to discuss how knowledge graphs and large language models can be integrated to enhance Ant Group's AI strategy across diverse business scenarios, highlighting collaborations, research labs, and future development directions.

AI strategyAnt GroupIndustry-Academia Collaboration
0 likes · 7 min read
TechTalk AI Sharing Season: OpenKG Enters Ant Group – Knowledge Graphs and Large Language Models Empower General AI
HomeTech
HomeTech
Dec 1, 2023 · Artificial Intelligence

Building a Private Knowledge Base and Large‑Model Platform for Enterprise AI Assistants

This article describes how an enterprise leveraged GPT‑3.5 and other large language models to create a private knowledge base, design prompt engineering, implement plugin extensions, and build a secure, scalable backend and front‑end integration platform that enables AI‑driven customer‑service assistants across multiple business lines.

AILarge Language ModelPrivate Knowledge Base
0 likes · 19 min read
Building a Private Knowledge Base and Large‑Model Platform for Enterprise AI Assistants
Baidu Geek Talk
Baidu Geek Talk
Nov 27, 2023 · Industry Insights

Inside Baidu’s Lingjing Platform: How AI Developer Ecosystems Are Built

This article examines Baidu’s Lingjing developer platform, exploring its origins, design choices, integration of plugins and agents, ecosystem advantages, commercial‑monetization loops, and future roadmap, while providing insights from an interview with platform head Zhang Ruixing on the challenges and opportunities of building AI‑native developer platforms.

AIAgentDeveloper Platform
0 likes · 16 min read
Inside Baidu’s Lingjing Platform: How AI Developer Ecosystems Are Built
Ant R&D Efficiency
Ant R&D Efficiency
Nov 21, 2023 · Artificial Intelligence

Can AI Code Completion Transform Java Development? One Engineer’s Journey

Java engineer Wu Ming shares his experience with CodeFuse, an AI-powered code completion tool, describing how large language models enhance coding efficiency, the challenges of early versions, practical tips for integrating AI assistants into workflows, and his vision for AI’s expanding role across the entire software development lifecycle.

AI code assistantAI workflowCodeFuse
0 likes · 12 min read
Can AI Code Completion Transform Java Development? One Engineer’s Journey