Tagged articles
5000 articles
Page 30 of 50
BirdNest Tech Talk
BirdNest Tech Talk
Aug 2, 2024 · Industry Insights

What’s Next for Go? Inside the Oscar Contributor Agent Project

The article traces the lineage of Go’s technical leadership, explains Russ Cox’s shift to AI, and details the Oscar open‑source contributor‑agent architecture that uses large language models to automate maintenance tasks while preserving deterministic code execution.

AIContributor AgentIndustry Insights
0 likes · 10 min read
What’s Next for Go? Inside the Oscar Contributor Agent Project
Java Tech Enthusiast
Java Tech Enthusiast
Aug 1, 2024 · Artificial Intelligence

Apple Intelligence: Inside the New Apple Foundation Model

Apple Intelligence, an on‑device AI suite debuting with iOS 18.1 beta, centers on the Apple Foundation Model—a 3‑billion‑parameter on‑device LLM (and a larger undisclosed cloud version) trained on TPUs with novel RL algorithms and mixed‑precision quantization, delivering Siri, writing assistance, photo search, and benchmark performance that surpasses GPT‑4, though currently limited to paid developers.

AIApple IntelligenceLarge Language Model
0 likes · 11 min read
Apple Intelligence: Inside the New Apple Foundation Model
Kuaishou Tech
Kuaishou Tech
Jul 31, 2024 · Artificial Intelligence

Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024

At the China Multimedia 2024 conference in Yinchuan, Kuaishou presented its latest AI‑driven large‑model technologies—including text‑to‑image, text‑to‑video, and audio models—alongside advances in intelligent video coding, a new research‑fund initiative, and recent industry awards.

AIKuaishouMultimedia
0 likes · 5 min read
Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024
Baidu Geek Talk
Baidu Geek Talk
Jul 31, 2024 · Artificial Intelligence

Quantitative Analysis of Transformer Architecture and Llama Model Performance

This engineering‑focused document reviews transformer fundamentals, derives precise FLOP and memory formulas for attention and feed‑forward layers, defines the MFU performance metric, analyzes memory components and parallelism strategies, examines recent architecture variants such as MQA, GQA, sliding‑window attention and MoE, and provides practice problems applying these calculations.

AIGPU computingTransformer
0 likes · 30 min read
Quantitative Analysis of Transformer Architecture and Llama Model Performance
FunTester
FunTester
Jul 30, 2024 · Operations

Mastering True Observability: Models, Practices, and AI‑Driven Automation

This article explains why true observability is essential for modern software, outlines its five core pillars, details a four‑stage maturity model with benefits and drawbacks, and provides practical steps—including data collection, team organization, and AI automation—to advance from basic monitoring to predictive, self‑healing systems.

AILoggingMaturity Model
0 likes · 13 min read
Mastering True Observability: Models, Practices, and AI‑Driven Automation
DeWu Technology
DeWu Technology
Jul 29, 2024 · Artificial Intelligence

AI-Driven Loss Prevention: A Comprehensive Field-Level Risk Control System

The paper introduces an AI‑driven loss‑prevention platform that augments manual risk analysis with automated field recognition to map database and code models, generate loss‑related methods and interfaces, and deliver pre‑emptive avoidance, real‑time detection, and post‑incident response, achieving over 1,200% growth in identified loss methods and near‑full field coverage.

AIBusiness Intelligencedatabase analysis
0 likes · 8 min read
AI-Driven Loss Prevention: A Comprehensive Field-Level Risk Control System
DataFunSummit
DataFunSummit
Jul 29, 2024 · Artificial Intelligence

Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions

This article reviews the state‑of‑the‑art applications of large language models in recommendation systems, summarizing background knowledge, recent advances such as LLM4Rec, various tuning strategies, agent‑based approaches, open research problems, and future directions for generative recommendation.

AIIn-Context LearningLLM
0 likes · 24 min read
Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions
php Courses
php Courses
Jul 29, 2024 · Artificial Intelligence

Building Reinforcement Learning Algorithms with PHP

This article explains the fundamentals of reinforcement learning, demonstrates how PHP can be used with neural‑network libraries such as Keras or TensorFlow to implement a simple reinforcement‑learning agent, provides a complete PHP code example, and discusses its potential applications.

AICode Examplereinforcement learning
0 likes · 5 min read
Building Reinforcement Learning Algorithms with PHP
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Jul 29, 2024 · Artificial Intelligence

How to Run Real‑Time Voice Cloning with Python: A Step‑by‑Step Guide

This guide introduces the open‑source Realtime Voice Cloning project, explains its key features, and provides detailed installation and usage instructions—including environment setup, dependency installation, cloning the repository, and running the demo tool—to enable real‑time voice transformation with Python.

AIPythonReal-time Audio
0 likes · 5 min read
How to Run Real‑Time Voice Cloning with Python: A Step‑by‑Step Guide
DataFunSummit
DataFunSummit
Jul 28, 2024 · Artificial Intelligence

Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions

This article reviews why large language models can be applied to graph learning, outlines their capabilities and graph data characteristics, surveys current research across different graph types and LLM roles, and proposes future research directions for unified cross‑domain graph learning.

AILarge Language ModelsResearch Directions
0 likes · 19 min read
Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions
Python Programming Learning Circle
Python Programming Learning Circle
Jul 27, 2024 · Artificial Intelligence

Numpy‑ML: A Pure NumPy Implementation of Machine Learning Algorithms

The Numpy‑ML project, created by UC Berkeley’s David Bourgin, provides a comprehensive pure‑NumPy implementation of over 30 machine‑learning algorithms—including probabilistic models, neural‑network layers, optimizers, and reinforcement‑learning agents—along with extensive data‑preprocessing utilities, all in a single open‑source repository.

AIAlgorithmsNumPy
0 likes · 6 min read
Numpy‑ML: A Pure NumPy Implementation of Machine Learning Algorithms
DataFunTalk
DataFunTalk
Jul 26, 2024 · Artificial Intelligence

Llama 3: Open‑source Large Language Model Technical Report and Evaluation

This comprehensive technical report details the development, architecture, training methodology, extensive benchmark evaluations, safety measures, and inference optimizations of Meta's open‑source Llama 3 large language model series, covering models up to 405 billion parameters and supporting multilingual, multimodal, and tool‑use capabilities.

AILLaMALarge Language Model
0 likes · 115 min read
Llama 3: Open‑source Large Language Model Technical Report and Evaluation
Data Thinking Notes
Data Thinking Notes
Jul 25, 2024 · Information Security

How Large Language Models Transform Data Security Compliance Management

This article explains how a leading insurance technology group leverages large language models to streamline data security compliance, detailing the evolution of data management, key governance challenges, multimodal AI architecture, and practical workflows for policy enforcement, risk monitoring, and asset management.

AIComplianceData Security
0 likes · 10 min read
How Large Language Models Transform Data Security Compliance Management
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 25, 2024 · Artificial Intelligence

Designing Autonomous LLM Agents: Architecture, Memory, Planning, and Learning Strategies

This article surveys the design of autonomous large‑language‑model agents, detailing their modular architecture—including profiling, memory, planning, and execution—while also reviewing common profiling methods, memory structures, planning techniques, action strategies, and various learning approaches such as exemplar, human‑in‑the‑loop, and environment‑feedback training.

AIAgent ArchitectureAutonomous agents
0 likes · 36 min read
Designing Autonomous LLM Agents: Architecture, Memory, Planning, and Learning Strategies
Qunar Tech Salon
Qunar Tech Salon
Jul 25, 2024 · Artificial Intelligence

AI-Generated Video Practices for International Hotels

At the WOT2024 conference, Qunar Travel’s CTO Zheng Jimin presented a comprehensive overview of AI-generated video production for international hotels, detailing challenges, AI-driven workflow automation, practical implementation steps, multilingual translation enhancements, and performance results, offering valuable insights for scaling high‑quality hotel video content.

AIAIGCHotel Industry
0 likes · 11 min read
AI-Generated Video Practices for International Hotels
phodal
phodal
Jul 24, 2024 · Artificial Intelligence

How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow

This article explains how to use the Shire language to create reliable coding agents by defining custom RAG workflows, leveraging IDE APIs, code verification functions, and vector‑based search, with detailed examples, configuration snippets, and a roadmap for future enhancements.

AICoding AgentIDE
0 likes · 10 min read
How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow
Alibaba Cloud Native
Alibaba Cloud Native
Jul 24, 2024 · Cloud Native

How to Observe and Optimize LLM Applications with Alibaba Cloud ARMS

This article explains the challenges of deploying large language model (LLM) applications, outlines the need for end‑to‑end observability, and details Alibaba Cloud ARMS' LLM‑specific tracing, metrics, and Python agent solutions for monitoring, debugging, and performance optimization.

AILLMMetrics
0 likes · 20 min read
How to Observe and Optimize LLM Applications with Alibaba Cloud ARMS
Baidu Geek Talk
Baidu Geek Talk
Jul 24, 2024 · Artificial Intelligence

AI-Driven Fusion of Peking Opera Characters with Ink-Wash Painting Style Using PaddleGAN

Li Yilin’s AI project blends Peking Opera characters with traditional ink‑wash painting by using PaddleHub for style transfer and PaddleGAN’s First‑Order Motion model for facial motion, then adds music and Wav2Lip lip‑sync, producing videos that modernize Chinese heritage and gauge public cultural awareness.

AIDeep LearningPaddleGAN
0 likes · 9 min read
AI-Driven Fusion of Peking Opera Characters with Ink-Wash Painting Style Using PaddleGAN
StarRocks
StarRocks
Jul 24, 2024 · Big Data

Why Lakehouse Architecture Is Redefining Big Data Infrastructure in the AI Era

The article examines the rapid rise of lakehouse architecture, its market momentum, core components—including storage, metadata, table formats, and compute layers—compares Iceberg, Hudi, and Delta Lake, discusses the shift from HDFS to object storage, and outlines the strategic importance of lakehouses for AI-driven data management and future data infrastructure trends.

AIApache IcebergBig Data
0 likes · 28 min read
Why Lakehouse Architecture Is Redefining Big Data Infrastructure in the AI Era
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 24, 2024 · Artificial Intelligence

How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability

This case study details how ShiHuo, an e‑commerce recommendation platform, overcame rapid product growth, cluster instability, and high operational overhead by adopting Alibaba Cloud’s fully managed Milvus vector search service, achieving higher performance, better availability, and reduced management costs.

AIMilvusScalability
0 likes · 8 min read
How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability
JD Tech Talk
JD Tech Talk
Jul 23, 2024 · Artificial Intelligence

Intelligent Parcel Identification Using Large Language Models in JD Express Logistics

This article examines how JD Express applies large‑language‑model‑based natural language processing to accurately recognize and classify shipped items, addressing low matching rates, improving packaging recommendations, reducing damage and claims, and outlining architecture, model selection criteria, caching strategies, and future operational benefits.

AIJD ExpressNLP
0 likes · 21 min read
Intelligent Parcel Identification Using Large Language Models in JD Express Logistics
JD Cloud Developers
JD Cloud Developers
Jul 23, 2024 · Artificial Intelligence

How JD Logistics Boosts Parcel Accuracy with Large‑Model AI Recognition

JD Logistics tackles low parcel‑matching rates and high breakage costs by deploying large‑language‑model AI to intelligently recognize and classify shipped items, improving packaging recommendations, reducing claims, optimizing routes, and enhancing both B‑to‑B and C‑to‑C operations across its extensive delivery network.

AILogisticsOperational Efficiency
0 likes · 24 min read
How JD Logistics Boosts Parcel Accuracy with Large‑Model AI Recognition
JavaEdge
JavaEdge
Jul 22, 2024 · Artificial Intelligence

What Is a Transformer and Why It’s Transforming AI?

This article explains the fundamentals of transformer models, why they outperform earlier neural networks, their core components such as self‑attention and positional encoding, practical use cases across language and biology, and how they differ from RNNs, CNNs, and other architectures.

AIDeep LearningSelf-Attention
0 likes · 20 min read
What Is a Transformer and Why It’s Transforming AI?
DataFunTalk
DataFunTalk
Jul 21, 2024 · Artificial Intelligence

Integrating DataOps with Large Language Models for Text2SQL: Practices, Challenges, and Future Directions

This article presents a comprehensive overview of how DataOps principles combined with large language models such as GPT‑4 enable more agile and intelligent data engineering workflows, focusing on Text2SQL applications, schema‑linking techniques, practical product implementations, and future research challenges.

AIDataOpsLarge Language Models
0 likes · 23 min read
Integrating DataOps with Large Language Models for Text2SQL: Practices, Challenges, and Future Directions
DaTaobao Tech
DaTaobao Tech
Jul 19, 2024 · Artificial Intelligence

Practices and Techniques for Vertical Domain Large Language Models

Vertical domain large language models, fine‑tuned on specialized data, deliver higher expertise and task performance, but require continual knowledge updates and careful alignment; techniques such as BPO‑guided instruction tuning (+1.8% accuracy), Reflexion‑based Text2API (+4% API correctness), advanced RAG preprocessing, and SFT combined with ORPO (+5.2% gain) demonstrate notable improvements while underscoring remaining challenges and collaborative opportunities.

AIRAGSFT
0 likes · 9 min read
Practices and Techniques for Vertical Domain Large Language Models
DeWu Technology
DeWu Technology
Jul 19, 2024 · Artificial Intelligence

AI‑Powered Anomaly Detection Algorithms for Observability Metrics

The article explains how AI‑powered anomaly detection—using statistical 3‑sigma/Z-score methods, unsupervised machine‑learning like Isolation Forest, and deep‑learning models such as LSTM, Transformer and Pyraformer—overcomes the limits of threshold‑based monitoring by preprocessing data, reducing false alerts, and delivering high‑precision observability metrics.

AIAnomaly DetectionDeep Learning
0 likes · 13 min read
AI‑Powered Anomaly Detection Algorithms for Observability Metrics
Tencent Advertising Technology
Tencent Advertising Technology
Jul 19, 2024 · Artificial Intelligence

The Brutal Aesthetics of Data and Compute: Scaling Laws, Generative AI, and the Evolution of Advertising Systems

This article explains how the scaling law—massive data, compute, and a simple transformer architecture—drives generative AI breakthroughs, how Tencent applied this principle to build larger ad models and the "Hunyuan" large model, and how advertising systems must evolve to truly understand content and users.

@DataAIGenerative AI
0 likes · 11 min read
The Brutal Aesthetics of Data and Compute: Scaling Laws, Generative AI, and the Evolution of Advertising Systems
IT Services Circle
IT Services Circle
Jul 18, 2024 · Artificial Intelligence

Insights on Baidu’s “Robo Fast Run” Autonomous Driving Initiative and Related Career Opportunities

The article discusses Baidu’s new autonomous‑driving service “Robo Fast Run,” its potential impact on ride‑hailing jobs, the technical challenges involved, and provides detailed guidance on the qualifications and learning paths for engineers seeking positions in Baidu’s intelligent‑driving divisions.

AIAutonomous DrivingBaidu
0 likes · 5 min read
Insights on Baidu’s “Robo Fast Run” Autonomous Driving Initiative and Related Career Opportunities
Tencent Cloud Developer
Tencent Cloud Developer
Jul 18, 2024 · Artificial Intelligence

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

Exploring Large Language Models, this article surveys their core concepts, evolution through Transformers, GPT and BERT, generation challenges, diverse applications such as QA, multimodal creation, summarization and retrieval‑augmented generation, prompt‑engineering frameworks and tools, LangChain‑based pipelines, AI‑driven agents, and future prospects toward domain‑specific use, multimodality, and AGI.

AILLMRAG
0 likes · 35 min read
Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions
Sohu Tech Products
Sohu Tech Products
Jul 17, 2024 · Artificial Intelligence

How Weak Supervision Powers Ant Group’s Real‑World AI Challenges

This article presents a comprehensive technical overview of weak‑supervision machine learning at Ant Group, covering its fundamentals, cross‑domain causal effect estimation, strategies for scarce or noisy labels, novel framework components, experimental validation, and practical application scenarios.

AIWeak Supervisioncausal inference
0 likes · 18 min read
How Weak Supervision Powers Ant Group’s Real‑World AI Challenges
Kuaishou Tech
Kuaishou Tech
Jul 16, 2024 · Artificial Intelligence

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

LivePortrait is an open‑source, controllable portrait video generation framework that transfers facial expressions and poses from a driving video to static or dynamic portraits in real time, leveraging a 69M‑frame mixed video‑image training set, stitching and retargeting modules, and achieving high quality with low latency.

AIDeep LearningVideo Animation
0 likes · 14 min read
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
JD Tech Talk
JD Tech Talk
Jul 16, 2024 · Artificial Intelligence

Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models

TaD, a task‑aware decoding technique jointly developed by JD.com and Tsinghua University and presented at IJCAI 2024, leverages differences between pre‑ and post‑fine‑tuned LLM outputs to construct knowledge vectors, significantly reducing hallucinations across various models, tasks, and data‑scarce scenarios, especially when combined with RAG.

AILLMRAG
0 likes · 18 min read
Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models
Architects' Tech Alliance
Architects' Tech Alliance
Jul 15, 2024 · Industry Insights

Why Ethernet Is Overtaking InfiniBand in AI and Data Center Networks

The article analyzes the 2022 global and Chinese switch markets, explains how distributed computing and generative AI workloads rely on high‑performance switches, compares Ethernet and InfiniBand technologies—including bandwidth, latency, and cost factors—and outlines major vendor strategies and future trends in the networking industry.

AIData CenterInfiniBand
0 likes · 14 min read
Why Ethernet Is Overtaking InfiniBand in AI and Data Center Networks
Alibaba Cloud Native
Alibaba Cloud Native
Jul 15, 2024 · Cloud Native

How AI-Driven Gateways Are Evolving to Meet LLM Demands

The article examines how AI-era large language model (LLM) applications impose new traffic, security, and scalability requirements on gateways, and explains how the Envoy‑based open‑source Higress gateway addresses these challenges with hot configuration updates, token‑based rate limiting, streaming support, and multi‑tenant capabilities.

AIInfraLLM
0 likes · 19 min read
How AI-Driven Gateways Are Evolving to Meet LLM Demands
Baidu Geek Talk
Baidu Geek Talk
Jul 15, 2024 · Industry Insights

How AI Is Revolutionizing Physical Network Fault Localization

This article explains how Baidu Cloud evolved from manual and integrated network fault detection to AI-driven localization using large language models, detailing structured prompting, multi‑agent workflows, and real‑world comparisons that demonstrate improved accuracy and faster mitigation.

AIFault LocalizationLarge Language Model
0 likes · 14 min read
How AI Is Revolutionizing Physical Network Fault Localization
phodal
phodal
Jul 15, 2024 · Industry Insights

How AI Is Redefining the Entire Software Development Lifecycle in 2024

From 2023 to 2024, AI has expanded from simple developer assistance to comprehensive, lifecycle‑wide support, evolving through individual, team, and organizational layers and shifting from local IDE plugins to domain‑specific intelligent code generation tools.

AIAI assistantsdevelopment tools
0 likes · 16 min read
How AI Is Redefining the Entire Software Development Lifecycle in 2024
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 15, 2024 · Artificial Intelligence

How EasyAnimate v3 Generates High‑Resolution Videos with Diffusion Transformers

EasyAnimate v3, an open‑source video generation system from Alibaba Cloud AI Platform, introduces Diffusion Transformer‑based architecture, Hybrid Motion Module, and Slice VAE to enable image‑to‑video, text‑to‑video, and unlimited‑length video creation with up to 720p/144 fps resolution on modest GPU memory.

AIDiffusion TransformerEasyAnimate
0 likes · 5 min read
How EasyAnimate v3 Generates High‑Resolution Videos with Diffusion Transformers
DataFunTalk
DataFunTalk
Jul 14, 2024 · Artificial Intelligence

Time Series and Machine Learning – An Overview and Book Introduction

The article introduces the rapid rise of large language models, the abundance of time‑series data in many sectors, and explains how combining machine‑learning and deep‑learning techniques with time‑series analysis has become a research hotspot, culminating in a new book that systematically covers theory, methods, and real‑world applications.

AIAnomaly Detectionmachine learning
0 likes · 10 min read
Time Series and Machine Learning – An Overview and Book Introduction
AntData
AntData
Jul 12, 2024 · Databases

Recent Advances in Vector Databases Presented at SIGMOD 2024

This article reviews the latest vector database research showcased at SIGMOD 2024, covering system designs such as Starling, Vexless, RaBitQ, and ACORN, and discusses current academic hotspots including query processing, index structures, optimization techniques, and hardware acceleration for large‑scale similarity search.

AISIGMOD 2024Vector Database
0 likes · 20 min read
Recent Advances in Vector Databases Presented at SIGMOD 2024
21CTO
21CTO
Jul 11, 2024 · Operations

12 Must-Have Open‑Source Tools Every Developer Should Use

Discover a curated list of twelve powerful open‑source tools—from IDEs and API testers to AI model runners and DevOps utilities—that can dramatically boost developer productivity and streamline everyday workflows.

AIAPI testingIDE
0 likes · 10 min read
12 Must-Have Open‑Source Tools Every Developer Should Use
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 10, 2024 · Artificial Intelligence

How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision

This article explains how Baidu Cloud evolved its physical network fault‑location workflow—from manual analysis and integrated multi‑signal algorithms to AI‑driven reasoning with large language models—highlighting structured prompting, multi‑agent collaboration, and measurable improvements in accuracy and automation.

AIFault LocalizationLarge Language Model
0 likes · 15 min read
How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision
DataFunSummit
DataFunSummit
Jul 10, 2024 · Artificial Intelligence

Applying Large Language Models to Recommendation Systems at Ant Group

The article presents Ant Group's research on integrating large language models into recommendation pipelines, covering background challenges, knowledge extraction, teacher‑model distillation, efficient deployment, experimental results, and future directions to improve accuracy and reduce bias.

AILLMRecommendation Systems
0 likes · 13 min read
Applying Large Language Models to Recommendation Systems at Ant Group
Architects' Tech Alliance
Architects' Tech Alliance
Jul 10, 2024 · Industry Insights

Why AI Large Models Are Driving the Next Industrial Revolution

The article analyzes the rapid evolution of AI large models—from their role in advancing AGI through massive pre‑training and fine‑tuning, to current market dynamics led by GPT and domestic Chinese players, and finally to future multimodal applications, content‑factory capabilities, and emerging AIGC revenue models projected to reach trillion‑yuan scales by 2030.

AIAIGCGPT
0 likes · 7 min read
Why AI Large Models Are Driving the Next Industrial Revolution
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 9, 2024 · Databases

Why Vector Databases Are the Future Backbone of AI Applications

This article explains how vector databases store and query high‑dimensional embeddings, compares them with standalone vector indexes, outlines common embedding types and indexing algorithms, and discusses performance, monitoring, security, and API considerations for building robust AI‑driven systems.

AIANN searchindexing
0 likes · 22 min read
Why Vector Databases Are the Future Backbone of AI Applications
Baidu Tech Salon
Baidu Tech Salon
Jul 9, 2024 · Artificial Intelligence

AI-Powered Job Matching Application Using ERNIE SDK

The AI‑powered job‑matching application built with Baidu’s ERNIE SDK, created by PaddlePaddle expert Gao Fuzhi, intelligently parses a candidate’s resume, matches them to suitable positions, supplies detailed salary, location and benefit data, analyzes job requirements, and offers personalized skill and interview guidance, aiming to improve recruitment efficiency for both seekers and employers.

AIERNIE SDKLarge Language Model
0 likes · 8 min read
AI-Powered Job Matching Application Using ERNIE SDK
Architects' Tech Alliance
Architects' Tech Alliance
Jul 9, 2024 · Industry Insights

How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking

Nvidia plans to shorten its GPU generation cycle to one year, launching Blackwell Ultra in 2025, Rubin in 2026, and Rubin Ultra in 2027, while boosting token‑generation efficiency and introducing AI‑optimized Ethernet solutions like Spectrum‑X800, aiming to dominate large‑scale AI clusters and reshape the high‑performance networking market.

AIGPUNVIDIA
0 likes · 6 min read
How Nvidia’s Accelerated GPU Roadmap Is Shaping AI‑Scale Networking
Architects' Tech Alliance
Architects' Tech Alliance
Jul 7, 2024 · Operations

Designing High‑Performance Cluster Networks for AI Large Models: InfiniBand vs RoCE

The article analyzes the networking challenges of AI super‑large models, comparing InfiniBand and RoCE technologies, and presents design guidelines for ultra‑scale, high‑bandwidth, low‑latency, and highly stable cluster interconnects to maximize GPU utilization and overall training efficiency.

AIGPU interconnectHigh‑Performance Computing
0 likes · 14 min read
Designing High‑Performance Cluster Networks for AI Large Models: InfiniBand vs RoCE
AntTech
AntTech
Jul 7, 2024 · Information Security

2024 WAIC Forum on Privacy Computing: Enabling Trusted Data Sharing for Large Models

The 2024 WAIC Privacy Computing Forum gathered leading experts from academia and industry to discuss how encryption, anonymization, and secure multi‑party computation can protect data privacy while enabling large‑model training and inference, highlighting technical challenges, standards, and emerging solutions across AI, big data, and information security domains.

AIData SecurityMPC
0 likes · 15 min read
2024 WAIC Forum on Privacy Computing: Enabling Trusted Data Sharing for Large Models
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 6, 2024 · Artificial Intelligence

ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4

This article provides a comprehensive technical overview of the ChatGLM series—from the original ChatGLM‑6B model and its GLM‑based pre‑training framework to the enhancements in ChatGLM‑2, the architectural parity of ChatGLM‑3, and the advanced capabilities of the latest ChatGLM‑4, covering model structure, position encoding, attention mechanisms, multi‑task pretraining, and tool integration.

AIChatGLMGLM
0 likes · 25 min read
ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4
AntTech
AntTech
Jul 6, 2024 · Artificial Intelligence

Ant Group's Alipay Smart Assistant and Other AI Applications Featured in the 2024 World AI Conference Case Collection

At the 2024 World Artificial Intelligence Conference, Ant Group showcased seven AI-driven applications—including the Alipay Smart Assistant, AI Financial Butler, and other specialized intelligent agents—highlighting how large‑model ecosystems and professional intelligent agents are accelerating rigorous industry adoption of generative AI.

AIAI applicationsAlipay
0 likes · 5 min read
Ant Group's Alipay Smart Assistant and Other AI Applications Featured in the 2024 World AI Conference Case Collection
Ops Development & AI Practice
Ops Development & AI Practice
Jul 4, 2024 · Artificial Intelligence

Discriminative vs Generative Models: When to Use Each in AI

The article explains the fundamental differences between discriminative and generative models, detailing their learning objectives, typical algorithms, key characteristics, example implementations, and practical application scenarios, helping readers choose the appropriate model for classification or data‑generation tasks.

AIDiscriminative ModelsGenerative Models
0 likes · 6 min read
Discriminative vs Generative Models: When to Use Each in AI
360 Smart Cloud
360 Smart Cloud
Jul 4, 2024 · Artificial Intelligence

Optimizing Mixture-of-Experts (MoE) Training with the QLM Framework

This article introduces the background and challenges of large language model training, explains the Mixture-of-Experts (MoE) architecture, and details several optimization techniques implemented in the QLM framework—including fine-grained and shared experts, top‑k gating, token distribution, expert parallelism, and grouped GEMM – to improve training efficiency and performance.

AILarge Language ModelsMixture of Experts
0 likes · 10 min read
Optimizing Mixture-of-Experts (MoE) Training with the QLM Framework
Continuous Delivery 2.0
Continuous Delivery 2.0
Jul 2, 2024 · Artificial Intelligence

How Generative AI Tools Are Transforming Software Engineering: Insights from Seattle Tech Leaders

Generative AI tools such as GitHub Copilot, ChatGPT, and Amazon CodeWhisperer are rapidly reshaping software engineering, with surveys showing over 90% of U.S. developers using them, and Seattle tech leaders reporting doubled productivity, faster code migration, and new workflows across startups and enterprises.

AIChatGPTGenerative AI
0 likes · 9 min read
How Generative AI Tools Are Transforming Software Engineering: Insights from Seattle Tech Leaders
Kuaishou Tech
Kuaishou Tech
Jul 1, 2024 · Artificial Intelligence

Kuaishou Showcases AI Innovations at CVPR 2024: Competitions, Large‑Model Demonstrations, and Research Highlights

At CVPR 2024 in Seattle, Kuaishou presented its latest AI research and applications, including a star‑studded gala, a short‑video quality competition, large‑model video generation demos, a multi‑dimensional text‑to‑image evaluation paper, and advanced video processing technologies, underscoring its strong ties with the academic community.

AICVPR 2024Large‑Scale Video Generation
0 likes · 9 min read
Kuaishou Showcases AI Innovations at CVPR 2024: Competitions, Large‑Model Demonstrations, and Research Highlights
Continuous Delivery 2.0
Continuous Delivery 2.0
Jul 1, 2024 · Artificial Intelligence

How Meta Uses Llama2 to Accelerate Incident Response and Root‑Cause Analysis in AIOps

This article explains how Meta applies AI, specifically a fine‑tuned Llama2 model, to improve AIOps by automating incident monitoring, providing real‑time summaries, assisting responders with contextual information, and efficiently narrowing down root‑cause changes, ultimately reducing incident resolution time from hours to minutes.

AILlama2Meta
0 likes · 13 min read
How Meta Uses Llama2 to Accelerate Incident Response and Root‑Cause Analysis in AIOps
21CTO
21CTO
Jun 30, 2024 · Artificial Intelligence

Which Programming Language Is Best for AI? A Comparative Guide

This article surveys the most suitable programming languages for artificial intelligence, examining Python, R, Java, LISP, Prolog, C++, Haskell, JavaScript, and Julia, and outlines their strengths, ecosystem support, and use cases to help developers choose the right tool for AI projects.

AIC++Java
0 likes · 15 min read
Which Programming Language Is Best for AI? A Comparative Guide
AntTech
AntTech
Jun 30, 2024 · Artificial Intelligence

AI Volunteer Assistant for College Entrance Exam Using the agentUniverse Multi‑Agent Framework

The article introduces an AI‑powered “Volunteer Assistant” built on the agentUniverse multi‑agent framework, detailing how it outperforms existing tools by integrating a specialized SOP, multi‑agent collaboration, and employment‑market analysis to provide precise, personalized college‑major recommendations for high‑school graduates.

AICollege AdmissionsEducation Technology
0 likes · 7 min read
AI Volunteer Assistant for College Entrance Exam Using the agentUniverse Multi‑Agent Framework
Continuous Delivery 2.0
Continuous Delivery 2.0
Jun 29, 2024 · Artificial Intelligence

AI in Software Engineering at Google: Progress and the Path Ahead

The article describes how Google has integrated AI, particularly large language models, into its internal software development tools to improve developer productivity, outlines the challenges faced, shares lessons learned, and outlines future directions for AI‑driven engineering assistance.

AIGoogleLLM
0 likes · 10 min read
AI in Software Engineering at Google: Progress and the Path Ahead
21CTO
21CTO
Jun 28, 2024 · Artificial Intelligence

Can Microsoft’s Copilot+ PC Turn ARM Windows into an AI Powerhouse?

The article examines Microsoft’s Copilot+ PC, its ARM‑based Windows platform, Snapdragon SoC performance, AI runtime integration, and future GPT‑6 expectations, weighing whether these innovations will attract developers to the ARM Windows ecosystem.

AIARM WindowsGPT-6
0 likes · 8 min read
Can Microsoft’s Copilot+ PC Turn ARM Windows into an AI Powerhouse?
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 28, 2024 · Artificial Intelligence

What Makes Gemma 2 a Competitive Open‑Source LLM? Architecture, Training, and Evaluation Insights

The article provides a detailed technical overview of Gemma 2, covering its decoder‑only transformer design, novel attention mechanisms, logit soft‑capping, RMSNorm, knowledge‑distillation training on trillions of tokens, extensive pre‑training infrastructure, and benchmark evaluations that demonstrate its competitiveness against larger proprietary models.

AIGemma 2benchmark evaluation
0 likes · 14 min read
What Makes Gemma 2 a Competitive Open‑Source LLM? Architecture, Training, and Evaluation Insights
Kuaishou Large Model
Kuaishou Large Model
Jun 27, 2024 · Artificial Intelligence

How I2V-Adapter Turns Images into Videos with Minimal Training

Fast‑forwarding image‑to‑video generation, the article introduces I2V‑Adapter, a lightweight plug‑in for Stable Diffusion‑based video diffusion models that converts a single static image into a coherent video without altering the original T2V architecture, and details its design, frame‑similarity prior, experimental results, and real‑world applications.

AII2V-AdapterStable Diffusion
0 likes · 9 min read
How I2V-Adapter Turns Images into Videos with Minimal Training
Python Programming Learning Circle
Python Programming Learning Circle
Jun 27, 2024 · Artificial Intelligence

Homemade Machine Learning – Python Implementations of Popular Algorithms with Jupyter Notebooks

This article introduces the GitHub "Homemade Machine Learning" project, which provides pure‑Python implementations of common supervised and unsupervised machine‑learning algorithms, complete with mathematical explanations, Jupyter‑Notebook demos, installation instructions, and links to datasets for hands‑on learning.

AIJupyterPython
0 likes · 6 min read
Homemade Machine Learning – Python Implementations of Popular Algorithms with Jupyter Notebooks
JD Tech
JD Tech
Jun 27, 2024 · Artificial Intelligence

AI‑Powered Real‑Time Multilingual Support and Terminology Management for International Logistics

The article outlines an AI‑driven approach to streamline multilingual support in an international logistics system by building a unified terminology library, enabling user‑editable terms, automating translation with large language models, and implementing versioned cloud‑based language packs to reduce cost and improve efficiency.

AIinternationalizationmultilingual
0 likes · 16 min read
AI‑Powered Real‑Time Multilingual Support and Terminology Management for International Logistics
Kuaishou Tech
Kuaishou Tech
Jun 26, 2024 · Artificial Intelligence

I2V-Adapter: A Lightweight Image‑to‑Video Adapter for Stable Diffusion Video Diffusion Models

The I2V-Adapter paper introduces a plug‑and‑play lightweight module that enables static images to be converted into dynamic videos using Stable Diffusion‑based text‑to‑video diffusion models without altering the original architecture or pretrained parameters, achieving competitive quality with far less training cost.

AII2V-AdapterStable Diffusion
0 likes · 8 min read
I2V-Adapter: A Lightweight Image‑to‑Video Adapter for Stable Diffusion Video Diffusion Models
NewBeeNLP
NewBeeNLP
Jun 26, 2024 · Interview Experience

From AI Research to Development: My Summer Internship Reflection

After a summer internship, I compare the challenges of pursuing AI research versus software development, recount my friend's success in algorithm roles, and share personal insights on choosing a career path, interview preparation, and the true value of understanding over credentials.

AIInterviewReflection
0 likes · 5 min read
From AI Research to Development: My Summer Internship Reflection
JD Tech Talk
JD Tech Talk
Jun 25, 2024 · Artificial Intelligence

Understanding Large Language Models: From Parameters to Transformer Architecture

This article explains the fundamental concepts behind large language models, including their two-file structure, training process, neural network basics, perceptron examples, weight and threshold calculations, the TensorFlow Playground, and a detailed walkthrough of the Transformer architecture with tokenization, positional encoding, self‑attention, normalization, and feed‑forward layers.

AILarge Language ModelsSelf-Attention
0 likes · 20 min read
Understanding Large Language Models: From Parameters to Transformer Architecture
JavaEdge
JavaEdge
Jun 23, 2024 · Artificial Intelligence

Build a Cultural Name‑Generator with LangChain, Custom Prompts, and Output Parsers

This tutorial walks through installing LangChain, creating an LLM (via own GPU resources or third‑party APIs), designing parameterized prompt templates, implementing a custom output parser for structured results, and running a complete Python example that generates culturally specific names.

AILLMLangChain
0 likes · 7 min read
Build a Cultural Name‑Generator with LangChain, Custom Prompts, and Output Parsers
JavaEdge
JavaEdge
Jun 23, 2024 · Artificial Intelligence

What Is LangChain? Features, Pros, Cons, and Setup Guide

This article introduces LangChain, an open‑source framework for building LLM‑powered applications, outlines its key components such as prompts, chains, agents, and retrieval‑augmented generation, compares its advantages and drawbacks, and provides step‑by‑step instructions for setting up a Python development environment.

AIFrameworkLLM
0 likes · 7 min read
What Is LangChain? Features, Pros, Cons, and Setup Guide
Architecture and Beyond
Architecture and Beyond
Jun 23, 2024 · Artificial Intelligence

AI Programming Paradigms Unveiled: Visual ComfyUI Workflows and LangChain LLM Apps

The article examines two emerging AI programming paradigms—visual, node‑based development with ComfyUI for image generation and modular LLM application construction with LangChain—detailing their architectures, key components, workflow examples, advantages, limitations, and practical guidance for leveraging these tools to boost development efficiency in the rapidly evolving AI landscape.

AIComfyUILLM applications
0 likes · 20 min read
AI Programming Paradigms Unveiled: Visual ComfyUI Workflows and LangChain LLM Apps
Architects' Tech Alliance
Architects' Tech Alliance
Jun 22, 2024 · Artificial Intelligence

Rising Compute Demand of Generative AI Models and GPU Accelerator Trends in 2024

The article analyzes how generative AI models from GPT‑1 to the upcoming GPT‑5 are driving exponential growth in compute requirements, prompting massive cloud capital expenditures and intense competition among GPU vendors such as NVIDIA, AMD, Google, and emerging domestic chip makers, while also highlighting interconnect innovations and cost‑effective solutions.

AIAcceleratorsCompute
0 likes · 12 min read
Rising Compute Demand of Generative AI Models and GPU Accelerator Trends in 2024
AntTech
AntTech
Jun 21, 2024 · Artificial Intelligence

Ant Group’s Security Parallel Aspect Fusion AI: A Case Study Selected at the 2024 World Intelligent Industry Expo

The article details Ant Group’s security parallel aspect fusion AI solution, selected as an exemplary case at the 2024 World Intelligent Industry Expo, explaining its multi‑dimensional data collection, large‑model integration, baseline construction, knowledge‑graph generation, and superior threat‑detection performance.

AIAnt GroupKnowledge Graph
0 likes · 4 min read
Ant Group’s Security Parallel Aspect Fusion AI: A Case Study Selected at the 2024 World Intelligent Industry Expo
58UXD
58UXD
Jun 21, 2024 · Game Development

Build a Free AI‑Powered 3D Digital Human with Metahuman

This guide walks you through creating a realistic 3D AI digital human using Unreal Engine's free Metahuman system and other zero‑cost tools, covering model selection, facial and body adjustments, clothing, export, and tips for both realistic and stylized characters.

3DAIMetaHuman
0 likes · 8 min read
Build a Free AI‑Powered 3D Digital Human with Metahuman
AntTech
AntTech
Jun 20, 2024 · Artificial Intelligence

Predicting Football Match Outcomes with Graph Neural Networks and Large Language Models: The “Smart Guess Football” Project

During the 2024 European Championship, TuGraph engineers built an interactive system called “Smart Guess Football” that combines graph computing, graph neural networks, transformers and large language models to model player relationships and predict match outcomes, achieving up to 71% accuracy on limited test matches.

AIGraph Neural NetworkLarge Language Model
0 likes · 7 min read
Predicting Football Match Outcomes with Graph Neural Networks and Large Language Models: The “Smart Guess Football” Project
JD Cloud Developers
JD Cloud Developers
Jun 20, 2024 · Artificial Intelligence

How Large Language Models Boost Courier Efficiency: From Voice Commands to Smart QA

This article explains how large language models like ChatGPT can transform courier operations by automating voice‑driven tasks, enabling intelligent question answering with retrieval‑augmented generation, extracting and splitting document content, embedding it for vector search, and delivering smart prompts and agents to improve productivity and accuracy.

AIEmbeddingLogistics
0 likes · 15 min read
How Large Language Models Boost Courier Efficiency: From Voice Commands to Smart QA
Tencent Cloud Developer
Tencent Cloud Developer
Jun 20, 2024 · Cloud Computing

Can SaaS Become a Remedy for China's Software Industry? Insights on API Standardization, Business Visualization, and Open Domain

The author argues that, to revive China’s stagnant traditional software sector, SaaS must be reshaped through standardized business‑scenario APIs, real‑time business‑module visualization, and open‑domain architectures—augmented by AI‑driven code generation—to enable scalable customization and unlock new growth opportunities.

AIAPI standardizationBusiness Visualization
0 likes · 10 min read
Can SaaS Become a Remedy for China's Software Industry? Insights on API Standardization, Business Visualization, and Open Domain
Architects' Tech Alliance
Architects' Tech Alliance
Jun 19, 2024 · Industry Insights

China's Computing Power Network Market 2024: Trends, Scale, and Future Outlook

The 2024 white paper on China's computing power network outlines the evolution of data centers and IDC services, quantifies the digital economy’s 50.2 trillion RMB size, details a 624.75 billion RMB market in 2022 with a projected 1.06 trillion RMB valuation by 2025, and examines technology, application sectors, and emerging standards driving the industry.

AIData CenterDigital Economy
0 likes · 8 min read
China's Computing Power Network Market 2024: Trends, Scale, and Future Outlook
JD Tech
JD Tech
Jun 19, 2024 · Artificial Intelligence

Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs

This article surveys the rapid expansion of large AI models, covering prompt engineering, structured prompts, retrieval‑augmented generation, AI agents, fine‑tuning strategies, vector database technology, knowledge graphs, function calling, and their collective role in moving toward artificial general intelligence.

AIFine‑tuningKnowledge Graph
0 likes · 23 min read
Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs