Tagged articles
122 articles
Page 2 of 2
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 21, 2025 · Artificial Intelligence

How Browser‑Use Leverages AI Prompts for Seamless Browser Automation

This article explains how the open‑source browser‑use framework combines carefully designed SystemMessage prompts, structured HumanMessage inputs, and LangChain‑driven tool calls to enable large language models to automate complex web tasks such as shopping, CRM updates, résumé processing, and document generation, while providing concrete code examples and best‑practice tips.

AI automationLangChainLarge Language Model
0 likes · 21 min read
How Browser‑Use Leverages AI Prompts for Seamless Browser Automation
ByteDance Web Infra
ByteDance Web Infra
Jul 3, 2025 · Artificial Intelligence

How the New XPath‑Based Cache Boosts AI Automation Performance by 37%

The update introduces a YAML‑based cache with precise XPath targeting, dual‑validation and smart fallback, a structured API for extracting booleans, numbers, strings and queries, enhanced replay reports with custom nodes and video export, plus extensive web, Android, and reporting optimizations that dramatically improve performance and reduce report size.

AI automationCachingJavaScript
0 likes · 5 min read
How the New XPath‑Based Cache Boosts AI Automation Performance by 37%
dbaplus Community
dbaplus Community
Jun 7, 2025 · Artificial Intelligence

How Large Language Models Are Transforming Data Warehousing: Real-World Experiments and Lessons

The article shares practical experiences using large language models such as Cursor and DeepSeek in data‑warehouse workflows, covering assisted coding, automated metric extraction, self‑service analysis, documentation generation, their benefits, limitations, and the broader impact on data engineering roles.

AI automationData WarehouseLLM
0 likes · 9 min read
How Large Language Models Are Transforming Data Warehousing: Real-World Experiments and Lessons
Sohu Tech Products
Sohu Tech Products
Apr 29, 2025 · Industry Insights

Why Claude + MCP Is Outpacing Traditional IDEs Like Cursor and Windsurf

The article analyzes how Claude combined with custom MCPs such as ClaudeCommander dramatically reduces the popularity of traditional IDEs by offering automatic codebase exploration, multi‑step task planning, and long‑running automation like video compression, while providing step‑by‑step installation and usage guidance.

AI automationClaudeIDE comparison
0 likes · 9 min read
Why Claude + MCP Is Outpacing Traditional IDEs Like Cursor and Windsurf
Snowball Engineer Team
Snowball Engineer Team
Mar 31, 2025 · Frontend Development

Leveraging Multimodal Large Language Models for Frontend Automated Testing (NL2Test)

This article explores how multimodal large language models (MM‑LLMs) combined with structured prompt engineering can transform frontend regression testing by enabling natural‑language‑driven test case generation, visual verification, and script self‑healing, thereby reducing maintenance costs and improving coverage across dynamic UI scenarios.

AI automationMultimodal LLMNL2Test
0 likes · 17 min read
Leveraging Multimodal Large Language Models for Frontend Automated Testing (NL2Test)
Architecture and Beyond
Architecture and Beyond
Mar 9, 2025 · Artificial Intelligence

Evolution of AI Interaction Paradigms: From Function Calling to MCP and AI Agents

The article examines the rapid rise of AI agents like Manus and OpenManus, explains the limitations of cloud‑only models, details the Function Calling mechanism and its pros and cons, introduces the Model Context Protocol (MCP) as a more powerful evolution, and finally describes how AI Agents combine planning, dynamic tool use, memory, and autonomous decision‑making to achieve fully closed‑loop intelligent automation.

AI AgentAI automationAI interaction
0 likes · 20 min read
Evolution of AI Interaction Paradigms: From Function Calling to MCP and AI Agents
Architect
Architect
Mar 8, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP): Architecture, Core Components, and Practical Guide

This article provides a comprehensive overview of the Model Context Protocol (MCP), explaining its purpose, core components, differences from traditional APIs, detailed architecture, message types, connection lifecycle, error handling, and step‑by‑step instructions for building and using MCP servers to enable AI agents to act on real‑world data and tasks.

AI automationAI tool integrationClaude
0 likes · 12 min read
Understanding Model Context Protocol (MCP): Architecture, Core Components, and Practical Guide
Ops Development & AI Practice
Ops Development & AI Practice
Jan 27, 2025 · Artificial Intelligence

How OpenAI’s Operator Lets AI Control Browsers Like a Human

The article explains OpenAI’s newly released Operator feature that enables AI to simulate human browser actions, outlines its underlying technologies, explores diverse application scenarios such as web automation and virtual assistants, and discusses the challenges and limitations of this breakthrough.

AI automationBrowser ControlOpenAI
0 likes · 9 min read
How OpenAI’s Operator Lets AI Control Browsers Like a Human
AI Large Model Application Practice
AI Large Model Application Practice
Dec 9, 2024 · Artificial Intelligence

How GUI Agents Use Large Models to Automate Any Desktop Task

This article explains why GUI agents are needed, defines their multimodal capabilities, walks through a high‑level automation scenario, details the architecture of large‑model‑driven GUI agents, highlights recent open‑source projects, and compares them with traditional RPA solutions.

AI automationGUI AgentHuman-Computer Interaction
0 likes · 10 min read
How GUI Agents Use Large Models to Automate Any Desktop Task
21CTO
21CTO
Jul 23, 2024 · Artificial Intelligence

What Is Agentic AI? How Autonomous Agents Boost Productivity and Transform Industries

Agentic AI, also known as autonomous AI agents, enables systems to perceive environments, make decisions, act, and continuously learn, offering higher productivity, smarter decision‑making, and industry‑wide transformation across sectors such as customer service, healthcare, finance, and manufacturing.

AI automationAI frameworksMachine Learning
0 likes · 13 min read
What Is Agentic AI? How Autonomous Agents Boost Productivity and Transform Industries
phodal
phodal
Oct 19, 2023 · Operations

Can LLMs Revolutionize Code Review? Inside AutoDev’s AI‑Powered Approach

The article examines how rising code volume and AI‑generated snippets challenge traditional code review, proposes an LLM‑assisted workflow using AutoDev and DevOpsGenius, details prompt design, commit filtering, and implementation steps, and discusses the benefits and limitations for different team roles.

AI automationCode ReviewDevOps
0 likes · 9 min read
Can LLMs Revolutionize Code Review? Inside AutoDev’s AI‑Powered Approach
phodal
phodal
Jul 2, 2023 · Industry Insights

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

This article explores how large language models can standardize software engineering practices by converting requirements and designs into DSL formats, enabling more automated and efficient code generation, while discussing the challenges of dynamic context building, DSL specification, and the evolving role of LLMs in development pipelines.

AI automationDSLLLM
0 likes · 14 min read
Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development
Tencent Cloud Developer
Tencent Cloud Developer
Apr 17, 2023 · Artificial Intelligence

AutoGPT: An Overview of Autonomous AI Agents

AutoGPT is an open‑source autonomous AI agent that uses GPT‑4/3.5 APIs to decompose user‑defined goals into sub‑tasks, iteratively execute them, store results in memory, and autonomously build complex outputs such as code, websites, research, or financial plans, though it can incur high token costs and limited transparency.

AI automationAutoGPTGPT-4
0 likes · 8 min read
AutoGPT: An Overview of Autonomous AI Agents
DataFunSummit
DataFunSummit
Mar 1, 2023 · Artificial Intelligence

Automating High-Fidelity Digital Human Creation: Scanning, Driving, and Remaining Challenges

The article details YINGMOU's research on automating the production of high‑fidelity digital humans, covering their rapid 3‑5‑day pipeline, extensive face‑asset database, advanced light‑field scanning, automatic topology reconstruction, AI‑driven rigging, dynamic mapping, and the unresolved issues of hair and cloth.

AI automationMachine LearningPBR materials
0 likes · 12 min read
Automating High-Fidelity Digital Human Creation: Scanning, Driving, and Remaining Challenges
Baidu Tech Salon
Baidu Tech Salon
Oct 24, 2022 · Industry Insights

How Baidu Achieved Unmanned Delivery with Risk‑Driven Testing and AI

This article examines Baidu's risk‑driven, AI‑enhanced approach to unmanned software delivery, detailing the evolution of testing automation, the three human dependencies it eliminates, and the essential capabilities—comprehensive testing, stable builds, and precise risk evaluation—required to free testers from manual intervention.

AI automationcontinuous integrationquality assessment
0 likes · 12 min read
How Baidu Achieved Unmanned Delivery with Risk‑Driven Testing and AI
DataFunTalk
DataFunTalk
Oct 15, 2022 · Artificial Intelligence

AutoDL: Automated and Interpretable Deep Learning – Research Highlights from Baidu Big Data Lab

This article reviews Baidu Big Data Lab's recent advances in automated deep learning (AutoDL), covering its research breakthroughs, integration with PaddlePaddle/PaddleHub, industrial deployments, transfer learning innovations, and future directions for AI automation and interpretability.

AI automationAutoDLNeural Architecture Search
0 likes · 19 min read
AutoDL: Automated and Interpretable Deep Learning – Research Highlights from Baidu Big Data Lab
21CTO
21CTO
Feb 27, 2020 · R&D Management

From Gaming Addict to CTO: My Journey of Breaking Barriers

The author recounts his transformation from a teenage gaming enthusiast in a small Chinese university to a senior engineer, manager, and CTO, detailing pivotal moments, lessons on coding, leadership, AI‑driven automation, entrepreneurship, and strategic thinking that shaped his career.

AI automationEntrepreneurshipLeadership
0 likes · 12 min read
From Gaming Addict to CTO: My Journey of Breaking Barriers