Tagged articles

75 articles

Page 1 of 1

Apr 9, 2026 · Backend Development

Build a PHP‑Powered AI Video Assistant with Webman, Neuron AI & FFmpeg

This guide shows PHP developers how to create a smart video‑processing agent by combining the high‑performance Webman framework, the Neuron AI agent library supporting multiple LLMs, and FFmpeg tools, covering stack selection, core implementation steps, sample code for tools, controller integration, and visual demos of video info extraction, screenshot and transcoding.

LLMVideo processingWebman

0 likes · 9 min read

Build a PHP‑Powered AI Video Assistant with Webman, Neuron AI & FFmpeg

AI Explorer

Mar 8, 2026 · Artificial Intelligence

AutoClip: One‑Click AI Video Highlight Extraction and Editing

AutoClip is an open‑source, locally‑run tool that uses Alibaba's Qwen large language model and OpenAI Whisper to automatically download, transcribe, analyze, and cut high‑light segments from YouTube or Bilibili videos, offering real‑time task monitoring, smart collections, preview, Docker deployment, and a roadmap of future AI‑driven features.

AI video editingDockerFastAPI

0 likes · 7 min read

AutoClip: One‑Click AI Video Highlight Extraction and Editing

ByteDance Data Platform

Dec 23, 2025 · Artificial Intelligence

How Daft and Ray Supercharge Million‑Hour Video Processing for AI‑Powered Robotics

This article details a scalable, distributed pipeline that uses LAS AI Data Lake, Daft on Ray, and advanced video‑processing techniques—scene detection, splitting, frame sampling, filtering, and caption generation—to transform tens of millions of hours of robot‑captured video into high‑quality, searchable semantic data while dramatically boosting CPU and GPU utilization.

AI PipelineDaftDistributed computing

0 likes · 21 min read

How Daft and Ray Supercharge Million‑Hour Video Processing for AI‑Powered Robotics

Sohu Smart Platform Tech Team

Nov 20, 2025 · Artificial Intelligence

How Hooop Turns HarmonyOS into an Offline AI Basketball Coach

Hooop leverages HarmonyOS's on‑device AI and custom vision algorithms to provide real‑time, offline basketball training by detecting shots, analyzing trajectories, automatically clipping scoring clips, and tracking performance metrics without an internet connection.

AIHarmonyOSVideo processing

0 likes · 12 min read

How Hooop Turns HarmonyOS into an Offline AI Basketball Coach

Baidu Intelligent Cloud Tech Hub

Nov 4, 2025 · Artificial Intelligence

How Baidu’s Baige Accelerates Multimodal Video Training with Context Parallelism

Baidu Baige’s enhanced veRL framework dramatically boosts video frame rates and resolution limits, cuts training time, reduces memory usage, and improves model accuracy by leveraging context parallelism and optimized attention on Ampere GPUs for multimodal mixed‑training scenarios.

AI accelerationContext ParallelismVideo processing

0 likes · 6 min read

How Baidu’s Baige Accelerates Multimodal Video Training with Context Parallelism

Python Programming Learning Circle

Aug 29, 2025 · Fundamentals

10 Essential Python Automation Scripts to Eliminate Repetitive Tasks

This article presents ten practical Python automation scripts—covering HTML parsing, QR code scanning, screenshots, audiobooks, PDF editing, StackOverflow queries, mobile control, temperature monitoring, Instagram uploads, and video watermarking—to help you streamline daily repetitive tasks efficiently.

Mobile AutomationPDFScripting

0 likes · 14 min read

10 Essential Python Automation Scripts to Eliminate Repetitive Tasks

Baidu Geek Talk

Aug 20, 2025 · Mobile Development

How Mobile Video Players Boost Visual Quality with Real‑Time Brightness and Color Enhancement

This article explains the engineering of mobile video post‑processing techniques—brightness and color enhancement using GPU shaders, linear gain, YUV scaling, gamma correction, adaptive saturation, HSV adjustments, and skin‑tone protection—to improve clarity, contrast, and naturalness while maintaining real‑time performance.

GPU shaderMobileVideo processing

0 likes · 14 min read

How Mobile Video Players Boost Visual Quality with Real‑Time Brightness and Color Enhancement

Sohu Tech Products

Jul 30, 2025 · Fundamentals

Mastering the Chain of Responsibility: Refactor Video Processing with SOLID Principles

This article demonstrates how to apply the Chain of Responsibility design pattern together with SOLID principles to refactor a video upload, transcoding, review and publishing workflow, providing clear Java examples, UML diagrams, and Spring integration for a maintainable, extensible solution.

Chain of ResponsibilityDesign PatternsJava

0 likes · 13 min read

Mastering the Chain of Responsibility: Refactor Video Processing with SOLID Principles

Python Programming Learning Circle

Jul 8, 2025 · Artificial Intelligence

Create a Dancing Word Cloud from Bilibili Videos with Python – Full Step‑by‑Step Guide

This tutorial walks you through building a Python project that downloads a Bilibili video, extracts its frames, applies Baidu AI human segmentation, scrapes danmu comments, generates a stylized word‑cloud animation, and finally composes a video with background music, showcasing video processing, AI, and data visualization techniques.

AI segmentationBilibiliOpenCV

0 likes · 11 min read

Create a Dancing Word Cloud from Bilibili Videos with Python – Full Step‑by‑Step Guide

DaTaobao Tech

Mar 5, 2025 · Artificial Intelligence

Multimodal Large‑Model Cover Generation AI Agent for Taobao Video and Live Streams

Taobao’s new multimodal AI Agent automatically creates high‑quality static and dynamic video covers by planning tasks, consulting a memory of quality criteria, executing frame selection with ReKV streaming and dual‑stage evaluation, generating marketing copy via fine‑tuned Qwen2.5‑7B, and refining layout, resulting in significantly higher click‑through rates, lower latency, and reduced manual effort.

AIMultimodalVideo processing

0 likes · 17 min read

Multimodal Large‑Model Cover Generation AI Agent for Taobao Video and Live Streams

DeWu Technology

Jan 22, 2025 · Operations

How We Cut Video Detection Memory Usage by 78% with WebAssembly and WorkerFS

This article details the challenges of video corruption detection on a creator platform, analyzes existing server‑side and client‑side approaches, and presents a WebAssembly‑based solution using ffmpeg, WorkerFS, and memory‑growth tuning that reduces memory consumption by up to 78% while speeding up large‑file processing.

Memory OptimizationPerformanceVideo processing

0 likes · 13 min read

How We Cut Video Detection Memory Usage by 78% with WebAssembly and WorkerFS

Kuaishou Audio & Video Technology

Jan 21, 2025 · Fundamentals

How Kuaishou Enables Full‑Chain Dolby Vision Support for UGC

Kuaishou partners with Dolby Labs to bring full‑chain Dolby Vision to its short‑video platform, detailing the technology behind HDR, dynamic metadata, and a brightness‑adjustment solution that ensures seamless playback and optimal visual experience for user‑generated content across devices.

Dolby VisionDynamic MetadataExtended Brightness

0 likes · 10 min read

How Kuaishou Enables Full‑Chain Dolby Vision Support for UGC

Kuaishou Tech

Jan 17, 2025 · Artificial Intelligence

Kuaishou Achieves 7 Papers Accepted at AAAI 2025

Kuaishou has achieved a significant milestone with 7 papers accepted at AAAI 2025, covering diverse AI research areas including video processing, recommendation systems, and image restoration, demonstrating the company's strong research capabilities in artificial intelligence.

AAAI 2025Artificial IntelligenceKuaishou

0 likes · 10 min read

Kuaishou Achieves 7 Papers Accepted at AAAI 2025

Test Development Learning Exchange

Jan 13, 2025 · Artificial Intelligence

Python Tool for Converting English Videos to Chinese Dubbed Videos with Subtitles

This article provides a comprehensive guide on developing a Python tool to convert English videos into versions with Chinese dubbing and subtitles, covering all steps from audio extraction to final synthesis.

AI toolsPythonSpeech Recognition

0 likes · 5 min read

Python Tool for Converting English Videos to Chinese Dubbed Videos with Subtitles

Python Programming Learning Circle

Sep 11, 2024 · Artificial Intelligence

Python Tutorial: Download Bilibili Video, Extract Frames, Perform Human Segmentation with Baidu AI, Generate Word Cloud, and Compose Final Video

This article demonstrates how to use Python to download a Bilibili video, extract frames with OpenCV, perform human segmentation via Baidu AI, generate a word‑cloud animation using MoviePy, and finally compose the processed clips into a complete video with added audio.

AI segmentationOpenCVVideo processing

0 likes · 13 min read

Python Tutorial: Download Bilibili Video, Extract Frames, Perform Human Segmentation with Baidu AI, Generate Word Cloud, and Compose Final Video

JD Retail Technology

Sep 3, 2024 · Backend Development

Design and Architecture of a New Video Review System with Streamlined Frame Extraction and Parallel Processing

This article presents the design goals, architecture, technology selection, and component details of a unified video review system that leverages FFmpeg for frame extraction, stream‑based parallel processing, and flexible synchronous/asynchronous workflows to achieve low latency and high scalability.

StreamingSystem ArchitectureVideo processing

0 likes · 10 min read

Design and Architecture of a New Video Review System with Streamlined Frame Extraction and Parallel Processing

Python Crawling & Data Mining

Aug 7, 2024 · Fundamentals

How to Build a Python Screen Recorder with OpenCV, Pillow, and Keyboard Hotkeys

This article walks through creating a Python‑based screen recorder on Windows 10 using Pillow for screenshots, OpenCV for video encoding, NumPy for frame processing, and pynput to control recording via hotkeys, including steps to install dependencies, calculate optimal FPS, and save MP4 files.

Keyboard HotkeysOpenCVPython

0 likes · 13 min read

How to Build a Python Screen Recorder with OpenCV, Pillow, and Keyboard Hotkeys

Kuaishou Tech

Jun 18, 2024 · Artificial Intelligence

CVPR 2024 Conference Papers: Advances in AI and Computer Vision

KuaiShou presents 8 papers at CVPR 2024, covering AI advancements in computer vision, video quality assessment, and 3D generation, showcasing cutting-edge research in machine learning and multimedia technologies.

3D generationAICVPR

0 likes · 11 min read

CVPR 2024 Conference Papers: Advances in AI and Computer Vision

Bilibili Tech

Apr 26, 2024 · Artificial Intelligence

2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations

On World Intellectual Property Day, Bilibili honored ten breakthrough patents that together enable billion‑scale video duplicate detection, AI‑driven story generation, synchronized live rhythm‑games, automatic OTT casting, knowledge‑graph‑based content moderation, glitch‑free multi‑audio streaming, modular playback integration, neural‑network resolution encoding, AV1 reference‑frame pruning, and fine‑grained GPU isolation.

Artificial IntelligenceEncodingStreaming

0 likes · 6 min read

2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations

Python Programming Learning Circle

Apr 22, 2024 · Fundamentals

How to Convert a Dancing Video into an ASCII Art Video with Python

This guide walks you through downloading a B‑site video, extracting GIF frames, converting them to ASCII art, renaming and ordering the frames, turning the ASCII GIFs into images, assembling them into a video with OpenCV, and adding background music using moviepy, all with Python code.

ASCII artOpenCVPython

0 likes · 8 min read

How to Convert a Dancing Video into an ASCII Art Video with Python

Bilibili Tech

Apr 16, 2024 · Frontend Development

Design and Implementation of a High‑Performance Matroska Demuxer for Web Uploads

The new mkv-demuxer SDK replaces the slow FFmpeg-Wasm solution on Bilibili’s upload page by reading Matroska files in slice-sized ArrayBuffers, parsing EBML headers and SeekHead indexes, and exposing getMeta, getData, and seekFrame APIs, cutting memory use by 98 % and parsing time by 97 % while accelerating cover-generation and recommendation processing.

DemuxerMatroskaVideo processing

0 likes · 17 min read

Design and Implementation of a High‑Performance Matroska Demuxer for Web Uploads

Test Development Learning Exchange

Apr 12, 2024 · Fundamentals

Upscaling 720p Video to 1080p with ffmpeg‑python and OpenCV in Python

The article explains how to use ffmpeg‑python and OpenCV in Python to transcode a 720p video to 1080p, discusses the limitations of simple upscaling, and provides code examples for resolution scaling as well as basic color and sharpening enhancements.

OpenCVVideo processingffmpeg

0 likes · 6 min read

Upscaling 720p Video to 1080p with ffmpeg‑python and OpenCV in Python

360 Smart Cloud

Apr 3, 2024 · Backend Development

Understanding FFmpeg Hardware Acceleration Architecture and Implementation

FFmpeg provides a comprehensive, cross‑platform hardware acceleration framework that abstracts diverse GPU and dedicated video codec interfaces, defines HWContext types, device and frame contexts, and various codec configuration methods, enabling efficient video encoding, decoding, and filtering while addressing performance, compatibility, and pipeline complexity challenges.

GPUMultimediaVideo processing

0 likes · 10 min read

Understanding FFmpeg Hardware Acceleration Architecture and Implementation

Python Programming Learning Circle

Jan 26, 2024 · Operations

Eight Practical Python Automation Scripts for Everyday Tasks

This article presents eight ready‑to‑use Python scripts that automate common tasks such as image and video processing, scheduled email sending, PDF‑to‑image conversion, API data fetching, battery monitoring, testing with pytest, and file backup‑sync, complete with code examples.

Video processingimage-processing

0 likes · 11 min read

Eight Practical Python Automation Scripts for Everyday Tasks

Open Source Tech Hub

Jan 18, 2024 · Backend Development

Install and Use FFmpeg with PHP‑FFMpeg on Ubuntu

This guide explains what FFmpeg is, shows how to install it on Ubuntu 18.04, demonstrates integrating the Webman framework and PHP‑FFMpeg library, and provides step‑by‑step code examples for extracting images, adding watermarks, and basic video editing.

ComposerPHPUbuntu

0 likes · 6 min read

Install and Use FFmpeg with PHP‑FFMpeg on Ubuntu

Bilibili Tech

Jan 12, 2024 · Frontend Development

Understanding WebCodecs: Design Goals, Core API, Demos, and Application Scenarios

WebCodecs, introduced in Chrome 94, provides direct, low‑latency access to hardware‑accelerated audio and video codecs, enabling fine‑grained encoding and decoding control, composable streaming pipelines, and high‑performance demos such as controllable decoding, watermarking, chroma‑key, and client‑side video processing, while still lacking container support and broad browser compatibility.

Audio Video APIsBrowser MediaVideo processing

0 likes · 15 min read

Understanding WebCodecs: Design Goals, Core API, Demos, and Application Scenarios

Bilibili Tech

Dec 22, 2023 · Artificial Intelligence

Intelligent Media Technology and Innovative Applications: Information-Theoretic Principles for Transcoding System Optimization

The upcoming Shanghai Jiao‑Tong University seminar on Intelligent Media Technology will feature Bilibili’s Cai Chunlei presenting an information‑theoretic framework for jointly optimizing video transcoding pipelines, linking traditional coding, deep‑learning methods and future large‑model techniques to improve compression and guide practical system design.

AISeminarVideo processing

0 likes · 4 min read

Intelligent Media Technology and Innovative Applications: Information-Theoretic Principles for Transcoding System Optimization

NetEase Cloud Music Tech Team

Dec 21, 2023 · Artificial Intelligence

Video and Image Technologies in NetEase Cloud Music: Architecture, Algorithms, and Applications

The article examines NetEase Cloud Music’s video and image technology stack—covering a four‑module architecture, algorithms for content understanding, intelligent production, moderation, and interactive effects—and explains how these systems enhance user experience, streamline backend processing, and position the platform for future AIGC‑driven innovations.

AI AlgorithmsMultimodal LearningVideo processing

0 likes · 11 min read

Video and Image Technologies in NetEase Cloud Music: Architecture, Algorithms, and Applications

Rare Earth Juejin Tech Community

Oct 13, 2023 · Artificial Intelligence

Video Background Replacement Using RobustVideoMatting and Python

This tutorial explains how to use the open‑source RobustVideoMatting project with Python, PyTorch, and OpenCV to perform human portrait segmentation and replace video backgrounds, covering repository setup, model loading, custom segmentation functions, and full video compositing steps.

Background ReplacementOpenCVPyTorch

0 likes · 9 min read

Video Background Replacement Using RobustVideoMatting and Python

AntTech

Aug 24, 2023 · Artificial Intelligence

CoDeF: A Canonical Content Field Approach for Consistent Video Processing

The CoDeF algorithm introduced by Ant Group's Interactive Intelligence Lab transforms video processing into image processing using a canonical content field and a temporal deformation field, enabling seamless video style transfer, keypoint tracking, and interactive editing while preserving temporal consistency.

Video processingcanonical content fieldtemporal deformation

0 likes · 5 min read

CoDeF: A Canonical Content Field Approach for Consistent Video Processing

Test Development Learning Exchange

Aug 21, 2023 · Fundamentals

Extracting a Specific Frame from a Video as a Cover Image Using Python and OpenCV

This article demonstrates how to use Python's OpenCV library to extract a designated frame from a video file and save it as a cover image, providing step‑by‑step explanations and complete sample code suitable for automation tasks such as generating video thumbnails.

PythonVideo processingthumbnail

0 likes · 3 min read

Extracting a Specific Frame from a Video as a Cover Image Using Python and OpenCV

IT Services Circle

Mar 3, 2023 · Backend Development

FFmpeg 6.0 “Von Neumann” Released with New Encoders, Decoders, Filters, and ABI Versioning

FFmpeg 6.0 “Von Neumann” has been officially released, introducing numerous new encoders, decoders, and filters, adding ABI versioning to major releases, deprecating old APIs, and enhancing CLI performance with threading, statistics options, and file‑based filter options, while outlining upcoming features for version 6.1.

CLIDecodersEncoders

0 likes · 6 min read

FFmpeg 6.0 “Von Neumann” Released with New Encoders, Decoders, Filters, and ABI Versioning

Programmer DD

Mar 3, 2023 · Backend Development

FFmpeg 6.0 Highlights: New Codecs, Filters, and Performance Boosts

FFmpeg 6.0 "Von Neumann" introduces a host of new codecs, decoders, filters, CLI enhancements, ABI versioning, and a more frequent release cadence, offering developers expanded multimedia processing capabilities and improved performance across platforms.

Backend DevelopmentMultimediaSoftware Release

0 likes · 6 min read

FFmpeg 6.0 Highlights: New Codecs, Filters, and Performance Boosts

Bilibili Tech

Feb 24, 2023 · Artificial Intelligence

Understanding Video Super-Resolution: Principles, Common Defects, and Practical Enhancement Techniques

Video super‑resolution, pioneered by deep‑learning models such as SRCNN, can synthesize plausible high‑frequency detail but often introduces artifacts like loss of stylistic noise, inconsistent line depth, texture smearing, and temporal flicker, which can be mitigated through preprocessing (BM3D denoising, descaling), targeted post‑processing (Gaussian blur, unsharp masking) and selective edge‑based texture merging to preserve original artistic style while enhancing perceived sharpness.

BM3DCUGANFourier Transform

0 likes · 13 min read

Understanding Video Super-Resolution: Principles, Common Defects, and Practical Enhancement Techniques

Baidu Geek Talk

Oct 12, 2022 · Backend Development

Understanding Video Color Spaces, Gamma Correction, and Transcoding with FFmpeg

Video processing involves converting linear sensor data through gamma correction and multiple color‑space transformations—such as RGB, YUV, and XYZ—using standards like BT.601/709/2020, with FFmpeg’s colorspace filter and ffprobe to manage transfer functions, primaries, and ranges during transcoding to preserve accurate colors across devices.

Color ManagementVideo processingcolor space

0 likes · 12 min read

Understanding Video Color Spaces, Gamma Correction, and Transcoding with FFmpeg

Shopee Tech Team

Aug 12, 2022 · Backend Development

Shopee Video Technology: Backend Services, High‑Definition Low‑Bitrate Optimization, and Performance Enhancements

Shopee’s video platform combines live‑stream and on‑demand transcoding, link‑mic, multi‑party mixing, and backend editing services with a proprietary high‑definition low‑bitrate pipeline that leverages GPU and CPU encoders, AI‑enhanced pre‑processing, hierarchical B‑frames, and SIMD‑optimized sharpening to deliver high‑quality video on low‑end devices while cutting compute costs, and the company is actively recruiting engineers for further development.

AI enhancementPerformance OptimizationReal-time communication

0 likes · 19 min read

Shopee Video Technology: Backend Services, High‑Definition Low‑Bitrate Optimization, and Performance Enhancements

Xiaohongshu Tech REDtech

Jul 23, 2022 · Mobile Development

Xiaohongshu Deploys On‑Device Super‑Resolution with Huawei HMS Core for High‑Quality Short Videos

Xiaohongshu, partnering with Huawei HMS Core, now runs on‑device super‑resolution for short videos, instantly upscaling 540p to 1080p and enhancing 720p content using GPU/NPU via HiAI, cutting bandwidth and stutter while keeping power use low across hundreds of Huawei devices.

AI accelerationAndroid NDKHuawei HMS Core

0 likes · 9 min read

Xiaohongshu Deploys On‑Device Super‑Resolution with Huawei HMS Core for High‑Quality Short Videos

Python Programming Learning Circle

Jun 14, 2022 · Fundamentals

Creating an ASCII‑Art Video from a Celebrity Clip Using Python and OpenCV

This tutorial demonstrates how to convert a video of a celebrity into an ASCII art video using Python 3.7, OpenCV, PIL, and NumPy, covering tool setup, frame extraction, grayscale conversion, K‑means clustering for brightness mapping, character rendering, and final video assembly.

ASCII artOpenCVPython

0 likes · 13 min read

Creating an ASCII‑Art Video from a Celebrity Clip Using Python and OpenCV

Youku Technology

Jun 9, 2022 · Mobile Development

Design and Architecture of the Cross-Platform Multimedia Rendering Engine OPR

The OPR engine provides a cross‑platform, GPU‑accelerated rendering framework that unifies audio‑video pre‑ and post‑processing, native UI‑driven danmaku rendering, and real‑time visual effects such as human‑body recognition, using a modular command‑stream architecture, C++ core, monitoring tools, and extensibility for future Vulkan, VR, and plugin integration.

GPUNative UIVideo processing

0 likes · 15 min read

Design and Architecture of the Cross-Platform Multimedia Rendering Engine OPR

Bilibili Tech

Apr 26, 2022 · Artificial Intelligence

2022 Bilibili Technology Patent Selection Awards

The 2022 Bilibili Technology Patent Selection Awards honored ten innovative projects across Best Popularity and Most Popular categories, showcasing advances such as advanced bullet comments, optimized gift animation, video rendering, virtual avatar production, virtual material editing, mini‑program integration, AI‑driven live‑stream switching, blur‑face enhancement, and ghost video tools.

AI enhancementBilibiliMini Program

0 likes · 8 min read

2022 Bilibili Technology Patent Selection Awards

Python Crawling & Data Mining

Feb 22, 2022 · Artificial Intelligence

Create a Dancing Word‑Cloud Video with Python and AI

This tutorial walks through downloading a dance video, extracting frames, using Baidu AI for person segmentation, generating word‑cloud masks, and stitching the results into a dancing word‑cloud video with Python, OpenCV and the WordCloud library.

Baidu AIOpenCVVideo processing

0 likes · 8 min read

Create a Dancing Word‑Cloud Video with Python and AI

Python Programming Learning Circle

Feb 14, 2022 · Fundamentals

How to Convert Video to GIF Using Python and MoviePy

This tutorial explains how to install the MoviePy library, write Python code to load a video file, and generate a GIF while controlling size through resolution scaling, frame rate reduction, sub‑clipping, and output dimensions, all with clear code examples and visual results.

GIFTutorialVideo processing

0 likes · 4 min read

How to Convert Video to GIF Using Python and MoviePy

Bilibili Tech

Jan 28, 2022 · Artificial Intelligence

Real-CUGAN: An Open‑Source AI Super‑Resolution Model for Anime Video Upscaling

Real‑CUGAN is an open‑source AI super‑resolution model that upscales anime video up to 4× using a million‑patch, frequency‑domain‑supervised dataset, delivering faster inference than Real‑ESRGAN, seamless Waifu2x compatibility, and superior texture, line and artifact handling, with code released on GitHub.

AI super-resolutionReal-CUGANVideo processing

0 likes · 8 min read

Real-CUGAN: An Open‑Source AI Super‑Resolution Model for Anime Video Upscaling

Kuaishou Tech

Jan 20, 2022 · Artificial Intelligence

Understanding Kuaishou's KFRUC Algorithm: A Technical Deep Dive into Video Frame Interpolation

This article provides a comprehensive technical analysis of Kuaishou's self-developed KFRUC video frame interpolation algorithm, detailing its motion estimation, occlusion localization, and motion compensation mechanisms to enhance playback smoothness and visual quality in slow-motion and high-frame-rate video applications.

KFRUC AlgorithmMEMCSlow Motion Technology

0 likes · 8 min read

Understanding Kuaishou's KFRUC Algorithm: A Technical Deep Dive into Video Frame Interpolation

Bitu Technology

Jan 7, 2022 · Backend Development

Design and Implementation of Tubi Multimedia Processing Platform (TMPP)

The article details Tubi's Multimedia Processing Platform (TMPP), describing its architecture, processing stages, resource management, and distributed task scheduling for large‑scale video transcoding and delivery across multiple devices.

Cloud ComputingDistributed SystemsResource Management

0 likes · 8 min read

Design and Implementation of Tubi Multimedia Processing Platform (TMPP)

Python Crawling & Data Mining

Jan 7, 2022 · Fundamentals

How to Build a Python Screen Recorder with OpenCV, Pillow, and pynput

Learn how to create a Python-based screen recording tool on Windows 10 using Pillow for screenshots, OpenCV for video encoding, NumPy for frame processing, and pynput for hotkey control, with step-by-step code examples, optimal FPS calculation, and MP4 saving techniques.

OpenCVScreen RecordingVideo processing

0 likes · 13 min read

How to Build a Python Screen Recorder with OpenCV, Pillow, and pynput

Python Programming Learning Circle

Jan 4, 2022 · Artificial Intelligence

Python Project: Download Bilibili Video, Extract Frames, Perform Human Segmentation, Generate Word Cloud, and Compose Final Video

This tutorial walks through a complete Python workflow that downloads a B‑site video, extracts frames with OpenCV, uses Baidu AI for human segmentation, crawls danmu comments, creates a masked word‑cloud animation, and finally merges the clips with audio into a polished video.

OpenCVVideo processingmoviepy

0 likes · 12 min read

Python Project: Download Bilibili Video, Extract Frames, Perform Human Segmentation, Generate Word Cloud, and Compose Final Video

Douyu Streaming

Dec 1, 2021 · Mobile Development

How to Get, Build, and Extend WebRTC m79 Source for Windows, Android, and iOS

This guide explains how to obtain the WebRTC m79 source, compile it for Windows, Android, and iOS, walk through the basic signaling and peer‑connection workflow, and implement advanced video‑capture and audio‑volume features with custom C++ extensions, while unifying the codebase across platforms.

Audio ProcessingCVideo processing

0 likes · 19 min read

How to Get, Build, and Extend WebRTC m79 Source for Windows, Android, and iOS

NetEase Smart Enterprise Tech+

Nov 16, 2021 · Mobile Development

Integrating Faceunity Beauty SDK with NERtc on Android and iOS

This guide explains the core concepts, integration steps, and troubleshooting tips for using the Faceunity (相芯) Beauty SDK with NetEase NERtc on Android and iOS, covering OpenGL ES basics, EGL/EAGL interfaces, three rendering schemes, resource management, and platform‑specific setup.

AndroidMobile DevelopmentNERtc

0 likes · 13 min read

Integrating Faceunity Beauty SDK with NERtc on Android and iOS

High Availability Architecture

Oct 21, 2021 · Cloud Computing

Optimizing NetEase Cloud Music Audio/Video Processing Platform with Serverless

This article describes how NetEase Cloud Music leveraged Serverless function computing to redesign its audio/video algorithm processing platform, covering the existing challenges, the selection criteria for Serverless solutions, the implementation details, performance gains, cost savings, and future directions.

Audio ProcessingCloud FunctionsNetEase

0 likes · 11 min read

Optimizing NetEase Cloud Music Audio/Video Processing Platform with Serverless

Python Programming Learning Circle

Aug 30, 2021 · Frontend Development

Creating a Custom Dynamic Desktop Wallpaper with Python and PyQt5

This tutorial walks through building a dynamic desktop wallpaper on Windows using Python's PyQt5, covering UI layout design, video loading and preview, desktop handle acquisition, wallpaper rendering, and graceful shutdown, with complete code examples for each step.

Desktop applicationDynamic WallpaperPyQt5

0 likes · 12 min read

Creating a Custom Dynamic Desktop Wallpaper with Python and PyQt5

Taobao Frontend Technology

Aug 10, 2021 · Frontend Development

Optimizing Video Thumbnail Selection: Canvas vs FFmpeg WebAssembly

This article examines how Taobao's front‑end team built a custom video frame‑capture tool, compares video+canvas with FFmpeg‑WebAssembly approaches, presents testing results, implementation details, and future optimizations to improve thumbnail selection efficiency and user experience.

CanvasVideo processingWebAssembly

0 likes · 5 min read

Optimizing Video Thumbnail Selection: Canvas vs FFmpeg WebAssembly

MaGe Linux Operations

Jul 18, 2021 · Fundamentals

Turn a Bilibili Dance Clip into an ASCII‑Art Video with Python

Learn how to download a Bilibili dance video, extract GIF frames, convert them to ASCII art, rename and order the frames, transform them into images, and finally stitch them into a music‑backed video using Python tools such as you‑get, OpenCV, and moviepy.

ASCII artVideo processingmoviepy

0 likes · 9 min read

Turn a Bilibili Dance Clip into an ASCII‑Art Video with Python

Python Programming Learning Circle

Jul 9, 2021 · Fundamentals

How to Convert a Dancing Video into an ASCII Art Video Using Python

This tutorial walks through downloading a Bilibili dance video, extracting GIF frames, converting each frame to ASCII art, renaming and ordering the frames, converting them to images, and finally assembling them into a video with background music using Python libraries such as you-get, OpenCV, Pillow, and moviepy.

OpenCVVideo processingmoviepy

0 likes · 9 min read

How to Convert a Dancing Video into an ASCII Art Video Using Python

Tencent Cloud Developer

Jun 22, 2021 · Cloud Computing

Let's Dive Into Serverless World: Tencent Cloud's Serverless Development and Latest Trends

Tencent Cloud’s serverless platform, now serving over a million developers and billions of daily invocations, accelerates business and education workloads, enables massive elastic scaling, integrates video, GPU, and event‑bus services, and simplifies migration, debugging, and SaaS integration, heralding serverless as the next mainstream cloud paradigm.

Cloud ComputingCloud NativeDeveloper Experience

0 likes · 17 min read

Let's Dive Into Serverless World: Tencent Cloud's Serverless Development and Latest Trends

Volcano Engine Developer Services

Jun 16, 2021 · Backend Development

How ByteDance’s Video Processing Platform Achieves Billion‑Scale High Availability

This article explains how ByteDance’s Volcano Engine video platform handles the entire video lifecycle—from client‑side capture to cloud processing, delivery, and playback—by employing a multi‑plane architecture, scalable workflow system, function compute platform, and the dynamic BMF framework to meet massive scale, ensure high availability, improve user experience, and reduce costs.

Function ComputeVideo processinghigh availability

0 likes · 19 min read

How ByteDance’s Video Processing Platform Achieves Billion‑Scale High Availability

Python Crawling & Data Mining

Jun 2, 2021 · Fundamentals

Create Stunning 9‑Grid Short Videos with Python in One Click

Learn how to use Python's moviepy and Pillow libraries to automatically split a video into nine segments, arrange them into a stylish 9‑grid layout, add background music, and export the result as a polished short video, complete with step‑by‑step code examples.

Video processingmoviepy

0 likes · 8 min read

Create Stunning 9‑Grid Short Videos with Python in One Click

Kuaishou Tech

May 7, 2021 · Artificial Intelligence

Kuaishou–Tsinghua Joint Research Institute Showcases AI and Video Technology Collaboration at the Software Discipline Development Forum

The Kuaishou–Tsinghua Future Media Data Joint Research Institute co‑hosted the 2021 Software Discipline Development Forum, highlighting extensive AI‑driven video analysis, computer‑vision, multimodal learning, and recommendation‑system research, as well as talent cultivation and innovative VR livestream experiences for the university’s 110th anniversary celebrations.

AIIndustry-Academia CollaborationVR Live Streaming

0 likes · 7 min read

Kuaishou–Tsinghua Joint Research Institute Showcases AI and Video Technology Collaboration at the Software Discipline Development Forum

HomeTech

Apr 21, 2021 · Artificial Intelligence

AI-Powered Masked Danmaku: Design and Implementation

This article details the design and practical implementation of an AI-driven masked danmaku system that prevents comment overlay on video content, covering background, technology selection, instance segmentation methods, distributed task scheduling, mask generation, client rendering, performance optimizations, and future directions.

AIDistributed SystemsMask Danmaku

0 likes · 18 min read

AI-Powered Masked Danmaku: Design and Implementation

Baidu Geek Talk

Mar 17, 2021 · Artificial Intelligence

Overview of Baidu's Wànxiàng System for Large‑Scale Rich Media Processing

Baidu’s Wànxiàng system processes billions of images and videos daily by extracting low‑ and high‑level features, linking related media, and aggregating semantic attributes in a scalable, timely architecture that leverages thousands of CPU, GPU, and FPGA cores to power accurate, low‑latency rich‑media search and recommendation.

Artificial IntelligenceBaiduImage Analysis

0 likes · 14 min read

Overview of Baidu's Wànxiàng System for Large‑Scale Rich Media Processing

360 Tech Engineering

Feb 23, 2021 · Artificial Intelligence

Video Stutter Detection via Frame Difference Analysis Using FFmpeg

This article explains a method for detecting video stutter by converting uploaded videos into frame sequences with ffmpeg, calculating pixel differences between consecutive frames, aggregating motion metrics, removing scene‑change effects, computing a dynamic factor, and outputting a binary result indicating the presence or absence of stutter.

Video processingalgorithmcomputer vision

0 likes · 5 min read

Video Stutter Detection via Frame Difference Analysis Using FFmpeg

iQIYI Technical Product Team

Feb 5, 2021 · Artificial Intelligence

Efficient General‑Purpose Frame Extraction for AI Video Inference Services

The paper presents a unified, high‑performance frame‑extraction framework that dynamically selects CPU or GPU decoding, leverages multithreaded and CUDA‑accelerated pipelines, keeps frames in memory, and achieves up to ten‑fold latency reductions for diverse AI video‑inference tasks.

AI video inferenceCPU optimizationGPU Acceleration

0 likes · 14 min read

Efficient General‑Purpose Frame Extraction for AI Video Inference Services

Selected Java Interview Questions

Dec 29, 2020 · Artificial Intelligence

Open-Source Video Object Removal Tool Using PyTorch Allows Deleting Elements via Bounding Boxes

An open‑source PyTorch‑based project enables users to remove unwanted objects from videos simply by drawing a bounding box around them, offering a practical demo, step‑by‑step instructions, and a GitHub repository with over 2 k stars.

Object RemovalOpen SourcePyTorch

0 likes · 2 min read

Open-Source Video Object Removal Tool Using PyTorch Allows Deleting Elements via Bounding Boxes

Python Crawling & Data Mining

Dec 22, 2020 · Artificial Intelligence

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Learn how to generate cinematic ghosting effects in videos by leveraging PaddlePaddle’s PaddleHub deep learning library and the pretrained deeplabv3p_xception65 model for semantic segmentation, with step‑by‑step code, environment setup, and practical testing on classic martial‑arts footage.

Ghost EffectPaddlePaddlePython

0 likes · 7 min read

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Laravel Tech Community

Jun 18, 2020 · Fundamentals

FFmpeg 4.3 Released with New AV1, Vulkan, AMD AMF, and QSV Support

FFmpeg 4.3 has been released, adding support for TrueHD in MP4, Intel QSV‑accelerated MJPEG and VP9 decoding, Vulkan‑based AMD AMF encoding on Linux, AV1 encoding via rav1e, ZeroMQ, VDPAU VP9 decoding, and numerous filter enhancements.

AV1MultimediaVideo processing

0 likes · 2 min read

FFmpeg 4.3 Released with New AV1, Vulkan, AMD AMF, and QSV Support

Youku Technology

May 7, 2020 · Industry Insights

How Alibaba’s FrameShare Pushes Ultra‑HD Video to the Next Level

This article explains the FrameShare ultra‑HD solution, detailing its four core capabilities—high frame‑rate, ultra‑high resolution, HDR rendering, and surround sound—along with the end‑to‑end video pipeline, key technologies such as frame interpolation, HDR tone‑mapping, cloud‑edge collaboration, and the future vision for nationwide ultra‑HD adoption.

HDRHigh Frame RateIndustry Insight

0 likes · 14 min read

How Alibaba’s FrameShare Pushes Ultra‑HD Video to the Next Level

Meituan Technology Team

Sep 12, 2019 · Mobile Development

How Meituan Engineered a Scalable Mobile Video Platform: Architecture and Lessons

This article details Meituan's end‑to‑end development of a merchant‑side mobile video feature, covering background needs, architecture design, technology selection, implementation of playback, recording, composition, cutting, processing pipelines, encountered pitfalls, monitoring strategies, and future optimization directions.

AndroidMediaCodecOptimization

0 likes · 24 min read

How Meituan Engineered a Scalable Mobile Video Platform: Architecture and Lessons

Meitu Technology

Jun 12, 2019 · Cloud Computing

Meitu's Cloud-Based Image Beautification and Large-Scale Video Processing Architecture

Meitu replaced on-device beautification and video processing with a cloud-native architecture that routes requests by region, uses a dedicated upload SDK for detailed monitoring, employs edge-computing, a configuration-driven plug-in framework and Kubernetes-based elastic scaling, enabling fast, reliable, globally-distributed image and video services.

Cloud ComputingMeituMonitoring

0 likes · 12 min read

Meitu's Cloud-Based Image Beautification and Large-Scale Video Processing Architecture

58 Tech

Apr 16, 2019 · Mobile Development

Design and Architecture of the 58 Short Video SDK for Mobile Applications

The article outlines the technical challenges of short‑video apps and presents the modular, extensible architecture of the 58 Short Video SDK, detailing its layered design, design principles, advantages, and future evolution to support advanced features such as AR, hardware decoding, and h265 encoding.

MultimediaVideo processingshort video

0 likes · 12 min read

Design and Architecture of the 58 Short Video SDK for Mobile Applications

iQIYI Technical Product Team

Nov 16, 2018 · Artificial Intelligence

iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights

iQIYI’s bullet‑screen masking employs a DeepLabv3+‑based two‑class semantic segmentation pipeline, preceded by a close‑up detector and followed by morphological refinement, trained on a custom annotated dataset that raises IoU to 93.6 %, processes hour‑long videos in under an hour, and is slated for future upgrades to instance and panoptic segmentation for finer‑grained masking.

AIVideo processingbullet screen masking

0 likes · 10 min read

iQIYI AI Bullet‑Screen Masking: Semantic Segmentation System and Engineering Insights

Youku Technology

Oct 31, 2018 · Artificial Intelligence

Technical Overview of Youku's Video Face Swapping System

Youku’s new video face‑swapping service lets users replace a celebrity’s face with a single uploaded photo by employing a 3D generative model, deep‑learning segmentation, multi‑scale super‑resolution, and trajectory smoothing to achieve fast, near‑photorealistic results across varied angles, expressions, and lighting, though it still lacks personalized models and struggles with extreme side views or heavy occlusions.

3D modelingAIVideo processing

0 likes · 10 min read

Technical Overview of Youku's Video Face Swapping System

Youku Technology

Oct 29, 2018 · Artificial Intelligence

Improving Online Video Experience: Youku’s End‑to‑End Video Quality Enhancement Techniques

Youku enhances online video by applying intelligent post‑production contrast mapping, device‑specific HDR tone‑mapping, high‑frame‑rate restoration through frame‑rate conversion, and ROI‑aware encoding that allocates bitrate to key visual areas, complemented by audio processing, to deliver cinema‑grade quality across diverse screens.

HDRMachine LearningROI encoding

0 likes · 9 min read

Improving Online Video Experience: Youku’s End‑to‑End Video Quality Enhancement Techniques

360 Quality & Efficiency

Apr 25, 2018 · Fundamentals

Introduction to FFmpeg: Libraries, Tools, and Basic Command Usage

This article introduces FFmpeg, outlines its eight core libraries, describes the main command‑line tools (ffmpeg, ffplay, ffprobe), and provides a step‑by‑step example of converting an MP4 video to HEVC with MP3 audio on Windows, including useful help commands and additional features.

MultimediaVideo processingffmpeg

0 likes · 5 min read

Introduction to FFmpeg: Libraries, Tools, and Basic Command Usage

Qizhuo Club

Mar 13, 2018 · Mobile Development

Mastering Android MediaCodec: From Basics to Advanced Video Processing

This article explores Android’s MediaCodec API, detailing its role in hardware video encoding/decoding, buffer management, data types, lifecycle states, and practical code examples, providing developers with a comprehensive guide to implementing advanced video processing features such as watermarking and transcoding on mobile devices.

AndroidHardware DecodingMediaCodec

0 likes · 10 min read

Mastering Android MediaCodec: From Basics to Advanced Video Processing

Liulishuo Tech Team

Dec 11, 2017 · Artificial Intelligence

Observations from AWS re:Invent 2017: AI, Voice, ML Frameworks, and Video Processing

The author recounts a 16‑hour drive to Las Vegas for AWS re:Invent, highlighting AI‑focused sessions such as Alexa, Lex, Polly, serverless Lambda, the MXNet vs TensorFlow competition, and emerging video‑processing research, while noting strengths, limitations, and future growth prospects.

AILambdaMachine Learning

0 likes · 5 min read

Observations from AWS re:Invent 2017: AI, Voice, ML Frameworks, and Video Processing