Tag

video processing

1 view collected around this technical thread.

DaTaobao Tech
Mar 5, 2025 · Artificial Intelligence

Multimodal Large‑Model Cover Generation AI Agent for Taobao Video and Live Streams

Taobao’s new multimodal AI Agent automatically creates high‑quality static and dynamic video covers by planning tasks, consulting a memory of quality criteria, executing frame selection with ReKV streaming and dual‑stage evaluation, generating marketing copy via fine‑tuned Qwen2.5‑7B, and refining layout, resulting in significantly higher click‑through rates, lower latency, and reduced manual effort.

AI · Content AI · Large Models
0 likes · 17 min read
Kuaishou Audio & Video Technology
Jan 21, 2025 · Fundamentals

How Kuaishou Enables Full‑Chain Dolby Vision Support for UGC

Kuaishou partners with Dolby Labs to bring full‑chain Dolby Vision to its short‑video platform, detailing the technology behind HDR, dynamic metadata, and a brightness‑adjustment solution that ensures seamless playback and optimal visual experience for user‑generated content across devices.

Dolby Vision · Dynamic Metadata · Extended Brightness
0 likes · 10 min read
Python Programming Learning Circle
Jan 20, 2025 · Fundamentals

10 Practical Python Automation Scripts for Everyday Tasks

This article presents ten ready‑to‑use Python automation scripts that demonstrate how to optimize images, process videos, convert PDFs to images, fetch and post API data, monitor battery levels, correct grammar and spelling, download files, retrieve world news, and build a simple PySide2 GUI, providing clear code examples for each use case.

API · GUI · Image Processing
0 likes · 11 min read
Kuaishou Tech
Jan 17, 2025 · Artificial Intelligence

Kuaishou Achieves 7 Papers Accepted at AAAI 2025

Kuaishou has achieved a significant milestone with 7 papers accepted at AAAI 2025, covering diverse AI research areas including video processing, recommendation systems, and image restoration, demonstrating the company's strong research capabilities in artificial intelligence.

AAAI 2025 · Artificial Intelligence · Kuaishou
0 likes · 10 min read
Kuaishou Tech
Sep 27, 2024 · Artificial Intelligence

XPSR: Cross‑modal Priors for Diffusion‑based Image Super‑Resolution

The paper introduces XPSR, a diffusion‑based image super‑resolution method that incorporates cross‑modal semantic priors from a large multimodal language model, achieving state‑of‑the‑art performance on both reference and no‑reference quality metrics across synthetic and real‑world image restoration tasks.

AI research · ECCV2024 · Image Super-Resolution
0 likes · 8 min read
Python Programming Learning Circle
Sep 12, 2024 · Fundamentals

Convert a Dancing Video into an ASCII Art Video with Python

This tutorial explains how to download a Bilibili dance video, extract GIF frames, transform them into ASCII art, rename and sort the frames, convert them to images, assemble a video with OpenCV, and finally add background music using MoviePy.

MoviePy · ascii art · opencv
0 likes · 8 min read
Python Programming Learning Circle
Sep 11, 2024 · Artificial Intelligence

Python Tutorial: Download Bilibili Video, Extract Frames, Perform Human Segmentation with Baidu AI, Generate Word Cloud, and Compose Final Video

This article demonstrates how to use Python to download a Bilibili video, extract frames with OpenCV, perform human segmentation via Baidu AI, generate a word‑cloud animation using MoviePy, and finally compose the processed clips into a complete video with added audio.

AI Segmentation · Bilibili · MoviePy
0 likes · 13 min read
JD Retail Technology
Sep 3, 2024 · Backend Development

Design and Architecture of a New Video Review System with Streamlined Frame Extraction and Parallel Processing

This article presents the design goals, architecture, technology selection, and component details of a unified video review system that leverages FFmpeg for frame extraction, stream‑based parallel processing, and flexible synchronous/asynchronous workflows to achieve low latency and high scalability.

FFmpeg · Parallel Processing · Streaming
0 likes · 10 min read
Kuaishou Tech
Jun 18, 2024 · Artificial Intelligence

CVPR 2024 Conference Papers: Advances in AI and Computer Vision

Kuaishou presents 8 papers at CVPR 2024, covering AI advances in computer vision, video quality assessment, and 3D generation.

3D Generation · AI · CVPR
0 likes · 11 min read
Bilibili Tech
Apr 26, 2024 · Artificial Intelligence

2024 Bilibili Technology Patent Awards – Highlights of Ten Winning Innovations

On World Intellectual Property Day, Bilibili honored ten breakthrough patents that together enable billion‑scale video duplicate detection, AI‑driven story generation, synchronized live rhythm‑games, automatic OTT casting, knowledge‑graph‑based content moderation, glitch‑free multi‑audio streaming, modular playback integration, neural‑network resolution encoding, AV1 reference‑frame pruning, and fine‑grained GPU isolation.

Artificial Intelligence · Interactive Media · Patent
0 likes · 6 min read
Python Programming Learning Circle
Apr 22, 2024 · Fundamentals

How to Convert a Dancing Video into an ASCII Art Video with Python

This guide walks you through downloading a Bilibili video, extracting GIF frames, converting them to ASCII art, renaming and ordering the frames, turning the ASCII GIFs into images, assembling them into a video with OpenCV, and adding background music using MoviePy, all with Python code.

MoviePy · Python · ascii art
0 likes · 8 min read
Bilibili Tech
Apr 16, 2024 · Frontend Development

Design and Implementation of a High‑Performance Matroska Demuxer for Web Uploads

The new mkv-demuxer SDK replaces the slow FFmpeg-Wasm solution on Bilibili’s upload page by reading Matroska files in slice-sized ArrayBuffers, parsing EBML headers and SeekHead indexes, and exposing getMeta, getData, and seekFrame APIs, cutting memory use by 98 % and parsing time by 97 % while accelerating cover-generation and recommendation processing.

Demuxer · Matroska · Performance
0 likes · 17 min read
Test Development Learning Exchange
Apr 12, 2024 · Fundamentals

Upscaling 720p Video to 1080p with ffmpeg‑python and OpenCV in Python

The article explains how to use ffmpeg‑python and OpenCV in Python to transcode a 720p video to 1080p, discusses the limitations of simple upscaling, and provides code examples for resolution scaling as well as basic color and sharpening enhancements.

FFmpeg · opencv · resolution
0 likes · 6 min read
360 Smart Cloud
Apr 3, 2024 · Backend Development

Understanding FFmpeg Hardware Acceleration Architecture and Implementation

FFmpeg provides a comprehensive, cross‑platform hardware acceleration framework that abstracts diverse GPU and dedicated video codec interfaces, defines HWContext types, device and frame contexts, and various codec configuration methods, enabling efficient video encoding, decoding, and filtering while addressing performance, compatibility, and pipeline complexity challenges.

FFmpeg · GPU · backend development
0 likes · 10 min read
Bilibili Tech
Mar 5, 2024 · Game Development

Bilibili Color Space Conversion Engine for Video Processing

Bilibili's color space conversion engine processes user‑uploaded videos with varied color parameters into a unified format, using layered filters, precomputed optimizations, CPU and CUDA implementations, handling transformations, quantization, chroma subsampling, matrix conversion, transfer functions, gamut and tone mapping, HDR dynamic metadata, and achieving high performance for millions of users.

Color space · GPU Acceleration · HDR
0 likes · 19 min read
Bilibili Tech
Mar 1, 2024 · Artificial Intelligence

Bilibili's Self-Developed Video Super-Resolution Algorithm: Background, Optimization Directions, and Implementation Details

Bilibili’s self‑developed video super‑resolution system upgrades low‑resolution streams to 4K by using three parallel degradation‑branch networks—texture‑enhancing, line‑recovering, and noise‑removing—tailored to anime, game, and real‑world content, delivering sharper edges, finer textures, and measurable quality gains across its online playback pipeline.

AI · Bilibili · Model Architecture
0 likes · 16 min read
Python Programming Learning Circle
Jan 26, 2024 · Operations

Eight Practical Python Automation Scripts for Everyday Tasks

This article presents eight ready‑to‑use Python scripts that automate common tasks such as image and video processing, scheduled email sending, PDF‑to‑image conversion, API data fetching, battery monitoring, testing with pytest, and file backup‑sync, complete with code examples.

API · Email · Image Processing
0 likes · 11 min read
Bilibili Tech
Jan 12, 2024 · Frontend Development

Understanding WebCodecs: Design Goals, Core API, Demos, and Application Scenarios

WebCodecs, introduced in Chrome 94, provides direct, low‑latency access to hardware‑accelerated audio and video codecs, enabling fine‑grained encoding and decoding control, composable streaming pipelines, and high‑performance demos such as controllable decoding, watermarking, chroma‑key, and client‑side video processing, while still lacking container support and broad browser compatibility.

Audio Video APIs · Browser Media · Frontend Development
0 likes · 15 min read
Test Development Learning Exchange
Jan 6, 2024 · Artificial Intelligence

Python Video Converter Using OpenCV: Convert Videos to Grayscale, Black‑White, and RGB

This article explains how to build a Python 3 video converter with OpenCV that reads an input video, optionally transforms each frame to grayscale, black‑and‑white, or RGB, and writes the result to a new output file, including command‑line argument parsing and dependency installation.

Computer Vision · Python · opencv
0 likes · 6 min read