Tag

multimedia

0 views collected around this technical thread.

OPPO Kernel Craftsman
OPPO Kernel Craftsman
Dec 13, 2024 · Fundamentals

Overview of H.266/VVC Video Coding Standard and Its Key Technologies

H.266/VVC, the next‑generation video coding standard finalized in 2020, delivers roughly 50 % bitrate savings over H.265/HEVC with modestly higher decoding complexity, introduces advanced intra‑ and inter‑prediction, transform, quantization, entropy and loop‑filtering tools, and faces patent‑pool and adoption challenges before widespread smartphone integration around 2026.

Encoding StandardsH.266VVC
0 likes · 20 min read
Overview of H.266/VVC Video Coding Standard and Its Key Technologies
Kuaishou Tech
Kuaishou Tech
Jul 31, 2024 · Artificial Intelligence

Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024

At the China Multimedia 2024 conference in Yinchuan, Kuaishou presented its latest AI‑driven large‑model technologies—including text‑to‑image, text‑to‑video, and audio models—alongside advances in intelligent video coding, a new research‑fund initiative, and recent industry awards.

AIKuaishouLarge Models
0 likes · 5 min read
Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024
Bilibili Tech
Bilibili Tech
Jun 11, 2024 · Artificial Intelligence

Intelligent Restoration System for Legacy Video Quality

Bilibili’s Multimedia Lab created an end‑to‑end intelligent restoration system that assesses video resolution, frame‑rate and quality, then automatically selects and applies image‑level enhancement, frame‑rate up‑sampling, background and face restoration, and optical‑flow interpolation to transform blurry, jittery, artifact‑laden legacy videos into clear, smooth, high‑quality streams, now deployed for on‑demand content and slated for live‑stream expansion.

AIframe interpolationimage enhancement
0 likes · 12 min read
Intelligent Restoration System for Legacy Video Quality
360 Smart Cloud
360 Smart Cloud
Apr 3, 2024 · Backend Development

Understanding FFmpeg Hardware Acceleration Architecture and Implementation

FFmpeg provides a comprehensive, cross‑platform hardware acceleration framework that abstracts diverse GPU and dedicated video codec interfaces, defines HWContext types, device and frame contexts, and various codec configuration methods, enabling efficient video encoding, decoding, and filtering while addressing performance, compatibility, and pipeline complexity challenges.

FFmpegGPUbackend development
0 likes · 10 min read
Understanding FFmpeg Hardware Acceleration Architecture and Implementation
DaTaobao Tech
DaTaobao Tech
Jan 31, 2024 · Artificial Intelligence

Highlights of Recent AI Research Papers from Top Conferences (2023)

The article curates standout AI papers from 2023 CCF‑A conferences—including CVPR, ICLR, ACM MM, and INFORMS—showcasing advances such as Swin‑Transformer video quality assessment, cross‑modal e‑commerce product search, transformer‑based vehicle routing heuristics, diffusion‑driven dance generation, and reinforcement‑learning inventory replenishment.

AIRecommendation systemscomputer vision
0 likes · 23 min read
Highlights of Recent AI Research Papers from Top Conferences (2023)
HelloTech
HelloTech
Jan 25, 2024 · Backend Development

Design and Implementation of a Custom Multimedia Framework Using FFmpeg

The Haro Street Cat mobile team created a custom multimedia framework that wraps FFmpeg 4.2.2 in a C++ core library with Android/iOS compatibility layers and Java wrappers for transcoding, live streaming, and composition, delivering hardware‑accelerated decoding, flexible filter pipelines, and reliable transcoding that boosted coverage to over 99 %, cut storage by more than 30 %, accelerated video start‑up, and improved streaming and watermarking performance.

AudioC++FFmpeg
0 likes · 27 min read
Design and Implementation of a Custom Multimedia Framework Using FFmpeg
Bilibili Tech
Bilibili Tech
Dec 15, 2023 · Artificial Intelligence

Bilibili's AI-Powered Video Frame Interpolation: Techniques, Challenges, and Deployment

Bilibili’s AI‑driven frame‑interpolation pipeline upgrades low‑frame-rate videos to smooth high‑frame-rate 1080p playback by optimizing optical‑flow models for large motion, texture and text artifacts, pruning for speed, and deploying via the BVT SDK across on‑demand and live streams.

AIReal-time ProcessingVideo Frame Interpolation
0 likes · 14 min read
Bilibili's AI-Powered Video Frame Interpolation: Techniques, Challenges, and Deployment
Kuaishou Tech
Kuaishou Tech
Oct 31, 2023 · Artificial Intelligence

Kuaishou’s Nine Accepted Papers at ACM MM 2023: Summaries and Links

This article presents concise English summaries of nine Kuaishou research papers accepted at ACM MM 2023, covering topics such as no‑reference video quality assessment, adaptive video quality models, blind image super‑resolution, audio‑visual‑language transfer learning, motion‑aware video diffusion, large‑scale e‑commerce retrieval, and interactive segmentation.

AIImage Super-Resolutionaudio-visual language
0 likes · 18 min read
Kuaishou’s Nine Accepted Papers at ACM MM 2023: Summaries and Links
Test Development Learning Exchange
Test Development Learning Exchange
Aug 20, 2023 · Fundamentals

Python Multimedia Service Modules: audioop, aifc, sunau, wave, chunk, colorsys, imghdr, sndhdr, ossaudiodev

This article introduces Python's multimedia service modules, explaining how to process raw audio data, read and write various audio file formats, detect image and sound file types, convert color systems, and access OSS‑compatible audio devices, all illustrated with practical code examples.

Audiofile handlingmultimedia
0 likes · 7 min read
Python Multimedia Service Modules: audioop, aifc, sunau, wave, chunk, colorsys, imghdr, sndhdr, ossaudiodev
Bilibili Tech
Bilibili Tech
Aug 2, 2023 · Fundamentals

BILIVVC Secures Third Place in 2022 MSU Encoder Competition (1080p 1fps & 5fps)

Bilibili’s self‑developed VVC encoder, BILIVVC, earned third place in both the 1080p 1 fps and 5 fps tracks of the 2022 MSU Encoder Competition by leveraging extensive VVC‑tool optimizations, fast‑algorithm cooperation, adaptive pre‑analysis and efficient implementation that deliver high quality YUV‑SSIM performance despite its small‑team, one‑year development.

BILIVVCMSU Encoder CompetitionVVC
0 likes · 7 min read
BILIVVC Secures Third Place in 2022 MSU Encoder Competition (1080p 1fps & 5fps)
DataFunTalk
DataFunTalk
May 13, 2023 · Artificial Intelligence

Multimedia Content Understanding at Weibo: Video Summarization, Quality Assessment, OCR, Embedding, and CV‑CUDA Optimization

This article presents Weibo's comprehensive multimedia content understanding pipeline, covering video summarization techniques, quality assessment models, OCR advancements, video embedding strategies, and the performance benefits of CV‑CUDA acceleration, while highlighting real‑world applications and engineering trade‑offs.

CV-CUDAOCRVideo Summarization
0 likes · 32 min read
Multimedia Content Understanding at Weibo: Video Summarization, Quality Assessment, OCR, Embedding, and CV‑CUDA Optimization
DaTaobao Tech
DaTaobao Tech
Apr 26, 2023 · Artificial Intelligence

MD-VQA: Multi-Dimensional No-Reference Video Quality Assessment for CVPR NTIRE 2023

Alibaba’s Taobao VQA team won the CVPR NTIRE 2023 Video Enhancement Challenge by introducing MD‑VQA, a multi‑dimensional no‑reference video quality model that combines a Swin‑Transformer‑V2 spatial backbone, a pre‑trained SlowFast motion encoder, and a convolutional fusion module, pre‑trained on LSVQ, fine‑tuned on NTIRE data, and augmented spatio‑temporally, achieving state‑of‑the‑art SROCC and PLCC scores and now powering quality monitoring on Alibaba’s live‑streaming and short‑video services.

No-ReferenceSwin Transformercomputer vision
0 likes · 15 min read
MD-VQA: Multi-Dimensional No-Reference Video Quality Assessment for CVPR NTIRE 2023
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Apr 14, 2023 · Fundamentals

Pipeline Domain Design in Multimedia Frameworks: Concepts, Comparative Analysis, and Implementation

The article defines pipeline domain design concepts, compares major multimedia frameworks such as FFmpeg, GStreamer, MediaPipe and AVPipeline, and demonstrates a configurable, extensible node‑based architecture that enables fast plugin integration and adaptable audio‑video pipelines across diverse business scenarios and platforms.

AIGStreamerframework
0 likes · 39 min read
Pipeline Domain Design in Multimedia Frameworks: Concepts, Comparative Analysis, and Implementation
IT Services Circle
IT Services Circle
Mar 3, 2023 · Backend Development

FFmpeg 6.0 “Von Neumann” Released with New Encoders, Decoders, Filters, and ABI Versioning

FFmpeg 6.0 “Von Neumann” has been officially released, introducing numerous new encoders, decoders, and filters, adding ABI versioning to major releases, deprecating old APIs, and enhancing CLI performance with threading, statistics options, and file‑based filter options, while outlining upcoming features for version 6.1.

DecodersEncodersFFmpeg
0 likes · 6 min read
FFmpeg 6.0 “Von Neumann” Released with New Encoders, Decoders, Filters, and ABI Versioning
Laravel Tech Community
Laravel Tech Community
Mar 1, 2023 · Backend Development

FFmpeg 6.0 “Von Neumann” Release: New Codecs, Filters, and ABI Changes

FFmpeg 6.0 "Von Neumann" introduces a host of new decoders, encoders, filters, hardware‑accelerated AV1 support, ABI versioning, and numerous performance and API improvements, marking a major, more structured release cycle for the multimedia framework.

AV1FFmpegFilters
0 likes · 5 min read
FFmpeg 6.0 “Von Neumann” Release: New Codecs, Filters, and ABI Changes
DataFunSummit
DataFunSummit
Feb 10, 2023 · Information Security

Digital Watermarking Technology: Concepts, Models, Algorithms, and Applications

The article provides a comprehensive overview of digital watermarking, covering its fundamental concepts, security features, embedding/detection/extraction processes, major algorithm families, practical applications such as copyright protection and anti‑counterfeiting, and future research directions in multimedia information security.

cryptographydigital watermarkinginformation security
0 likes · 20 min read
Digital Watermarking Technology: Concepts, Models, Algorithms, and Applications
Tencent Advertising Technology
Tencent Advertising Technology
Dec 15, 2022 · Artificial Intelligence

AI‑Driven Element Selection for Advertising Video Creative Generation

This article explains how Tencent's advertising system leverages multimedia AI techniques—including multi‑armed bandit, pairwise learning, and DeepFM models—to automatically select optimal templates, music, and stickers for image and video assets, thereby reducing production cost, improving creative quality, and boosting ad performance.

MABadvertising AIdeepFM
0 likes · 17 min read
AI‑Driven Element Selection for Advertising Video Creative Generation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 30, 2022 · Frontend Development

Building an Interactive 3D Phone Showcase with Three.js Multimedia Elements (Text, Image, Audio, Video)

This article explains how to use Three.js to create a realistic 3D phone product page by loading and applying multimedia assets such as custom fonts, textures, audio sources, and video textures, and demonstrates interactive features like ray‑casting for material switching and first‑person controls.

3dJavaScriptThree.js
0 likes · 19 min read
Building an Interactive 3D Phone Showcase with Three.js Multimedia Elements (Text, Image, Audio, Video)
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Sep 29, 2022 · Artificial Intelligence

How DeViT Revolutionizes Video Inpainting with Deformed Vision Transformers

The article introduces DeViT, a novel Deformed Vision Transformer framework for video inpainting that leverages a deformable patch homography estimator, mask‑pruned attention, and spatio‑temporal weight adaptation, achieving state‑of‑the‑art results on benchmark datasets and highlighting its potential for advanced video editing tools.

DeViTTransformercomputer vision
0 likes · 10 min read
How DeViT Revolutionizes Video Inpainting with Deformed Vision Transformers
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Aug 16, 2022 · Artificial Intelligence

Deep Learning Turns SDR Video into HDR: ACM Multimedia 2022 Breakthrough

Researchers from Kuaishou and Xi’an University of Electronic Science and Technology presented a novel deep‑learning‑based SDR‑to‑HDR video conversion method at ACM Multimedia 2022, introducing hierarchical dynamic context feature mapping, a layered dynamic feature modulation module, and a patch‑discriminator GAN that together achieve superior objective and subjective HDR quality.

HDR videocomputer visiondeep learning
0 likes · 6 min read
Deep Learning Turns SDR Video into HDR: ACM Multimedia 2022 Breakthrough