Tag

Swin Transformer

0 views collected around this technical thread.

DaTaobao Tech
DaTaobao Tech
Apr 26, 2023 · Artificial Intelligence

MD-VQA: Multi-Dimensional No-Reference Video Quality Assessment for CVPR NTIRE 2023

Alibaba’s Taobao VQA team won the CVPR NTIRE 2023 Video Enhancement Challenge by introducing MD‑VQA, a multi‑dimensional no‑reference video quality model that combines a Swin‑Transformer‑V2 spatial backbone, a pre‑trained SlowFast motion encoder, and a convolutional fusion module, pre‑trained on LSVQ, fine‑tuned on NTIRE data, and augmented spatio‑temporally, achieving state‑of‑the‑art SROCC and PLCC scores and now powering quality monitoring on Alibaba’s live‑streaming and short‑video services.

No-ReferenceSwin Transformercomputer vision
0 likes · 15 min read
MD-VQA: Multi-Dimensional No-Reference Video Quality Assessment for CVPR NTIRE 2023
HaoDF Tech Team
HaoDF Tech Team
Oct 8, 2022 · Artificial Intelligence

Exploring Transformer Technology and Its Applications in NLP, Computer Vision, and OCR at Haodf.com

This article introduces the Transformer architecture, explains its attention mechanism, details its adaptations for natural language processing, computer vision, and OCR tasks, and presents experimental results of various models such as BERT, ELECTRA, Swin Transformer, and CRNN-BCN on large-scale medical data from Haodf.com.

NLPOCRSwin Transformer
0 likes · 39 min read
Exploring Transformer Technology and Its Applications in NLP, Computer Vision, and OCR at Haodf.com