Tag: dataset

JD Tech
Apr 7, 2025 · Artificial Intelligence

Embodied Intelligence: From Data Scarcity to Real-World Robotic Manipulation – JD Explore Academy’s System Architecture and Research Advances

The article outlines JD Explore Academy’s recent embodied‑intelligence research. It describes the challenges of data scarcity and precise manipulation, a highly extensible ROS‑based system architecture, dual‑arm teleoperation technology, a data‑efficient end‑effector imitation method, and the open JD ManiData dataset, which together move robots from lab demos to practical tasks such as coffee‑making.

AI · ROS · Robotics
0 likes · 7 min read

Amap Tech
Mar 19, 2025 · Artificial Intelligence

Driving by the Rules: Integrating Lane-Level Traffic Regulations into Online HD Maps

Gaode Map and Xi'an Jiaotong University introduce the “Driving by the Rules” task, releasing the MapDR benchmark that integrates lane‑level traffic‑sign regulations into online‑constructed HD maps, and provide modular (VLE‑MEE) and end‑to‑end (RuleVLM) baselines to evaluate rule extraction and lane association.

AI · HD maps · autonomous driving
0 likes · 8 min read

Kuaishou Tech
Feb 20, 2025 · Artificial Intelligence

Second Short-Form Video Quality Assessment and Enhancement Challenge (CVPR NTIRE 2025)

The second short-form video quality assessment and enhancement challenge, co‑organized by Kuaishou's audio‑video team and the Intelligent Media Computing Lab, invites global researchers to develop efficient quality assessment models and diffusion‑based super‑resolution methods using the new KwaiSR dataset, with prize money and potential CVPR workshop paper invitations.

AI competition · CVPR NTIRE · Image Super-Resolution
0 likes · 9 min read

DataFunTalk
Feb 18, 2025 · Artificial Intelligence

CODEI/O: Leveraging Code to Train Large Language Models for Enhanced Reasoning

The DeepSeek team introduced CODEI/O, a massive dataset that converts code into natural‑language reasoning chains, and demonstrated that training large language models on this data with a two‑stage strategy markedly improves their performance on diverse reasoning tasks, including non‑code domains.

AI training · CODEI/O · Large Language Models
0 likes · 8 min read

JD Tech
Feb 5, 2025 · Artificial Intelligence

Tech Insight: Highlights of Ten JD Retail Technology Papers Published in Top AI Conferences (2024)

Tech Insight presents concise overviews of ten JD retail technology papers accepted at top AI conferences in 2024, covering topics such as open‑vocabulary object detection, multi‑scenario ranking, diversity‑aware re‑ranking, a diversified product search dataset, semi‑supervised query classification, plug‑in CTR models, and methods to mitigate LLM hallucinations.

AI · Ranking · computer vision
0 likes · 17 min read

DataFunSummit
Jan 1, 2025 · Artificial Intelligence

Challenges and Evaluation Strategies for LLM Agents in 2024

The article outlines the rapid progress of LLM agents in 2024 while highlighting key difficulties in planning capabilities, evaluation methods, dataset generation, and metric design, and suggests practical combinations and product‑level enhancements to improve efficiency, accuracy, and usability.

AI · LLM · agent
0 likes · 3 min read

AntTech
Sep 3, 2024 · Artificial Intelligence

2024 Inclusion Bund Conference AI Innovation Competition and Deepfake Challenge Results

The 2024 Inclusion Bund Conference in Shanghai announced the winners of its newly added AI Innovation Competition, including the AFAC Financial Intelligence Contest and the Global Deepfake Attack‑Defense Challenge, highlighting participation from over 7,000 teams across more than 20 countries and showcasing cutting‑edge deepfake detection achievements.

AI · FinTech · Innovation Competition
0 likes · 7 min read

AntTech
Aug 16, 2024 · Artificial Intelligence

PC²: Pseudo‑Classification Based Pseudo‑Captioning for Noisy Correspondence Learning in Cross‑Modal Retrieval

The paper introduces PC², a novel framework that combines pseudo‑classification and pseudo‑captioning to mitigate noisy correspondence in cross‑modal retrieval, presents a large‑scale web‑page/image‑meta‑description dataset called Noise of Web (NoW), and demonstrates significant performance gains on multiple benchmark datasets including Flickr30K, MS‑COCO, and the newly released NoW.

PC2 · cross-modal retrieval · dataset
0 likes · 16 min read

AntTech
Aug 6, 2024 · Artificial Intelligence

Self‑Supervised Video Copy Localization with Regional Token Representation

The article presents a self‑supervised framework that uses a regional token structure within a Vision Transformer to accurately locate copied video segments, dramatically reducing annotation costs and achieving state‑of‑the‑art performance without manual labeling. It also highlights the framework's real‑world deployment for copyright protection.

AI · computer vision · copyright protection
0 likes · 5 min read

Kuaishou Tech
Jul 1, 2024 · Artificial Intelligence

Short-Form Video Quality Assessment Competition at CVPR NTIRE 2024: Dataset, Challenge Overview, and Top Winning Solutions

The CVPR NTIRE 2024 short-form video quality assessment competition introduced the KVQ dataset, attracted over 200 teams, evaluated submissions using SROCC and PLCC metrics, and highlighted the winning approaches of SJTU MMLab, IH‑VQA, and TVQE, showcasing advances in AI‑driven video quality evaluation.

AI competition · NTIRE 2024 · dataset
0 likes · 9 min read

Kuaishou Tech
Mar 6, 2024 · Artificial Intelligence

Short Video Quality Assessment Competition (KVQ) at CVPR NTIRE 2024

The CVPR NTIRE 2024 workshop hosts the first short‑video quality assessment competition, introducing the KVQ dataset of 4,200 videos across nine scenes, providing training/validation data, a baseline 3D Swin‑Transformer model, detailed competition rules, rewards, and organizer contacts.

AI · competition · computer vision
0 likes · 7 min read

Rare Earth Juejin Tech Community
Dec 29, 2023 · Artificial Intelligence

Overview of Major Benchmark Datasets for Evaluating Large Language Models

This article provides a comprehensive overview of major benchmark datasets—including CMMLU, MMLU, C‑Eval, GSM8K, Gaokao‑Bench, AGIEval, MATH, BBH, HumanEval, and MBPP—used to evaluate large language models' knowledge, reasoning, and coding abilities, and summarizes related leaderboards and evaluation tools.

Artificial Intelligence · LLM · benchmark
0 likes · 14 min read

AntTech
Dec 19, 2023 · Artificial Intelligence

RJUA‑QA: A Comprehensive Urology QA Dataset for Large Language Model Evaluation

RJUA‑QA is a newly released, large‑scale urology question‑answer dataset constructed from virtual patient records based on clinical experience, featuring 2,132 QA pairs with extensive context, designed to benchmark and improve large language models’ medical reasoning, diagnosis, and treatment recommendation capabilities.

Large Language Models · Medical AI · QA dataset
0 likes · 12 min read

AntTech
Oct 30, 2023 · Artificial Intelligence

AntM2C: A Large-Scale Multi‑Scenario Multi‑Modal CTR Prediction Dataset from Alipay

AntM2C is a publicly released, billion‑sample click‑through‑rate (CTR) dataset covering five distinct Alipay business scenarios, providing both ID and rich multi‑modal (text and image) features to enable comprehensive evaluation of multi‑scenario, cold‑start, and multi‑modal CTR models at industrial scale.

Large Scale · ctr · dataset
0 likes · 14 min read

Kuaishou Tech
Sep 26, 2023 · Artificial Intelligence

Cross-Domain Product Representation (COPE): A Large-Scale Dataset and Baseline Model for Rich‑Content E‑Commerce

The paper introduces ROPE, the first large‑scale cross‑domain product recognition dataset covering detail pages, short videos and live streams, and proposes COPE, a dual‑tower multimodal model that learns unified product embeddings using contrastive and classification losses, achieving superior retrieval and few‑shot classification performance across domains.

Cross-Domain · contrastive learning · dataset
0 likes · 13 min read

AntTech
Sep 21, 2023 · Artificial Intelligence

AFAC2023 Financial Intelligence Challenge Highlights and the Release of the Fin‑Eval Dataset

The inaugural AFAC2023 Financial Intelligence Challenge, co‑organized by the China Computer Federation and Ant Group, attracted over 4,700 teams, showcased cutting‑edge AI solutions for finance such as market opinion generation, compliance detection, and pet‑age recognition, and culminated in the public launch of the Fin‑Eval benchmark dataset for financial large‑model evaluation.

AI · Fin-Eval · Finance
0 likes · 12 min read

DataFunTalk
Sep 21, 2023 · Artificial Intelligence

2023 Chinese Continuous Visual Speech Recognition Challenge (CNVSRC) Overview

The 2023 Chinese Continuous Visual Speech Recognition Challenge (CNVSRC), organized by Tsinghua University and partners, introduces the large-scale CN-CVS dataset, defines single- and multi-speaker lip‑reading tasks, provides baseline Conformer models, and outlines registration, data access, evaluation metrics, and the competition schedule.

AI · Challenge · conformer
0 likes · 7 min read

Kuaishou Large Model
Jul 7, 2023 · Artificial Intelligence

How HairStep Revolutionizes Single-View 3D Hair Reconstruction

This paper introduces HairStep, a novel intermediate representation combining Strand Maps and Depth Maps, and demonstrates how it reduces the domain gap and improves single‑view 3D hair reconstruction accuracy across multiple algorithms, supported by new annotated datasets (HiSa, HiDa) and fair evaluation metrics.

3D hair reconstruction · HairStep · computer vision
0 likes · 11 min read

DataFunTalk
Mar 1, 2023 · Artificial Intelligence

ACL 2023 Multi‑lingual Document‑grounded Dialogue Competition Overview

The ACL 2023 Multi‑lingual Document‑grounded Dialogue Competition, hosted by Alibaba DAMO Academy and Nanjing University, introduces the first multilingual document‑dialogue dataset, provides a baseline system, offers a $7,000 prize pool, and invites participants to submit papers to the Doc2dial Workshop for Best Paper awards.

ACL2023 · NLP · competition
0 likes · 6 min read

DataFunTalk
Feb 10, 2023 · Artificial Intelligence

ICDAR 2023 BDVT-QA Competition: Born Digital Video Text Question Answering

The ICDAR 2023 BDVT-QA competition, organized by Alibaba DAMO Academy, introduces a novel dataset of 1,000 born‑digital video clips for end‑to‑end video text recognition and video text question answering, offering cash prizes, detailed dataset access, and a lineup of leading academic and industry experts.

AI · ICDAR · Video Text Recognition
0 likes · 5 min read