Tag

multitask learning

0 views collected around this technical thread.

AntTech
AntTech
Feb 26, 2025 · Artificial Intelligence

Ant Group’s 18 Accepted Papers at AAAI 2025: Summaries and Highlights

This article presents concise English summaries of the 18 Ant Group papers accepted at AAAI 2025, covering topics such as privacy‑preserving large‑model tuning, knowledge‑graph integration, AI‑generated image detection, multi‑task learning, generative retrieval, role‑playing evaluation, and video hallucination mitigation.

AAAI 2025AI evaluationVideo Hallucination
0 likes · 29 min read
Ant Group’s 18 Accepted Papers at AAAI 2025: Summaries and Highlights
DataFunTalk
DataFunTalk
Jan 30, 2023 · Artificial Intelligence

Domain Knowledge Enhanced Pretrained Language Model for Medicinal Product Vertical Search

This article presents a domain‑knowledge‑enhanced pretrained language model that combines ELECTRA‑based token‑level masking with a novel product‑attribute prediction (PAP) task to improve query understanding, intent classification, and relevance matching in vertical drug e‑commerce search, and validates its effectiveness through extensive experiments on public and proprietary datasets.

ELECTRAdomain knowledgemedical NLP
0 likes · 13 min read
Domain Knowledge Enhanced Pretrained Language Model for Medicinal Product Vertical Search
DataFunTalk
DataFunTalk
Aug 22, 2022 · Artificial Intelligence

Live‑Streaming Recommendation System: Interaction Scenarios, User Cold‑Start, Prior Modeling, and Scene Modeling

The article presents a comprehensive technical overview of a live‑streaming recommendation system, covering common and specific characteristics, user cold‑start strategies using unbiased clustering, prior knowledge integration, multi‑task modeling, and scene‑aware routing to improve relevance and engagement in interactive environments.

Artificial IntelligenceClusteringLive Streaming
0 likes · 19 min read
Live‑Streaming Recommendation System: Interaction Scenarios, User Cold‑Start, Prior Modeling, and Scene Modeling
DataFunTalk
DataFunTalk
Dec 14, 2021 · Artificial Intelligence

Speech Translation: Enterprise Applications and Research

This article presents an overview of speech translation, discusses its motivations and applications at ByteDance, compares cascade and end‑to‑end modeling approaches, introduces advanced encoder and decoder designs such as LUT, Chimera, and COSTT, outlines progressive multi‑task training and data‑augmentation strategies, and shares experimental results and Q&A.

AIAudio Processingend-to-end models
0 likes · 16 min read
Speech Translation: Enterprise Applications and Research
DataFunSummit
DataFunSummit
Mar 30, 2021 · Artificial Intelligence

Chinese Short‑Text Entity Linking: Model Design, Multitask Learning, and Experimental Results on the Qianyan Dataset

This article presents a comprehensive approach to Chinese short‑text entity linking, describing the Qianyan dataset, pipeline and end‑to‑end task formulations, sample construction, a multitask model that jointly performs entity ranking and NIL classification, various optimization techniques including confidence learning and adversarial training, and detailed experimental analysis showing state‑of‑the‑art performance.

Chinese NLPadversarial trainingconfidence learning
0 likes · 13 min read
Chinese Short‑Text Entity Linking: Model Design, Multitask Learning, and Experimental Results on the Qianyan Dataset
DataFunTalk
DataFunTalk
Feb 26, 2021 · Artificial Intelligence

Fine‑Grained Sentiment Analysis and Opinion Quadruple Extraction: Methods, Tasks, and Applications

This article introduces the concepts, tasks, and recent advances in text sentiment analysis, focusing on attribute‑level sentiment (TG‑ABSA) and opinion‑quadruple extraction, describing unsupervised, reading‑comprehension, and multi‑task deep‑learning approaches, their implementation on Huawei Cloud, experimental results, and future research directions.

NLPaspect‑based sentimentdeep learning
0 likes · 20 min read
Fine‑Grained Sentiment Analysis and Opinion Quadruple Extraction: Methods, Tasks, and Applications
DataFunTalk
DataFunTalk
Oct 23, 2020 · Artificial Intelligence

Feedback‑Aware Deep Matching Model for Music Recommendation in Tmall Genie

This article presents DeepMatch, a behavior‑sequence based deep learning recall model enhanced with play‑rate and intent‑type embeddings, describes its self‑attention architecture, factorized embedding parameterization, multitask loss design, distributed TensorFlow training tricks, and demonstrates significant offline and online improvements in music recommendation performance.

Self-AttentionTensorFlowdeep learning
0 likes · 15 min read
Feedback‑Aware Deep Matching Model for Music Recommendation in Tmall Genie
JD Tech Talk
JD Tech Talk
Nov 5, 2019 · Artificial Intelligence

GeoBERT: A Multi‑Task Pre‑trained Language Model for Chinese Address Text

This article introduces GeoBERT, a novel pre‑training method for Chinese address strings that leverages seven jointly constrained tasks to capture spatial semantics, administrative hierarchy, and similarity relationships, enabling downstream address classification, segmentation, POI extraction, similarity comparison, and authenticity verification with reduced annotation dependence.

Chinese languageGeoBERTNLP
0 likes · 15 min read
GeoBERT: A Multi‑Task Pre‑trained Language Model for Chinese Address Text
High Availability Architecture
High Availability Architecture
May 27, 2019 · Artificial Intelligence

A Survey of Transfer Learning and Model Pre‑training Techniques for Natural Language Processing

This article reviews the taxonomy of transfer learning in NLP, summarizes representative pre‑training models such as ELMo, ULMFiT, BERT, GPT, MASS and UNILM, discusses their strengths and limitations, and provides practical recommendations for applying these techniques in real‑world projects.

BERTELMoNLP
0 likes · 34 min read
A Survey of Transfer Learning and Model Pre‑training Techniques for Natural Language Processing
JD Tech
JD Tech
Jun 29, 2018 · Artificial Intelligence

JD AI's JDAI-Face: Real-Time Multi-Task Facial Attribute Recognition System

The article introduces JD AI's JDAI-Face system, a deep‑learning based real‑time multi‑task facial attribute recognition platform that detects gender, age, ethnicity, expression and attractiveness, outlines its technical pipeline, showcases retail applications, and cites recent academic publications and expert contributors.

Artificial Intelligencecomputer visiondeep learning
0 likes · 12 min read
JD AI's JDAI-Face: Real-Time Multi-Task Facial Attribute Recognition System