Tag

semantic similarity

0 views collected around this technical thread.

Model Perspective
Model Perspective
May 29, 2024 · Artificial Intelligence

How to Build Word Vectors from Scratch: A Step‑by‑Step Guide

This article explains the fundamentals of word vectors in NLP, walks through constructing them via co‑occurrence matrices and dimensionality reduction, demonstrates the process with a concrete example and Python code, and evaluates the resulting embeddings using cosine similarity.

NLPPythonSVD
0 likes · 7 min read
How to Build Word Vectors from Scratch: A Step‑by‑Step Guide
TAL Education Technology
TAL Education Technology
Apr 14, 2022 · Artificial Intelligence

Intelligent Call Recording Quality Inspection Using Dual‑Mode Detection

This article proposes a dual‑mode detection solution for call‑recording quality inspection that combines rule‑based semantic similarity matching with BERT‑based sentence segmentation and RoBERTa multi‑label classification to achieve high accuracy, fast task adaptation, and strong generalization for customer‑service scenarios.

BERTNLPRoBERTa
0 likes · 7 min read
Intelligent Call Recording Quality Inspection Using Dual‑Mode Detection
HaoDF Tech Team
HaoDF Tech Team
Sep 15, 2021 · Artificial Intelligence

Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study

This article describes how Haodf Online improved its medical question‑answer search by analyzing search challenges, adopting semantic similarity models based on pre‑trained language embeddings, designing contrastive training tasks, and evaluating the resulting increase in click‑through rate and user engagement.

Medical AISearch Relevancemodel optimization
0 likes · 12 min read
Optimizing Question‑Answer Search Similarity in Haodf Online: A Semantic Similarity Model Case Study
DataFunTalk
DataFunTalk
Jun 6, 2021 · Artificial Intelligence

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

ConSERT introduces a contrastive self‑supervised framework that enhances BERT‑derived sentence embeddings by applying efficient embedding‑level data augmentations, achieving significant improvements on semantic textual similarity tasks, especially in low‑resource settings, and outperforming previous state‑of‑the‑art methods.

BERTcontrastive learningself-supervised
0 likes · 20 min read
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Baidu Geek Talk
Baidu Geek Talk
Apr 16, 2021 · Artificial Intelligence

Large-Scale Short Text Clustering System Design and Practice at Baidu Search

At Baidu Search, a large‑scale short‑text clustering system was built using multi‑level semantic splitting, fine‑grained aggregation and error‑correction, evolving from v1.0 to v2.0, and now clusters 100 million queries with 95 % accuracy and 80 % recall within three days.

Baidu Searchmulti-level splittingsearch query processing
0 likes · 19 min read
Large-Scale Short Text Clustering System Design and Practice at Baidu Search
Tencent Cloud Developer
Tencent Cloud Developer
Jul 8, 2020 · Artificial Intelligence

Graph-Based Chinese Word Embedding (AlphaEmbedding) for Improved Text Matching

AlphaEmbedding builds a weighted graph linking Chinese words, sub‑words, characters and pinyin, then uses random‑walk‑based node2vec training to produce embeddings that capture orthographic and phonetic similarity, markedly improving recall and ranking for homophones, typos and OOV terms in enterprise search.

Chinese NLPText Matchinggraph computing
0 likes · 17 min read
Graph-Based Chinese Word Embedding (AlphaEmbedding) for Improved Text Matching
58 Tech
58 Tech
Oct 16, 2019 · Artificial Intelligence

Design and Implementation of Intent Recognition, Semantic Similarity Matching, and Slot Filling for a Voice Robot

This article details the architecture and algorithms behind a voice robot's natural language understanding module, covering single‑sentence intent classification with TextCNN, acoustic quality detection using VGGish‑BiLSTM, semantic similarity matching via DSSM and TextCNN‑Transformer, and slot‑filling with IDCNN‑CRF, along with performance results and future directions.

AINLUTextCNN
0 likes · 11 min read
Design and Implementation of Intent Recognition, Semantic Similarity Matching, and Slot Filling for a Voice Robot