Tag

Zhihu

0 views collected around this technical thread.

Python Programming Learning Circle
Python Programming Learning Circle
Jan 23, 2025 · Artificial Intelligence

Python Script for Scraping Zhihu “Beauty” Topic Images with Baidu AI Face Detection

This tutorial explains how to use Python 3 with Requests, lxml, and Baidu's AipFace SDK to crawl images from Zhihu's "美女" topic, filter them by face detection, gender, authenticity, and beauty score, and store the qualified pictures locally.

Web ScrapingZhihubaidu-ai
0 likes · 4 min read
Python Script for Scraping Zhihu “Beauty” Topic Images with Baidu AI Face Detection
Zhihu Tech Column
Zhihu Tech Column
Oct 10, 2024 · Artificial Intelligence

Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model

This article presents a method for massive multi-label text classification on Zhihu content by combining a semantic retrieval model with a proprietary large AI model, detailing the challenges of large label spaces, model architecture, loss optimization, and experimental results showing significant accuracy gains.

BGEZhihularge language model
0 likes · 16 min read
Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model
DataFunTalk
DataFunTalk
Dec 8, 2023 · Big Data

Zhihu Bridge Platform: Architecture, Capabilities, and Future Trends of Content Operations

This article presents a comprehensive overview of Zhihu's Bridge platform, detailing its content‑operation architecture—including content pool, management, analysis, monitoring, and intervention modules—explaining the underlying streaming and batch technologies such as Flink, Doris, and Elasticsearch, and outlining future automation and AI‑driven workflow directions.

AIBig DataContent Platform
0 likes · 17 min read
Zhihu Bridge Platform: Architecture, Capabilities, and Future Trends of Content Operations
Python Programming Learning Circle
Python Programming Learning Circle
Dec 4, 2023 · Backend Development

Scraping Zhihu "Beauty" Topic Images with Python and Baidu AI Face Detection

This article explains how to collect images from Zhihu's "beauty" topic using Python's Requests and lxml libraries, filter them with Baidu AI's AipFace face detection service, and store the qualified pictures locally, detailing the required environment, logic, and preparation steps.

PythonZhihubaidu-ai
0 likes · 5 min read
Scraping Zhihu "Beauty" Topic Images with Python and Baidu AI Face Detection
Python Programming Learning Circle
Python Programming Learning Circle
May 23, 2022 · Backend Development

Simulating Zhihu Login with Python Using urllib and Fiddler

This article demonstrates how to automate Zhihu login on Windows by analyzing network traffic with Fiddler, extracting required parameters, and implementing a Python script that builds HTTP requests using urllib2, handles cookies, captcha retrieval, and logs the results, complete with sample code and execution screenshots.

FiddlerHTTPPython
0 likes · 8 min read
Simulating Zhihu Login with Python Using urllib and Fiddler
Python Programming Learning Circle
Python Programming Learning Circle
Apr 6, 2022 · Backend Development

Scrapy‑Based Zhihu User Follow/Followers Crawler with MongoDB Storage

This tutorial demonstrates how to build a Scrapy spider that crawls Zhihu user follow and follower data via Zhihu’s public APIs, handles request headers, parses JSON responses, paginates results, and stores the extracted information into MongoDB using a custom item pipeline.

APIMongoDBScrapy
0 likes · 11 min read
Scrapy‑Based Zhihu User Follow/Followers Crawler with MongoDB Storage
DataFunSummit
DataFunSummit
Aug 29, 2021 · Artificial Intelligence

Zhihu Recommendation Page Ranking: Architecture, Feature Design, Model Evolution, and Practical Insights

This article presents a comprehensive overview of Zhihu's recommendation page ranking system, detailing the request flow, ranking evolution from time‑based to deep‑learning models, feature engineering strategies, model architectures such as DNN, DeepFM, DIN, multi‑task learning, and lessons learned for production deployment.

Feature EngineeringRankingZhihu
0 likes · 12 min read
Zhihu Recommendation Page Ranking: Architecture, Feature Design, Model Evolution, and Practical Insights
Architecture Digest
Architecture Digest
May 15, 2021 · Backend Development

Design and Migration of Zhihu's Read Service: High Availability, Performance, and TiDB Adoption

This article details Zhihu's read‑service architecture, covering its business requirements, high‑availability and high‑performance design goals, key components such as Proxy, Cache and Storage, extensive performance metrics, the migration from MySQL to TiDB, and the benefits brought by TiDB 3.0 features.

High AvailabilityTiDBZhihu
0 likes · 18 min read
Design and Migration of Zhihu's Read Service: High Availability, Performance, and TiDB Adoption
Python Programming Learning Circle
Python Programming Learning Circle
Apr 7, 2021 · Backend Development

Simulating Zhihu Login and Scraping Content with Python Requests

This tutorial demonstrates how to use Python's requests library to simulate Zhihu login by handling dynamic _xsrf tokens, optional captcha verification, saving cookies, and then crawling the main page to extract questions and answer abstracts.

Login SimulationPythonRequests
0 likes · 10 min read
Simulating Zhihu Login and Scraping Content with Python Requests
Youku Technology
Youku Technology
May 20, 2019 · Big Data

Data‑Driven Dating Guide: Analyzing Zhihu Answers to Identify Potential Partners

In a playful data‑driven experiment, the author scraped 27,664 Zhihu answers to “What are your dating criteria?”, filtered out short, outdated, high‑profile or already‑matched posts, applied follower‑and engagement‑thresholds to narrow the pool to 480 candidates, then ranked the top 30 by a like‑to‑comment ratio, sharing the code and dataset for reproducibility.

RankingZhihudata analysis
0 likes · 8 min read
Data‑Driven Dating Guide: Analyzing Zhihu Answers to Identify Potential Partners
360 Tech Engineering
360 Tech Engineering
May 20, 2019 · Fundamentals

A Data‑Driven Guide to Finding a Partner: From Crawling Zhihu Answers to Ranking Candidates

This article walks through a complete data‑analysis workflow—scraping Zhihu dating‑preference answers, cleaning and filtering the data, deriving gender and activity metrics, designing a four‑step screening process, and finally ranking candidates with a custom like‑to‑comment index—to help a single programmer create a concise, high‑quality list of potential partners.

MetricsRankingWeb Scraping
0 likes · 9 min read
A Data‑Driven Guide to Finding a Partner: From Crawling Zhihu Answers to Ranking Candidates
Efficient Ops
Efficient Ops
Nov 17, 2018 · Backend Development

How to Crawl Zhihu’s Funniest Answers with Python: A Simple Two‑Step Guide

This article shows how to use Python to scrape Zhihu answers, store them in MongoDB, filter for short high‑upvote replies, and then presents a collection of programmer‑centric jokes that illustrate the kind of "god replies" the crawler can retrieve.

MongoDBPythonWeb Scraping
0 likes · 14 min read
How to Crawl Zhihu’s Funniest Answers with Python: A Simple Two‑Step Guide