Tagged articles

13 articles

Page 1 of 1

Apr 26, 2026 · Artificial Intelligence

Can You Explain Large Model Training Without Complex Formulas? A Simple, Clear Guide

This article breaks down the fundamentals of large model training—covering data, parameters, neural networks, loss functions, gradient descent, pre‑training, and fine‑tuning—in plain language so readers can grasp how massive models learn without needing to dive into complex mathematics.

fine-tuninggradient descentloss function

0 likes · 12 min read

Can You Explain Large Model Training Without Complex Formulas? A Simple, Clear Guide

Machine Learning Algorithms & Natural Language Processing

Feb 9, 2026 · Artificial Intelligence

Time‑o1: Overcoming Time‑Series Forecasting Bottlenecks with a Novel Loss Function

The paper identifies two fundamental issues in time‑series forecasting—label autocorrelation bias and task‑scale explosion caused by the standard TMSE loss—and proposes Time‑o1, a PCA‑based orthogonal label transformation that eliminates bias, reduces optimization complexity, and yields consistent performance gains across multiple models and datasets.

NeurIPS 2025PCATime‑o1

0 likes · 12 min read

Time‑o1: Overcoming Time‑Series Forecasting Bottlenecks with a Novel Loss Function

Qborfy AI

Jul 3, 2025 · Artificial Intelligence

Why Loss Functions Matter: From Theory to Real‑World AI Applications

This article explains what loss functions are, outlines their three essential components, categorizes them for regression, classification, and generation tasks, reviews five classic loss functions with their noise resistance and gradient traits, and offers practical guidelines for selecting the right loss for AI models.

AI fundamentalsMachine Learningclassification

0 likes · 4 min read

Why Loss Functions Matter: From Theory to Real‑World AI Applications

AI Frontier Lectures

May 11, 2025 · Artificial Intelligence

How VA‑VAE Boosts Diffusion Model Generation: SOTA Results & LightningDiT Insights

This article analyzes the VA‑VAE approach that aligns visual tokenizers with vision foundation models to resolve the reconstruction‑generation trade‑off in latent diffusion models, detailing the VF loss design, adaptive weighting, LightningDiT enhancements, experimental setup, and state‑of‑the‑art ImageNet performance.

LightningDiTVAEloss function

0 likes · 16 min read

How VA‑VAE Boosts Diffusion Model Generation: SOTA Results & LightningDiT Insights

Cognitive Technology Team

Apr 9, 2025 · Artificial Intelligence

How Neural Networks Learn: Gradient Descent and Loss Functions

This article explains how neural networks learn by using labeled training data, describing the role of weights, biases, activation functions, and how gradient descent iteratively adjusts parameters to minimize loss, illustrated with the MNIST digit‑recognition example.

MNISTdeep learninggradient descent

0 likes · 16 min read

How Neural Networks Learn: Gradient Descent and Loss Functions

IT Services Circle

Dec 31, 2024 · Artificial Intelligence

Understanding Linear Regression, Loss Functions, and Gradient Descent: A Conversational Guide

This article uses a dialogue format to introduce the fundamentals of linear regression, explain how loss functions such as mean squared error quantify prediction errors, and describe gradient descent as an iterative optimization technique for finding the best model parameters, illustrated with simple numeric examples and visual aids.

AI basicsMachine Learninggradient descent

0 likes · 13 min read

Understanding Linear Regression, Loss Functions, and Gradient Descent: A Conversational Guide

NewBeeNLP

Dec 3, 2024 · Artificial Intelligence

Can LLMs Self‑Correct Their Answers? Exploring Reward Models, Loss Functions, and Training Dynamics

The article reflects on open‑source LLMs like Qwen2 and Llama 3.1, questioning whether models should self‑review answers, how hidden states might signal uncertainty, the role of loss‑function design, scaling laws, and the trade‑offs between PPO and DPO in alignment.

Large Language ModelsReward ModelScaling Law

0 likes · 9 min read

Can LLMs Self‑Correct Their Answers? Exploring Reward Models, Loss Functions, and Training Dynamics

Model Perspective

Sep 10, 2024 · Artificial Intelligence

Why Cross-Entropy Is the Key Loss Function for Classification Models

This article explains how loss functions evaluate model performance, contrasts regression’s mean squared error with classification’s cross‑entropy, describes one‑hot encoding and softmax outputs, and shows why higher predicted probabilities for the correct class yield lower loss, highlighting applications in image, language, and speech tasks.

Machine LearningSoftmaxclassification

0 likes · 5 min read

Why Cross-Entropy Is the Key Loss Function for Classification Models

Liangxu Linux

Mar 23, 2024 · Artificial Intelligence

Understanding AI Neurons: A Storytelling Guide to Basics of Neural Networks

This article uses a narrative of an AI neuron to explain fundamental concepts of neural networks, including neuron structure, weighted sums, activation functions, loss functions, gradient descent, and learning rate, making complex AI topics accessible to beginners.

AI basicsMachine LearningNeural Network

0 likes · 9 min read

Understanding AI Neurons: A Storytelling Guide to Basics of Neural Networks

Bilibili Tech

Mar 1, 2024 · Artificial Intelligence

Bilibili's Self-Developed Video Super-Resolution Algorithm: Background, Optimization Directions, and Implementation Details

Bilibili’s self‑supervised video super‑resolution system upgrades low‑resolution streams to 4K by using three parallel degradation‑branch networks—texture‑enhancing, line‑recovering, and noise‑removing—tailored to anime, game, and real‑world content, delivering sharper edges, finer textures, and measurable quality gains across its online playback pipeline.

AIBilibilideep learning

0 likes · 16 min read

Bilibili's Self-Developed Video Super-Resolution Algorithm: Background, Optimization Directions, and Implementation Details

DataFunTalk

Aug 27, 2019 · Artificial Intelligence

How Machines Learn: From Newton’s Second Law to the Core Steps of Supervised Learning

This article illustrates how a machine can rediscover Newton’s second law by treating force and acceleration data as a simple linear regression problem, detailing the three fundamental steps of hypothesis space definition, loss function design, and optimization through calculus or gradient methods.

Machine LearningNewton's lawOptimization

0 likes · 15 min read

How Machines Learn: From Newton’s Second Law to the Core Steps of Supervised Learning

DataFunTalk

Apr 25, 2019 · Artificial Intelligence

Comparison of Classification and Ranking Models in Recommendation Systems

This article examines the differences and similarities between classification (pointwise) and ranking (pairwise) models for recommendation systems, covering their probabilistic foundations, loss functions, parameter updates, and practical implications such as sensitivity to statistical features and robustness.

Machine LearningRecommendation Systemsclassification model

0 likes · 10 min read

Comparison of Classification and Ranking Models in Recommendation Systems

Meituan Technology Team

Feb 21, 2019 · Artificial Intelligence

Deep Learning-Based ETA Estimation in Meituan's Delivery System

Meituan’s delivery ETA system progressed from linear regression to DeepFM, enriching user, rider, merchant, and spatiotemporal features, employing an asymmetric loss and business‑rule integration to favor early arrivals, adding a tail‑adjustment term, and is engineered with Spark‑assembled TFRecords, multi‑GPU TensorFlow training, and remote‑served TensorFlow Java inference achieving sub‑5 ms TP99 latency.

ETALong TailTensorFlow Serving

0 likes · 15 min read

Deep Learning-Based ETA Estimation in Meituan's Delivery System