Tag

open-source AI


Java Architecture Diary
Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is the New Powerhouse in Open‑Source AI Models

Qwen3 introduces a suite of open‑source models, ranging from a 235B mixture‑of‑experts flagship to compact 0.6B versions, that offers performance competitive with top proprietary models, multilingual support, flexible thinking modes, and low deployment requirements. The article includes detailed usage instructions for Ollama and OpenRouter.

Ollama · Qwen3 · large language model
0 likes · 8 min read
AntTech
Apr 21, 2025 · Artificial Intelligence

InclusionAI Community to Present AReaL Reinforcement Learning Framework and AWorld Multi‑Agent Framework at ICLR 2025

The InclusionAI open‑source community, initiated by Ant Group, will showcase the latest advances of its reinforcement‑learning framework AReaL and multi‑agent framework AWorld at the ICLR 2025 conference in Singapore, highlighting performance breakthroughs, open‑source contributions, and industry‑focused AI research.

AReaL · AWorld · Ant Group
0 likes · 5 min read
Code Mala Tang
Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most attention remains on the familiar AI giants, China’s Alibaba and DeepSeek are releasing open‑source video and reasoning models that run on consumer GPUs. The releases have set off a regulatory scramble and threaten the dominance of closed‑source AI, signaling a rapid, disruptive shift across the industry.

AI localization · AI regulation · AI video
0 likes · 10 min read
DataFunSummit
Feb 25, 2025 · Artificial Intelligence

Tiny‑R1‑32B‑Preview: A 5% Parameter Model Matching Deepseek‑R1‑671B Performance

On February 24, 2025, 360 and Peking University unveiled Tiny‑R1‑32B‑Preview, a mid‑sized reasoning model that has only about 5% as many parameters as the 671‑billion‑parameter DeepSeek‑R1 yet achieves comparable performance, with leading results on math, programming, and science benchmarks.

AI model · Benchmarking · Model Distillation
0 likes · 7 min read
ZhongAn Tech Team
Feb 16, 2025 · Artificial Intelligence

DeepSeek R1 and V3: Model Innovations, Industry Impact, and Future Trends

The article reviews DeepSeek's open‑source R1 and V3 large language models, highlighting their technical breakthroughs, cost advantages, expert opinions, industry adoption across chips, cloud services, and applications, and discusses future directions for model scaling, distillation, and AI competition.

AI Industry · AI competition · DeepSeek
0 likes · 13 min read
Java Captain
Feb 7, 2025 · Artificial Intelligence

DeepSeek: Disruptive Innovations in Large Language Model Architecture, Efficiency, and Ecosystem

DeepSeek reshapes the AI landscape by replacing brute‑force compute scaling with algorithmic breakthroughs such as a novel MoE architecture, memory compression, active‑learning data pipelines, and open‑source tooling, delivering dramatically lower training and inference costs while enabling edge deployment and a vibrant developer ecosystem.

Algorithmic Efficiency · DeepSeek · Large Language Models
0 likes · 11 min read
Java Tech Enthusiast
Feb 5, 2025 · Artificial Intelligence

DeepSeek: AI Breakthrough and Recruitment Insights

DeepSeek’s open‑source R1 model challenged the prevailing belief that closed‑source giants like OpenAI dominate AI progress, achieving a reasoning breakthrough driven purely by reinforcement learning with its GRPO algorithm. The release sparked global excitement and political concern, and the company is now hiring engineers aggressively in Beijing and Hangzhou, offering competitive packages of 14 months' salary per year while expecting publications at top conferences.

AI Development · DeepSeek · GRPO algorithm
0 likes · 7 min read
DevOps
Jan 25, 2025 · Artificial Intelligence

DeepSeek R1: An Open‑Source Large Model Matching OpenAI’s o1 at a Fraction of the Cost

DeepSeek’s newly released R1 model delivers performance comparable to OpenAI’s o1 while cutting inference costs by 90‑95%, leveraging innovative MLA and MoE architectures, low‑cost hardware training, an open‑source strategy, and a youthful, flat‑structured team that challenges the AI industry’s high‑spending model.

AI Startup · Artificial Intelligence · Cost‑Efficient Training
0 likes · 12 min read
Kuaishou Tech
Jul 11, 2024 · Artificial Intelligence

Kuaishou Open-Sources Kolors: A High-Performance Text-to-Image Model Rivaling Midjourney v6

Kuaishou has officially open-sourced Kolors, a state-of-the-art text-to-image diffusion model that leverages ChatGLM3 for advanced bilingual text understanding and employs a two-stage training strategy to achieve photographic image quality rivaling leading proprietary systems.

Large Language Models · Text-to-Image Generation · computer vision
0 likes · 8 min read
IT Services Circle
Jun 9, 2024 · Artificial Intelligence

Plagiarism Allegations Between Stanford's Llama3‑V and China's MiniCPM‑Llama3‑V 2.5 Model

The article details the controversy surrounding Stanford's Llama3‑V team admitting to copying the architecture and code of the Chinese MiniCPM‑Llama3‑V 2.5 model, presents new evidence of weight similarity, compares performance metrics, and discusses broader concerns about the recognition of Chinese AI research in the open‑source community.

AI ethics · Llama3-V · MiniCPM
0 likes · 9 min read
DataFunSummit
Oct 27, 2023 · Artificial Intelligence

ChatGPT Technology, Domesticization Attempts, and Open‑Source Large Models

This article reviews the evolution and challenges of ChatGPT technology, describes the authors' efforts to localize and commercialize the model for the Chinese market, and introduces their open‑source Chinese large‑model initiative, including training methods, performance gaps, and future improvement directions.

ChatGPT · Chinese NLP · Large Language Models
0 likes · 11 min read
DataFunSummit
May 17, 2023 · Artificial Intelligence

OpenAI Announces Plans to Release a New Open‑Source Large Language Model

OpenAI is set to launch its first open‑source large language model in four years, sparking debate over how this move could reshape the competitive landscape of AI, affect models like LLaMA, and intensify the open‑source versus closed‑source rivalry with Google.

AI competition · Artificial Intelligence · Large Language Models
0 likes · 6 min read
DataFunTalk
Feb 20, 2023 · Artificial Intelligence

ChatGPT Technology, Localization Efforts, and Open‑Source Large Models – Overview and Practices

This article presents an overview of ChatGPT technology, its evolution, current challenges, a three‑stage learning process, data organization and evaluation, details of domestic localization efforts, practical solutions, and the release of a Chinese open‑source large model with training guidance.

ChatGPT · Model Localization · data annotation
0 likes · 12 min read
Baidu Tech Salon
Sep 2, 2022 · Artificial Intelligence

WAIC 2022: AI Open Source and Industrial Intelligence Summit Highlights China's AI Ecosystem Development

At the WAIC 2022 AI Open Source and Industrial Intelligence Summit in Shanghai, Baidu’s CTO outlined a TSMC‑like model for large‑scale AI, academicians highlighted intelligent vehicle connectivity and open‑source leadership, a new deep‑learning transformation base was unveiled, and PaddlePaddle’s 4.77 million developers underscored China’s rapidly expanding AI ecosystem across industry.

Artificial Intelligence · Baidu · China AI Ecosystem
0 likes · 6 min read