Author

AI Cyberspace

AI, big data, cloud computing, and networking.

147

Articles

Likes

228

Views

Comments

Latest from AI Cyberspace


AI Cyberspace

Feb 13, 2026 · Artificial Intelligence

How Attention Mechanisms Revolutionized Computer Vision and Machine Translation

This article traces the evolution of attention mechanisms from their earliest applications in computer vision and machine translation to their central role in modern Transformer models, detailing the underlying RNN‑Attention designs, the breakthrough in sequence alignment, and the innovations that enabled high‑performance, parallelizable deep learning architectures.

Transformer · attention mechanism · computer vision

0 likes · 14 min read


AI Cyberspace

Feb 11, 2026 · Artificial Intelligence

From RNNs to LSTMs and GRUs: A Hands‑On Guide to Sequence Modeling in PyTorch

This tutorial explains the nature of sequential data, why traditional feed‑forward networks struggle with it, and how recurrent architectures such as RNN, LSTM, and GRU capture temporal dependencies, complete with mathematical foundations, training algorithms, and full PyTorch implementations for sentiment analysis, text generation, and encoder‑decoder models.

Encoder-Decoder · GRU · LSTM

0 likes · 57 min read


AI Cyberspace

Jan 29, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Efficient LLM Fine‑Tuning with LoRA, QLoRA, and Llama‑Factory

This tutorial explains the concepts, methods, and practical commands for fine‑tuning large language models using efficient techniques like LoRA and QLoRA, covering model selection, resource considerations, Docker deployment, dataset preparation, training configuration, evaluation metrics, model merging, and deployment with GGUF and Ollama.

GGUF · GPU memory optimization · LLM fine-tuning

0 likes · 27 min read


AI Cyberspace

Jan 26, 2026 · Artificial Intelligence

How NVFP4 Quantization Supercharges LLM Inference on NVIDIA DGX

This article explains the NVFP4 4‑bit floating‑point quantization technique, shows how to deploy Qwen3‑30B‑A3B models with TensorRT‑LLM and vLLM, compares performance across NVFP4, AWQ and INT8 quantizations, and provides practical profiling commands for NVIDIA DGX systems.

LLM · NVFP4 · NVIDIA DGX

0 likes · 23 min read


AI Cyberspace

Jan 18, 2026 · Artificial Intelligence

Understanding Supervised, Unsupervised, Self‑Supervised, Semi‑Supervised, and Reinforcement Learning for Large Language Model Training

The article explains various learning paradigms (supervised, unsupervised, self‑supervised, semi‑supervised, and reinforcement), describes dataset types and quality considerations, outlines preprocessing steps like filtering, deduplication, and tokenization, and discusses scaling laws linking model size, data volume, and compute resources, with concrete examples and code.

Machine Learning · data preprocessing · model training

0 likes · 26 min read


AI Cyberspace

Jan 13, 2026 · Artificial Intelligence

From Symbolic AI to LLMs: A Complete NLP History and Model Guide

This article provides a comprehensive overview of natural language processing, tracing its evolution from early symbolic and statistical stages through deep learning breakthroughs, detailing sequence models, key NLP tasks, text representation methods, and the development of modern architectures like RNN, LSTM, GRU, Transformer, and GPT series.

GPT · LSTM · NLP

0 likes · 60 min read


AI Cyberspace

Nov 19, 2025 · Artificial Intelligence

Why MPI and NCCL Are Critical for Scaling AI Models Across Thousands of GPUs

This article explains how AI model training has evolved from single‑GPU workloads to massive distributed training using MPI for CPU‑centric communication and NCCL for GPU‑centric communication, covering their histories, core concepts, programming interfaces, topology discovery, protocol choices, and performance testing on multi‑GPU clusters.

AI distributed training · GPU communication · High Performance Computing

0 likes · 71 min read


AI Cyberspace

Oct 15, 2025 · Artificial Intelligence

Why MCP Is Poised to Replace Function Calling for LLM Agents

The Model Context Protocol (MCP) introduced by Anthropic addresses the scalability, integration, and context‑transfer limitations of traditional Function Calling by offering a standardized, bidirectional, and context‑aware communication layer that simplifies tool discovery, security, and workflow orchestration for LLM‑driven agents.

AI Integration · LLM · agent

0 likes · 24 min read


AI Cyberspace

Oct 5, 2025 · Artificial Intelligence

AI Agent vs AI Workflow: Which Approach Suits Your Projects?

The article explains the differences between AI Agents and AI Workflows, compares their characteristics, introduces the hybrid Agentic Workflow concept, and offers practical recommendations for building enhanced LLM applications using simple prompts or advanced frameworks.

AI workflow · Artificial Intelligence · LLM

0 likes · 10 min read


AI Cyberspace

Oct 4, 2025 · Artificial Intelligence

Exploring OpenManus: A Deep Dive into an Open‑Source AI Agent Framework

This article provides a comprehensive overview of OpenManus, an open‑source, general‑purpose AI agent framework. It covers installation, configuration, the core architecture (BaseAgent, ReActAgent, ToolCallAgent, and Manus), the extensive tool collection, execution logs, and detailed code analysis for developers and AI researchers.

AI Agent · OpenManus · Python

0 likes · 74 min read
