Tagged articles

9 articles

Page 1 of 1

Jul 6, 2024 · Artificial Intelligence

ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4

This article provides a comprehensive technical overview of the ChatGLM series—from the original ChatGLM‑6B model and its GLM‑based pre‑training framework to the enhancements in ChatGLM‑2, the architectural parity of ChatGLM‑3, and the advanced capabilities of the latest ChatGLM‑4, covering model structure, position encoding, attention mechanisms, multi‑task pretraining, and tool integration.

AIChatGLMGLM

0 likes · 25 min read

ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4

Rare Earth Juejin Tech Community

Nov 24, 2023 · Artificial Intelligence

Step-by-Step Guide to Deploying LangChain‑Chatchat with ChatGLM‑2 on a Local Machine

This article provides a comprehensive tutorial on setting up the LangChain‑Chatchat open‑source project, covering environment preparation, model and embedding downloads, configuration files, database initialization, API service launch, and example curl commands for interacting with both the large language model and the local knowledge base.

APIChatGLMEmbedding

0 likes · 9 min read

Step-by-Step Guide to Deploying LangChain‑Chatchat with ChatGLM‑2 on a Local Machine

JD Retail Technology

Oct 26, 2023 · Artificial Intelligence

Leveraging Large Language Models for Text-to-SQL: Prompt Design and End-to-End Pipeline

This article explains how large language models can be used to convert natural language queries into SQL statements, describes two main approaches—direct generation and fine‑tuned open‑source models—details prompt engineering techniques, and outlines an end‑to‑end pipeline that executes the generated SQL and summarizes results.

ChatGLMData PrivacyLLM

0 likes · 7 min read

Leveraging Large Language Models for Text-to-SQL: Prompt Design and End-to-End Pipeline

Huawei Cloud Developer Alliance

Oct 25, 2023 · Artificial Intelligence

Unlocking GLM & ChatGLM: Deep Dive into MindSpore Large‑Model Techniques

The MindSpore Season 2 open class offers a comprehensive overview of GLM to ChatGLM architectures, positional‑embedding strategies, stable training optimizations, and step‑by‑step instructions for deploying large language models with Ascend, ModelArts, and MindSpore Transformers, while previewing upcoming multimodal remote‑sensing sessions.

Artificial IntelligenceChatGLMGLM

0 likes · 6 min read

Unlocking GLM & ChatGLM: Deep Dive into MindSpore Large‑Model Techniques

DaTaobao Tech

Sep 11, 2023 · Artificial Intelligence

Large Language Model Upgrade Paths and Architecture Selection

This article analyzes upgrade paths of major LLMs—ChatGLM, LLaMA, Baichuan—detailing performance, context length, and architectural changes, then examines essential capabilities, data cleaning, tokenizer and attention design, and offers practical guidance for balanced scaling and efficient model construction.

BaichuanChatGLMLLM architecture

0 likes · 32 min read

Large Language Model Upgrade Paths and Architecture Selection

UCloud Tech

Aug 14, 2023 · Artificial Intelligence

Deploy ChatGLM2‑6B on UCloud K8S: Complete Guide to Large Language Model Inference

This article reviews the architectures, training methods, and key characteristics of major open‑source large language models such as BERT, GPT, T5, LLaMA and ChatGLM, and then provides a step‑by‑step tutorial for deploying ChatGLM2‑6B on UCloud's UK8S platform using UFS storage, Kubernetes manifests, and command‑line tools.

ChatGLMKubernetesUCloud

0 likes · 14 min read

Deploy ChatGLM2‑6B on UCloud K8S: Complete Guide to Large Language Model Inference

DaTaobao Tech

Jul 12, 2023 · Artificial Intelligence

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

The article details a workflow that converts the PyTorch ChatGLM‑6B model to MNN, splits and compresses embeddings, applies int4/int8 quantization, supports dynamic shapes, and uses hybrid GPU/CPU or CPU‑only loading to enable low‑memory edge inference on PCs and mobile devices with competitive token‑per‑second performance.

ChatGLMLLMMNN

0 likes · 16 min read

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

Volcano Engine Developer Services

Jun 30, 2023 · Cloud Native

Deploy Langchain‑ChatGLM on Volcengine VKE: A Step‑by‑Step Cloud‑Native Guide

This tutorial walks you through preparing a VKE cluster, pulling the Langchain‑ChatGLM container image, creating the necessary Deployment and Service resources, and adding a local knowledge base, enabling you to run a Langchain‑based ChatGLM service with GPU support on Volcengine’s cloud‑native platform.

AI deploymentChatGLMGPU

0 likes · 6 min read

Deploy Langchain‑ChatGLM on Volcengine VKE: A Step‑by‑Step Cloud‑Native Guide

Full-Stack Trendsetter

May 18, 2023 · Artificial Intelligence

How 360 and ChatGLM Are Building China’s “Microsoft + OpenAI” Large‑Model Duo

On May 16, 360 and Zhipu AI announced a strategic partnership to co‑develop the trillion‑parameter models 360GLM and 360GPT, positioning them as China’s answer to Microsoft‑OpenAI by combining large‑scale pre‑training, bilingual capabilities, and integration with 360’s search and browser ecosystem.

360AI CollaborationChatGLM

0 likes · 7 min read

How 360 and ChatGLM Are Building China’s “Microsoft + OpenAI” Large‑Model Duo