Sign in Subscribe

shobyun

Mastering NVIDIA DGX Spark: From Local Connectivity to Serving 120B Models

Unlock your NVIDIA DGX Spark's full potential. Build a Private AI Cloud using NVIDIA Sync, Tailscale, and Docker scripts to deploy Ollama, ComfyUI, and TensorRT-LLM with one click.

NVIDIA DGX Spark 활용하기: 로컬 연결부터 120B 모델 서빙까지

고성능 AI 워크스테이션인 NVIDIA DGX Spark를 도입했지만, 단순한 로컬 접속만으로는 그 잠재력을 모두 끌어내기 어렵습니다. 이 글에서는 NVIDIA Sync를 통한 간편한 기기 관리, Tailscale을 이용한 안전한 원격 액세스, 그리고 Docker Custom Script를 활용해 클릭 한 번으로 생성형 AI 서비스(Ollama, ComfyUI, TensorRT-LLM)를 배포하고 관리하는 ‘나만의 AI 프라이빗 클라우드’ 구축 과정을 상세히 다룹니다.

Extending Notebook-Centric AI Workflows with Jupyter AI

This article introduces how to extend your Jupyter Notebook environment into a generative AI playground using Jupyter AI. From installation methods to chat interface usage, key slash (/) commands, and %%ai magic commands, we cover the essential features step by step.

Jupyter AI로 노트북 중심 AI 워크플로우 확장하기

이 글에서는 Jupyter AI를 활용해 Jupyter Notebook 환경을 생성형 AI 플레이그라운드로 확장하는 방법을 소개합니다. 설치 방법부터 채팅 인터페이스 활용, 주요 슬래시(/) 명령어와 %%ai 매직 명령어 사용까지, 기본 기능을 단계별로 정리했습니다.

NeMo Curator로 텍스트 큐레이션 파이프라인 구축하기

이 가이드는 NVIDIA NeMo Curator를 활용해 대규모 언어 모델(LLM) 학습에 필요한 고품질 데이터셋을 구축하는 방법을 다룹니다. 우리는 간단한 테스트 예시를 해 데이터 수집부터 클리닝, 중복 제거, 언어 라벨링까지, 체계적인 텍스트 큐레이션 파이프라인을 구축하고 실행하는 엔드투엔드 절차를 실습 중심으로 정리했습니다.

Building a Text Curation Pipeline with NeMo Curator

This guide uses NVIDIA NeMo Curator to build high-quality datasets for LLMs. It's a hands-on walk-through of a text curation pipeline—from cleaning and deduplication to language labeling—for preparing large-scale data quickly and reliably.

Evaluating LLMs with NeMo Evaluator: An End-to-End Guide from Standard Benchmarks to Custom Datasets

This guide shows how to use NVIDIA NeMo Evaluator in a PAASUP DIP environment. It covers the end-to-end process of evaluating LLMs connected via an NIM Proxy, using both standard benchmarks and custom data, from setup to result interpretation.

NeMo Evaluator로 LLM 평가하기: 표준 벤치마크부터 커스텀까지 엔드투엔드 가이드

이번 가이드는 PAASUP DIP 환경에서 NVIDIA NeMo Evaluator를 활용해 OpenAI 호환 엔드포인트(NIM Proxy) 에 연결하고, 표준 벤치마크(LM Evaluation Harness)와 커스텀 데이터로 LLM을 일관된 절차로 평가하는 방법을 다룹니다. 설정 → 타깃 등록 → 실행 → 결과 해석까지 엔드투엔드 흐름을 실습 중심으로 정리했습니다.

Deploying Serverless VLLM in a Private Environment

This is a practical guide for building an enterprise LLM serving environment using KServe with a vLLM backend on the PAASUP DIP platform. It presents various AI service operation methods, from programming through Jupyter Notebook to no-code web interfaces using OpenWebUI and Flowise.

PAASUP DIP로 구축하는 엔터프라이즈 LLM 서빙 가이드: vllm #1

PAASUP DIP 플랫폼에서 vLLM 백엔드 기반으로 KServe를 활용해 엔터프라이즈 LLM 서빙 환경을 구축하고, OpenWebUI와 Flowise를 통해 프로그래밍부터 노코드 웹 인터페이스까지 다양한 AI 서비스 운영법을 제시하는 실전 가이드입니다.