Show GN: 실제 사람처럼 테스트를 수행하는 SaaS
VLM 기반 화면 인식 및 제어를 통한 범용 AI 테스트 자동화
VLM 기반 화면 인식 및 제어를 통한 범용 AI 테스트 자동화
Claude Code 기반 DICOM 분석을 통한 MRI 오진 검증 사례
NASA tests AI medic for astronauts too far from Earth to call a doctor
12B DiT 기반의 창작 탐색형 이미지 모델 Krea 2 설계 및 학습 전략
Why stop gaming saved my tokens: Building my own local AI Lab
Lịch Sử OCR và Sự Ra Đời Khái Niệm Vision-First OCR
Every browser engine company builds on rented land. Anti-Enshittification doesn't.
M2 Max 기반 DiffusionGemma 26B 4-bit 양자화로 31.6 tok/s 달성
MolmoMotion: Language-guided 3D motion forecasting
Your RAG App Is Broken Because You're Still Parsing PDFs Like It's 2023
Physical AI has Scaling Laws now. The Race just became something else.
Document Automation in 2026: A Honest Comparison of the AI-Native Platforms
Mbodi AI (YC P25) Is Hiring Founding Machine Learning Engineer (Robotics)
한국어 공공기관 Long-Document 분석을 위한 KOLongDoc 벤치마크 공개
Your Next PC Is Not a Productivity Tool - It Is a Runtime for AI Agents
AI.Insaf (@ai_tablet) — Полный архив постов канала
I distilled a 7B vision model into a 2B one for screenshots — and the 7B teacher scored worse
📄Paper: RORA-VLM: Robust Retrieval Augmentation for Vision Language Models
AI-Orchestrated 3D Asset Pipeline: From JPEG to Game-Ready GLB Without Touching Blender
When AI Reads Blueprints: The Hidden Attack Surface of Multimodal Engineering Intelligence