Show HN: An AI zettelkasten that extracts ideas from articles, videos, and PDFs

Original link: https://github.com/schoblaska/jargon

## Jargon: Your AI-Powered Knowledge Base

Jargon is an AI-powered zettelkasten designed to help you build and explore an interconnected knowledge base. It automatically processes articles, papers, and videos (including YouTube transcripts), summarizing their content and extracting key ideas into standalone, linkable "insight cards". Using semantic embeddings, Jargon identifies relationships between concepts, merges duplicate information, and surfaces connections across your knowledge base. It can also dynamically expand your research by generating web searches from each insight, then integrating the new findings directly into your library. You can query Jargon directly to retrieve relevant insights, augmented with real-time web results obtained via neural search. Built with Ruby and leveraging tools such as OpenAI, pgvector, and Exa, Jargon offers a powerful and flexible system for knowledge management and discovery. It can be deployed via Docker Compose and customized with API keys for various LLM providers.

## Jargon: An AI-Powered Zettelkasten

Developer schoblaska has released Jargon, an AI-managed zettelkasten built with Rails, pgvector, and other tools. Jargon automatically extracts key ideas from articles, PDFs, and YouTube videos, creating "insight cards" and linking related concepts. Users can feed in content via links or pasted text, or ask questions directly, triggering a library search plus fresh web results (using RAG). A key feature is the ability to spawn "research threads": automated web searches based on an initial insight. The project uses Opus 4.5 for idea extraction and provides a visual graph for exploring connections. One commenter noted the importance of *active* idea extraction in the traditional zettelkasten method, sparking a discussion about AI's role in that process. The code is available on GitHub: [https://github.com/schoblaska/jargon](https://github.com/schoblaska/jargon).

Original text

Jargon is an AI-managed zettelkasten that parses articles, papers, and videos into index card-sized key ideas. It summarizes sources, extracts ideas, links related concepts, and collapses duplicates. Semantic embeddings surface connections across the library.

Each source is parsed in context of existing cards, generating new insights that link back to the original material. The result is a knowledge base of interlinked ideas that can be explored directly or used as a RAG to answer questions. Questions also pull results from the web, which flow through the same extract/summarize/link pipeline before being synthesized with library content.

  1. Ingest — Articles, PDFs, and YouTube videos are scraped and parsed
  2. Summarize — LLM distills each source into a concise summary
  3. Extract — Key ideas become standalone insight cards with source links
  4. Connect — Embeddings find related insights; duplicates collapse automatically
  5. Thread — Each node gets research questions that search the web for more sources
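The five stages above can be sketched as a simple sequential pipeline. This is an illustrative sketch only; the class and struct names here are assumptions, not Jargon's actual code:

```ruby
# Hypothetical sketch of the ingest pipeline; names are illustrative.
Source  = Struct.new(:url, :text)
Insight = Struct.new(:title, :explanation, :source_url)

class Pipeline
  # Each stage is a callable, so LLM-backed steps can be swapped for stubs.
  def initialize(summarize:, extract:, connect:, thread:)
    @summarize, @extract, @connect, @thread = summarize, extract, connect, thread
  end

  def ingest(source)
    summary  = @summarize.call(source)         # LLM distills the source
    insights = @extract.call(source, summary)  # standalone insight cards
    @connect.call(insights)                    # link/collapse via embeddings
    insights.each { |i| @thread.call(i) }      # spawn research questions
    insights
  end
end
```

Because each stage is injected, the expensive LLM and web-search steps can be stubbed out in tests while the flow between stages stays the same.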

Academic papers and PDFs are automatically downloaded and converted to text using pdftotext. Jargon follows "full text" and DOI links from abstracts.
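Following DOI links out of an abstract could be done with a regex scan like the one below. This is a sketch under assumed names; the pattern and helper are not taken from Jargon's implementation:

```ruby
# Hypothetical helper: pull DOI identifiers out of abstract text and
# turn them into resolvable https://doi.org/ URLs.
DOI_PATTERN = %r{\b10\.\d{4,9}/[-._;()/:A-Za-z0-9]+}

def doi_links(text)
  text.scan(DOI_PATTERN).map { |doi| "https://doi.org/#{doi}" }
end
```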

YouTube URLs are detected and transcripts are fetched directly from YouTube's API. Speakers are extracted from video titles when available.
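Detecting a YouTube URL and pulling out its video ID might look like the following sketch (the regex and helper name are assumptions, covering only the common `watch?v=`, `youtu.be`, and `/shorts/` forms):

```ruby
# Hypothetical YouTube URL detector; extracts the 11-character video ID.
YOUTUBE_ID = %r{
  (?: youtube\.com/(?:watch\?v=|shorts/) | youtu\.be/ )
  ([A-Za-z0-9_-]{11})
}x

def youtube_video_id(url)
  url[YOUTUBE_ID, 1]  # returns the capture group, or nil for non-YouTube URLs
end
```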

Key findings are extracted as standalone insights with titles, explanations, and source snippets. Insights are independently searchable and linkable.

Articles and insights are embedded using OpenAI's text-embedding-3-small model. Embeddings power similarity search and automatic clustering.

Similar articles (syndicated content, republished pieces) are automatically grouped using vector similarity and title matching. Similar insights cluster into themes.
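The duplicate-collapsing idea can be sketched as greedy grouping by cosine similarity. In Jargon the similarity search runs over pgvector embeddings in PostgreSQL; this is a plain-Ruby illustration with a hypothetical threshold:

```ruby
# Cosine similarity between two embedding vectors.
def cosine(a, b)
  dot = a.zip(b).sum { |x, y| x * y }
  dot / (Math.sqrt(a.sum { |x| x * x }) * Math.sqrt(b.sum { |x| x * x }))
end

# Greedily group items whose embeddings exceed a similarity threshold;
# items within a group are treated as duplicates of the first member.
# The 0.95 threshold is an illustrative assumption.
def collapse_duplicates(items, threshold: 0.95)
  groups = []
  items.each do |item|
    group = groups.find { |g| cosine(g.first[:embedding], item[:embedding]) >= threshold }
    if group
      group << item
    else
      groups << [item]
    end
  end
  groups
end
```

A production version would use pgvector's distance operators to avoid comparing every pair in Ruby.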

Each insight can spawn research threads—questions that trigger web searches via Exa to find related articles. Discovered articles are automatically ingested and indexed.

Ask a question or enter a topic to search your library. Jargon finds relevant insights using semantic similarity and displays them alongside the source articles.

Augment library results with fresh content from the web. Results are fetched via Exa's neural search and automatically ingested into your library.

  • Rails and Hotwire
  • Falcon - Async Ruby application server with fiber-based concurrency
  • async-job - Background job processing without a separate worker process
  • RubyLLM - Unified interface to OpenAI, Anthropic, Gemini, and OpenRouter
  • ruby_llm-schema - Structured JSON output from LLMs via schema definitions
  • pgvector - Vector similarity search in PostgreSQL
  • Exa - Neural search API for finding related content
  • crawl4ai - Fallback web scraper with browser rendering
  • pdftotext - Text extractor for PDF content

Copy .env.example to .env and configure:

Set API keys for the providers you want to use. RubyLLM supports OpenRouter, OpenAI, Anthropic, and Google Gemini:

OPENROUTER_API_KEY=your-key    # OpenRouter (default, proxies all providers)
OPENAI_API_KEY=your-key        # Direct OpenAI access
ANTHROPIC_API_KEY=your-key     # Direct Anthropic access
GEMINI_API_KEY=your-key        # Direct Google Gemini access

Model and Provider Selection

Override default models and providers via environment variables:

LLM_MODEL=google/gemini-2.5-flash              # Chat model (default)
LLM_PROVIDER=openrouter                        # Chat provider (default)
EMBEDDING_MODEL=openai/text-embedding-3-small  # Embedding model (default)
EMBEDDING_PROVIDER=openrouter                  # Embedding provider (default)

Provider must match the API key you're using. OpenRouter model names use provider/model format.
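Resolving those environment variables with the documented fallbacks might look like this (the helper name is an assumption; only the variable names and defaults come from the table above):

```ruby
# Read model/provider configuration from the environment, falling back
# to the documented defaults.
def llm_config
  {
    model:              ENV.fetch("LLM_MODEL", "google/gemini-2.5-flash"),
    provider:           ENV.fetch("LLM_PROVIDER", "openrouter"),
    embedding_model:    ENV.fetch("EMBEDDING_MODEL", "openai/text-embedding-3-small"),
    embedding_provider: ENV.fetch("EMBEDDING_PROVIDER", "openrouter")
  }
end
```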

Set SECRET_KEY_BASE instead of using config/master.key:

SECRET_KEY_BASE=secret-key-base

Fallback crawler when Exa is unavailable. Install via pip:

pip install crawl4ai
crawl4ai-setup  # Downloads browser dependencies

Used for extracting text from PDF documents (academic papers, etc.). Install via poppler:

# macOS
brew install poppler

# Ubuntu/Debian
apt-get install poppler-utils

Run Jargon with Docker Compose using the published image from GitHub Container Registry.

Create a docker-compose.yml:

services:
  jargon:
    image: ghcr.io/schoblaska/jargon:latest
    ports:
      - "3000:80"
    env_file: .env
    environment:
      DATABASE_URL: postgres://postgres:postgres@db:5432/jargon
      REDIS_URL: redis://redis:6379/0
    depends_on:
      db:
        condition: service_healthy
      redis:
        condition: service_started

  db:
    image: pgvector/pgvector:pg17
    volumes:
      - postgres_data:/var/lib/postgresql/data
    environment:
      POSTGRES_USER: postgres
      POSTGRES_PASSWORD: postgres
      POSTGRES_DB: jargon
    healthcheck:
      test: ["CMD-SHELL", "pg_isready -U postgres"]
      interval: 5s
      timeout: 5s
      retries: 5

  redis:
    image: redis:7-alpine
    volumes:
      - redis_data:/data

volumes:
  postgres_data:
  redis_data:

Create a .env file with your secrets:

SECRET_KEY_BASE=secret-key-base
OPENROUTER_API_KEY=your-openrouter-key
EXA_API_KEY=your-exa-key

Start the stack:

docker compose up -d

The app will be available at http://localhost:3000.

  • replace Exa with Brave search and use crawl4ai to ingest
    • do parallel searches across multiple LLM-generated queries?
  • refactor IngestArticleJob
  • define schemas inline where they're used
  • export markdown to clipboard
  • full text search
  • generate search query async
  • don't show superseded insights in autocomplete
  • youtube thumbnails as article image
  • visual distinction between articles and ideas