pdf4llm.com

First seen 2026-04-29 · Last seen 2026-05-07 · OK · HTTP/1.1 200 · 1180 ms · crawled 2026-05-07

US · 216.150.1.1 · AS16509 Amazon.com, Inc.

Reputation 92/100 · no DMARC policy

HTML metadata

Title
PDF4LLM — The Pre-LLM Document Processing Layer
Description
PDF4LLM converts raw PDFs into structured, layout-aware Markdown before they ever reach your LLM. Python, .NET, and JavaScript. No GPU. No cloud. Self-hosted.
Language
en
Canonical
https://pdf4llm.com

Open Graph

url
https://pdf4llm.com
title
PDF4LLM — The Pre-LLM Document Processing Layer
locale
en_US
site name
PDF4LLM
description
PDF4LLM converts raw PDFs into structured, layout-aware Markdown before they ever reach your LLM. Python, .NET, and JavaScript. No GPU. No cloud. Self-hosted.

Technology

CDN
Vercel
Framework
Next.js
Analytics
  • Google Tag Manager
Cookie consent
  • Cookiebot

Third-party hosts loaded (2)

  • consent.cookiebot.com ×1
  • www.googletagmanager.com ×1

Registration

Registrar
Gandi SAS
Created
2024-04-23
Expires
2027-04-23 · 339 days left
Updated
2026-03-24
Name servers
  • ns-11-b.gandi.net
  • ns-112-c.gandi.net
  • ns-28-a.gandi.net

DNS records · live

NS
  • ns-11-b.gandi.net
  • ns-112-c.gandi.net
  • ns-28-a.gandi.net
MX
  • 10 spool.mail.gandi.net
  • 50 fb.mail.gandi.net

Email authentication · partial

SPF
v=spf1 include:_mailcust.gandi.net ?all
neutral (?all)
DMARC
not published
DKIM
no key found at common selectors
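
The "neutral (?all)" reading above comes from the qualifier in front of the record's `all` mechanism. Here is a minimal sketch of that lookup, assuming the qualifier names defined in RFC 7208. The function name `spf_all_policy` is hypothetical and is not part of the report's tooling.

```python
from typing import Optional

# SPF qualifier characters and their result names (RFC 7208).
QUALIFIERS = {"+": "pass", "-": "fail", "~": "softfail", "?": "neutral"}


def spf_all_policy(record: str) -> Optional[str]:
    """Return the result name attached to the `all` mechanism, or None if absent."""
    terms = record.split()
    if not terms or terms[0].lower() != "v=spf1":
        raise ValueError("not an SPF record")
    for term in terms[1:]:
        # A mechanism without an explicit qualifier defaults to "+" (pass).
        if term[0] in QUALIFIERS:
            qualifier, mechanism = term[0], term[1:]
        else:
            qualifier, mechanism = "+", term
        if mechanism.lower() == "all":
            return QUALIFIERS[qualifier]
    return None


# The record published for this domain, as listed above.
policy = spf_all_policy("v=spf1 include:_mailcust.gandi.net ?all")
print(policy)
```

A "neutral" result makes no statement about senders outside the listed ones. Together with the unpublished DMARC record, that is consistent with the "partial" status shown for this section.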

Certificate (current)

R12
from 2026-04-23 to 2026-07-22
Expires in 64 days
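
The day counts in this report can be cross-checked with plain date arithmetic. The sketch below uses only dates shown in the report. The "as of" date it derives is an inference from the two countdowns, not a value the report states.

```python
from datetime import date, timedelta

cert_start = date(2026, 4, 23)      # certificate "from"
cert_end = date(2026, 7, 22)        # certificate "to"
domain_expiry = date(2027, 4, 23)   # registration "Expires"

# Length of the certificate's validity window in days.
validity_days = (cert_end - cert_start).days

# Work backwards from each countdown to the date it was computed on.
as_of_from_cert = cert_end - timedelta(days=64)
as_of_from_domain = domain_expiry - timedelta(days=339)

print(validity_days, as_of_from_cert, as_of_from_domain)
```

Both countdowns resolve to the same date, so "64 days" and "339 days left" are consistent with each other.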

HTTP security headers

Header hygiene 50/100 · Checked live page: https://www.pdf4llm.com/

present
  • strict-transport-security
findings
  • missing Content Security Policy
  • missing frame protection
  • missing content type protection
  • missing Referrer Policy
  • missing Permissions Policy
Header values
strict-transport-security
max-age=63072000
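
The findings above can be reproduced from a set of response headers. This is a minimal sketch, not the report's own method. The mapping from finding labels to header names is an assumption: "frame protection" is taken to mean `X-Frame-Options` (a CSP `frame-ancestors` directive would also provide it) and "content type protection" to mean `X-Content-Type-Options`. The helper names are hypothetical, and how the 50/100 score is computed is not modeled.

```python
import re

# Assumed mapping from response header name to the report's finding label.
EXPECTED = {
    "content-security-policy": "Content Security Policy",
    "x-frame-options": "frame protection",
    "x-content-type-options": "content type protection",
    "referrer-policy": "Referrer Policy",
    "permissions-policy": "Permissions Policy",
}


def missing_headers(headers):
    """Return the finding labels for expected headers that are absent."""
    present = {name.lower() for name in headers}
    return [label for name, label in EXPECTED.items() if name not in present]


def hsts_max_age_days(value):
    """Convert the max-age directive of an HSTS header value to days."""
    match = re.search(r"max-age=(\d+)", value, re.IGNORECASE)
    if match is None:
        raise ValueError("no max-age directive")
    return int(match.group(1)) / 86400  # 86400 seconds per day


# Headers as listed in this report for https://www.pdf4llm.com/
observed = {"strict-transport-security": "max-age=63072000"}

for label in missing_headers(observed):
    print("missing", label)
print(hsts_max_age_days(observed["strict-transport-security"]), "days")
```

The HSTS value of 63072000 seconds works out to 730 days, or two years.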

Links to (4)

Linked from (2)