english-corpora.org

.org crawl

First seen 2026-04-14 · Last seen 2026-05-17 · ok HTTP/1.1 200 3548 ms crawled 2026-05-09

US · 198.49.23.144 · AS53831 Squarespace, Inc.

Reputation 87/100 weak security headers no dmarc policy

Classifying

HTML metadata

Title
English Corpora: most widely used online corpora. Billions of words of data: free online access
Description
Compare genres, dialects, time periods; use AI; search by PoS, collocates, synonyms, and much more.

Technology

Server
Microsoft-IIS
Fonts
  • Google Fonts

Third-party hosts loaded (2)

  • cdnjs.cloudflare.com×1
  • fonts.googleapis.com×1

Social

Registration

Registrar
Squarespace Domains II LLC
Created
2017-11-26
Expires
2027-11-26 554 days left
Updated
2025-07-17
Name servers
  • ns-cloud-d4.googledomains.com
  • ns-cloud-d1.googledomains.com
  • ns-cloud-d3.googledomains.com
  • ns-cloud-d2.googledomains.com

DNS records live

NS
  • ns-cloud-d1.googledomains.com
  • ns-cloud-d2.googledomains.com
  • ns-cloud-d3.googledomains.com
  • ns-cloud-d4.googledomains.com
MX
  • 1 aspmx.l.google.com
  • 10 alt3.aspmx.l.google.com
  • 10 alt4.aspmx.l.google.com
  • 5 alt1.aspmx.l.google.com
  • 5 alt2.aspmx.l.google.com

Email authentication weak

SPF
v=spf1 include:_spf.google.com ~all
softfail (~all)
DMARC
not published
DKIM
  • google: v=DKIM1; k=rsa; p=MIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8AMIIBCgKCAQEAqz4A8v4iFzf22Y6m3ukXQKMD3DI9Onfv3ix7gghXQcKAmYWe54xwpvLQOeUftGH5yQNr+WgmIdVRWA…
selectors probed

Certificate (current)

R12
from 2026-03-22 to 2026-06-20
Expires in 31 days

HTTP security headers

Header hygiene 30/100 Checked live page: https://www.english-corpora.org/

findings
  • missing HSTS
  • missing Content Security Policy
  • missing frame protection
  • missing content type protection
  • missing Referrer Policy
  • missing Permissions Policy

Links to (7)

Linked from (12)