corpusdoportugues.org

.org crawl

First seen 2026-04-22 · Last seen 2026-05-15 · ok HTTP/1.1 200 2158 ms crawled 2026-05-15

US · 198.185.159.144 · AS53831 Squarespace, Inc.

Reputation 87/100 weak security headers no dmarc policy

Classifying

HTML metadata

Title
Corpus do Português: 2.5 billion words: Dialects / Genres / Historical
Description
Largest full-featured corpora of Portuguese: Search by PoS, collocates, synonyms, genre, dialect, historical, etc. Downloadable data also.

Technology

Server
Microsoft-IIS
Fonts
  • Google Fonts

Third-party hosts loaded (3)

  • cdnjs.cloudflare.com×1
  • fonts.googleapis.com×1
  • www.english-corpora.org×1

Registration

Registrar
Squarespace Domains II LLC
Created
2006-05-22
Expires
2028-05-22 732 days left
Updated
2025-07-17
Name servers
  • ns-cloud-b1.googledomains.com
  • ns-cloud-b4.googledomains.com
  • ns-cloud-b2.googledomains.com
  • ns-cloud-b3.googledomains.com

DNS records live

NS
  • ns-cloud-b1.googledomains.com
  • ns-cloud-b2.googledomains.com
  • ns-cloud-b3.googledomains.com
  • ns-cloud-b4.googledomains.com
MX
  • 1 aspmx.l.google.com
  • 10 alt3.aspmx.l.google.com
  • 10 alt4.aspmx.l.google.com
  • 5 alt1.aspmx.l.google.com
  • 5 alt2.aspmx.l.google.com

Email authentication weak

SPF
v=spf1 include:_spf.google.com ~all
softfail (~all)
DMARC
not published
DKIM
  • google: v=DKIM1; k=rsa; p=MIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8AMIIBCgKCAQEAknZK0KryQisxKnFmjeYqEY++pEbKRKGIjhH9N5yevIcdy/oDJbn9gZcUYjqY7PZge2bZNh5MKee8nK…
selectors probed

Certificate (current)

R12
from 2026-03-21 to 2026-06-19
Expires in 29 days

HTTP security headers

Header hygiene 30/100 Checked live page: https://www.corpusdoportugues.org/

findings
  • missing HSTS
  • missing Content Security Policy
  • missing frame protection
  • missing content type protection
  • missing Referrer Policy
  • missing Permissions Policy

Links to (2)

Linked from (2)