nacadeiradapapa.com
robots.txt

Robots Exclusion Standard data for nacadeiradapapa.com

Resource Scan

Scan Details

Site Domain nacadeiradapapa.com
Base Domain nacadeiradapapa.com
Scan Status Ok
Last Scan2026-03-02T10:11:39+00:00
Next Scan 2026-03-09T10:11:39+00:00

Last Scan

Scanned2026-03-02T10:11:39+00:00
URL https://nacadeiradapapa.com/robots.txt
Redirect https://www.nacadeiradapapa.com/robots.txt
Redirect Domain www.nacadeiradapapa.com
Redirect Base nacadeiradapapa.com
Domain IPs 104.21.4.88, 172.67.131.224, 2606:4700:3035::6815:458, 2606:4700:3035::ac43:83e0
Redirect IPs 104.21.4.88, 172.67.131.224, 2606:4700:3035::6815:458, 2606:4700:3035::ac43:83e0
Response IP 104.21.4.88
Found Yes
Hash bfae020bfec324e96dbec6868a218c8768c99d96d85181114127ec9ac20fbac2
SimHash 6805d904c311

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /?attachment_id
Disallow /attachment/
Disallow /attachment/*
Disallow /members/
Disallow /membros/
Disallow /readme.html
Disallow /refer/
Disallow /page/
Disallow /page/*
Disallow /undefined/*
Disallow /undefined/
Disallow /*.php$

Other Records

Field Value
sitemap https://www.nacadeiradapapa.com/sitemap_index.xml

Comments

  • Paths (no clean URLs)