pai.pt
robots.txt

Robots Exclusion Standard data for pai.pt

Resource Scan

Scan Details

Site Domain pai.pt
Base Domain pai.pt
Scan Status Ok
Last Scan 2024-05-26T20:07:15+00:00
Next Scan 2024-06-02T20:07:15+00:00

Last Scan

Scanned 2024-05-26T20:07:15+00:00
URL https://pai.pt/robots.txt
Redirect https://www.pai.pt/robots.txt
Redirect Domain www.pai.pt
Redirect Base pai.pt
Domain IPs 2001:4860:4802:32::15, 2001:4860:4802:34::15, 2001:4860:4802:36::15, 2001:4860:4802:38::15, 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 2001:4860:4802:32::15, 2001:4860:4802:34::15, 2001:4860:4802:36::15, 2001:4860:4802:38::15, 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Response IP 216.239.38.21
Found Yes
Hash d2e2dfddbf11db7d8e5c4bd2678b33c368890da95a910852af1c1c4feb142565
SimHash 4a5fdc30f898

Groups

*

Rule Path
Disallow /pa/
Disallow /*/claims/
Disallow /reports/
Disallow /users/
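
The /*/claims/ rule relies on wildcard matching, so a rough sketch of how a crawler might evaluate the four paths above may help. It assumes the common convention that "*" matches any run of characters and a trailing "$" anchors the end of the path. The helper names and the sample paths are hypothetical and are not taken from the scan.

```python
import re

# Disallow paths listed for the "*" group in the scan above.
DISALLOW = ["/pa/", "/*/claims/", "/reports/", "/users/"]


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(p).match(path) for p in DISALLOW)


# Hypothetical paths, for illustration only.
print(is_disallowed("/example/claims/1"))  # wildcard rule applies
print(is_disallowed("/"))                  # no rule applies
```

Because this group has no Allow rules, precedence between competing Allow and Disallow matches does not come up here, and the sketch does not attempt to model it.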

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

mozilla/5.0 (windows nt 5.1) applewebkit/537.36 (khtml, like gecko) chrome/30.0.1599.101 safari/537.36

Rule Path
Disallow /
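
As a rough illustration of how these groups are selected per crawler, the sketch below rebuilds part of the file from the tables above and queries it with Python's standard urllib.robotparser. Two things are left out on purpose: the /*/claims/ rule, because that module does plain prefix matching without wildcards, and the group keyed on a full browser User-Agent string. The reconstructed lines are an approximation of the scanned file, not its original text, and "ExampleBot" is a made-up crawler name.

```python
from urllib.robotparser import RobotFileParser

# Bots that the scan lists with a blanket "Disallow /".
BLOCKED_BOTS = [
    "blexbot", "mj12bot", "twengabot", "ahrefsbot",
    "semrushbot", "wotbox", "sosospider", "zumbot",
]

# Approximate reconstruction of the scanned groups (wildcard rule omitted).
lines = [
    "User-agent: *",
    "Disallow: /pa/",
    "Disallow: /reports/",
    "Disallow: /users/",
    "",
]
for bot in BLOCKED_BOTS:
    lines += ["User-agent: " + bot, "Disallow: /", ""]

parser = RobotFileParser()
parser.parse(lines)

# A named bot falls into its own group and is blocked everywhere.
print(parser.can_fetch("AhrefsBot", "https://www.pai.pt/"))
# Any other crawler falls back to the "*" group.
print(parser.can_fetch("ExampleBot", "https://www.pai.pt/"))
print(parser.can_fetch("ExampleBot", "https://www.pai.pt/users/1"))
```

A crawler that matches a named group uses only that group, so the four path rules under "*" apply solely to crawlers that are not listed by name.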

Other Records

Field Value
sitemap https://storage.googleapis.com/poetic-primer-235017.appspot.com/public/sitemap.xml.gz
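
The sitemap record points at a gzip-compressed XML file. A minimal sketch of reading such a file is shown below, assuming it follows the sitemaps.org 0.9 schema. The scan does not show the sitemap's contents, so the sample document and its URL are invented for illustration, and sitemap_locs is a hypothetical helper name. Downloading the file is left to the caller.

```python
import gzip
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org 0.9 schema (assumed to apply here).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_locs(gz_bytes):
    """Decompress a .xml.gz sitemap and return the text of every <loc>."""
    root = ET.fromstring(gzip.decompress(gz_bytes))
    # <loc> appears in both <urlset> and <sitemapindex> documents.
    return [el.text.strip() for el in root.iter(SITEMAP_NS + "loc") if el.text]


# Invented sample standing in for the real, unseen sitemap.
sample = gzip.compress(
    b'<?xml version="1.0" encoding="UTF-8"?>'
    b'<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    b"<url><loc>https://www.pai.pt/example</loc></url>"
    b"</urlset>"
)
print(sitemap_locs(sample))
```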

Warnings

  • 2 invalid lines.