pedportal.net
robots.txt

Robots Exclusion Standard data for pedportal.net

Resource Scan

Scan Details

Site Domain pedportal.net
Base Domain pedportal.net
Scan Status Ok
Last Scan2024-07-03T22:20:20+00:00
Next Scan 2024-07-10T22:20:20+00:00

Last Scan

Scanned2024-07-03T22:20:20+00:00
URL https://pedportal.net/robots.txt
Domain IPs 185.191.197.97
Response IP 185.191.197.97
Found Yes
Hash a5f1b825ebe20328b1c83d415df5cd3702897e5a9db3e9194155b34b5e392ec2
SimHash 49008e624793

Groups

*

Rule Path
Disallow /*?
Allow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://pedportal.net/sitemap.xml

Warnings

  • `host` is not a known field.