pagalworld.us
robots.txt
Robots Exclusion Standard data for pagalworld.us
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pagalworld.us |
Base Domain | pagalworld.us |
Scan Status | Failed |
Failure Stage | Fetching resource |
Failure Reason | Server returned a client error (HTTP 4xx status) |
Last Scan | 2024-11-17T21:30:06+00:00 |
Next Scan | 2024-12-01T21:30:06+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-10-10T21:29:09+00:00 |
URL | https://pagalworld.us/robots.txt |
Redirect | https://www.pagalworld.us/robots.txt |
Redirect Domain | www.pagalworld.us |
Redirect Base | pagalworld.us |
Domain IPs | 104.21.37.83, 172.67.206.34, 2606:4700:3034::6815:2553, 2606:4700:3034::ac43:ce22 |
Redirect IPs | 104.21.37.83, 172.67.206.34, 2606:4700:3034::6815:2553, 2606:4700:3034::ac43:ce22 |
Response IP | 104.21.37.83 |
Found | Yes |
Hash | 0539ffab938a5ba1c476901b40bd9e9a2da01e1c44cc090a8e17edb1da306a80 |
SimHash | 955f5bcacf77 |
Groups
*
Rule | Path |
---|---|
Allow | /*.html |
Allow | /_big/* |
Allow | /_small/* |
Allow | /includes/* |
Allow | /*.xml |
Disallow | /db/ |
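
How a crawler might evaluate the rules in the `*` group can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's or any specific crawler's implementation. It assumes the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), with `*` as a wildcard and a trailing `$` as an end anchor. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for this example, and the sample paths are made up.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/*.html"),
    ("allow", "/_big/*"),
    ("allow", "/_small/*"),
    ("allow", "/includes/*"),
    ("allow", "/*.xml"),
    ("disallow", "/db/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; every other character is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence (assumed, per RFC 9309) to a URL path."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # A longer pattern is more specific; Allow wins a tie.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for sample in ["/db/", "/db/dump.sql", "/db/index.html", "/sitemap.xml", "/songs/track.mp3"]:
        print(sample, "allowed" if is_allowed(sample) else "disallowed")
```

Under these assumptions, the only effective restriction is `/db/`, and even there a path ending in `.html` or `.xml` would still be allowed because `/*.html` and `/*.xml` are longer patterns than `/db/`. Parsers that do not support wildcards, or that use first-match ordering, may reach different results.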