onthisday.com
robots.txt
Robots Exclusion Standard data for onthisday.com
Resource Scan
Scan Details
Site Domain | onthisday.com |
Base Domain | onthisday.com |
Scan Status | Ok |
Last Scan | 2024-05-22T00:48:11+00:00 |
Next Scan | 2024-05-29T00:48:11+00:00 |
Last Scan
Scanned | 2024-05-22T00:48:11+00:00 |
URL | https://onthisday.com/robots.txt |
Redirect | https://www.onthisday.com/robots.txt |
Redirect Domain | www.onthisday.com |
Redirect Base | onthisday.com |
Domain IPs | 104.27.201.89, 104.27.202.89, 2606:4700:21::681b:c959, 2606:4700:21::681b:ca59 |
Redirect IPs | 104.27.201.89, 104.27.202.89, 2606:4700:21::681b:c959, 2606:4700:21::681b:ca59 |
Response IP | 104.27.201.89 |
Found | Yes |
Hash | 4d5d7c7f51e60e0a8df64205674842f81094b98d0a91959e3ed982670a20dd5b |
SimHash | 34134942e710 |
Groups
*
Rule | Path |
---|---|
Disallow | /cdn-cgi/ |
Disallow | /cgi-sys/ |
Disallow | /1006136/ |
Disallow | /film-tv/film-tv/ |
Disallow | /music/music/ |
Disallow | /sport/sports.php/ |