crichighlights.com
robots.txt
Robots Exclusion Standard data for crichighlights.com
Resource Scan
Scan Details
Site Domain | crichighlights.com |
Base Domain | crichighlights.com |
Scan Status | Ok |
Last Scan | 2024-09-30T09:55:11+00:00 |
Next Scan | 2024-10-07T09:55:11+00:00 |
Last Scan
Scanned | 2024-09-30T09:55:11+00:00 |
URL | https://crichighlights.com/robots.txt |
Domain IPs | 104.21.39.138, 172.67.146.24, 2606:4700:3030::6815:278a, 2606:4700:3031::ac43:9218 |
Response IP | 172.67.146.24 |
Found | Yes |
Hash | 7e24ad3eb020099f493b804d7543c8eab8eaa84c3e38d6520cba53d2198328e2 |
SimHash | e13c980269b2 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /page/ |
Disallow | /tag/ |
Disallow | /feed/ |
Disallow | /?tabgarb=tab5%2F |
Disallow | /?tabgarb=tab4%2F |
Disallow | /?tabgarb=tab3%2F |
Disallow | /?tabgarb=tab2%2F |
Disallow | /?tabgarb=tab1%2F |
Disallow | /?tabgarb%2F |
Disallow | /wp-includes/ |
Disallow | /amp/ |
Other Records
Field | Value |
---|---|
sitemap | https://crichighlights.com/sitemap.xml |