reference.com
robots.txt
Robots Exclusion Standard data for reference.com
Resource Scan
Scan Details
Site Domain | reference.com |
Base Domain | reference.com |
Scan Status | Ok |
Last Scan | 2024-05-08T00:53:30+00:00 |
Next Scan | 2024-05-15T00:53:30+00:00 |
Last Scan
Scanned | 2024-05-08T00:53:30+00:00 |
URL | https://reference.com/robots.txt |
Redirect | https://www.reference.com/robots.txt |
Redirect Domain | www.reference.com |
Redirect Base | reference.com |
Domain IPs | 199.232.46.114 |
Redirect IPs | 151.101.130.114, 151.101.194.114, 151.101.2.114, 151.101.66.114 |
Response IP | 199.232.46.114 |
Found | Yes |
Hash | 76ddfeafaf1ebd7fa7d0cc21c79ac1405fa2dcaf7df920873de2ba9c798c4c44 |
SimHash | 81241a40c792 |
Groups
*
Rule | Path |
---|---|
Disallow | /web |
Disallow | /Web |
Disallow | /WEB |
Disallow | /browse |
Disallow | /article/ |
Disallow | /slideshow/ |
Disallow | /fragment |
Disallow | /_clk |
Disallow | /slp |
Disallow | /log/ |
Disallow | *ad%3D |
Disallow | *aq%3D |
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
sitemap | https://www.reference.com/sitemap.xml |
Comments