phy.so
robots.txt
Robots Exclusion Standard data for phy.so
Resource Scan
Scan Details
Site Domain | phy.so |
Base Domain | phy.so |
Scan Status | Ok |
Last Scan | 2024-10-05T03:42:08+00:00 |
Next Scan | 2024-10-12T03:42:08+00:00 |
Last Scan
Scanned | 2024-10-05T03:42:08+00:00 |
URL | http://phy.so/robots.txt |
Redirect | https://phys.org/robots.txt |
Redirect Domain | phys.org |
Redirect Base | phys.org |
Domain IPs | 72.251.236.55 |
Redirect IPs | 2001:48c8:13:5::52, 72.251.233.232 |
Response IP | 72.251.233.232 |
Found | Yes |
Hash | 10b4f964466974da792a3bbfb3f593762e27b366db37a6a69dd6c64cf5c5a81e |
SimHash | 301c5943e0e3 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /search/ |
Disallow | /rss-feed/search/ |
Disallow | /rss-feed/breaking/search/ |
Disallow | /rss-feed/tags/ |
Disallow | /*/sort/ |
Other Records
Field | Value |
---|---|
sitemap | https://phys.org/sitemap/indx/ |