studylight.org
robots.txt
Robots Exclusion Standard data for studylight.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | studylight.org |
Base Domain | studylight.org |
Scan Status | Ok |
Last Scan | 2024-11-12T12:03:57+00:00 |
Next Scan | 2024-11-19T12:03:57+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-12T12:03:57+00:00 |
URL | https://studylight.org/robots.txt |
Redirect | https://www.studylight.org/robots.txt |
Redirect Domain | www.studylight.org |
Redirect Base | studylight.org |
Domain IPs | 74.63.248.118 |
Redirect IPs | 104.26.0.234, 104.26.1.234, 172.67.68.251, 2606:4700:20::681a:1ea, 2606:4700:20::681a:ea, 2606:4700:20::ac43:44fb |
Response IP | 172.67.68.251 |
Found | Yes |
Hash | 971ca78069c32e549ffa7639b9514bd2371efa6eefd9249688fca4aaee013477 |
SimHash | 12577b69efda |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /ajax/ |
Disallow | /scratchpad/ |
Disallow | /jscripts/ |
Disallow | /*?print=yes |
Disallow | /*?search= |
Disallow | /*?pn= |
Disallow | /*?st= |
Disallow | /*?q= |
Disallow | /*?q1= |
Disallow | /*?tr1= |
Disallow | /*?tr2= |
Disallow | /*?tr3= |
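
The rules in this group mix plain path prefixes (such as /cgi-bin/) with wildcard patterns (such as /*?q=). As a rough illustration of how a crawler might evaluate them, the sketch below treats `*` as "any run of characters" and a trailing `$` as an end anchor, which is the matching behavior described in RFC 9309. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical, chosen only for this example; the group has no Allow rules, so no precedence logic is included.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/cgi-bin/", "/ajax/", "/scratchpad/", "/jscripts/",
    "/*?print=yes", "/*?search=", "/*?pn=", "/*?st=",
    "/*?q=", "/*?q1=", "/*?tr1=", "/*?tr2=", "/*?tr3=",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces, then rejoin them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical request paths, not taken from the scan data.
    for sample in ["/cgi-bin/test.cgi", "/bible/?print=yes",
                   "/page?a=1&q=foo", "/"]:
        print(sample, is_disallowed(sample))
```

Note that a pattern like /*?q= only matches when `q` is the first query parameter, because the literal `?` must immediately precede it; a path such as /page?a=1&q=foo is not covered by these rules under this interpretation.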