clgw.net
robots.txt
Robots Exclusion Standard data for clgw.net
Resource Scan
Scan Details
Site Domain | clgw.net |
Base Domain | clgw.net |
Scan Status | Ok |
Last Scan | 2024-09-23T15:56:48+00:00 |
Next Scan | 2024-10-23T15:56:48+00:00 |
Last Scan
Scanned | 2024-09-23T15:56:48+00:00 |
URL | https://clgw.net/robots.txt |
Domain IPs | 64.192.69.145 |
Response IP | 64.192.69.145 |
Found | Yes |
Hash | 8f9882e6b5613e3f86a960483075488213775c86ac72c763759e2a89c76e61ee |
SimHash | 2c219b905676 |
Groups
*
Rule | Path |
---|---|
Disallow | /contact.php |
Disallow | /cgi-bin |
Disallow | /wp-admin |
Disallow | /wp-includes |
Disallow | /wp-content |
Disallow | /wp-login.php |
*
Rule | Path |
---|---|
Disallow | /disallowed_page.php |
*
Rule | Path |
---|---|
Disallow | /address |
Disallow | /blackhole |
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
adidxbot
applebot
applenewsbot
bingbot
bingpreview
bublupbot
ccbot
duckduckbot
duckduckgo-favicons-bot
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
mediapartners-google
mojeekbot
msnbot
msnbot-media
orangebot
pinterest
twitterbot
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | / |
Comments