green-search-engine.com
robots.txt

Robots Exclusion Standard data for green-search-engine.com

Resource Scan

Scan Details

Site Domain green-search-engine.com
Base Domain green-search-engine.com
Scan Status Ok
Last Scan 2024-05-28T13:38:53+00:00
Next Scan 2024-06-27T13:38:53+00:00

Last Scan

Scanned 2024-05-28T13:38:53+00:00
URL https://green-search-engine.com/robots.txt
Redirect https://www.green-search-engine.com/robots.txt
Redirect Domain www.green-search-engine.com
Redirect Base green-search-engine.com
Domain IPs 104.21.38.27, 172.67.218.33, 2606:4700:3031::ac43:da21, 2606:4700:3033::6815:261b
Redirect IPs 104.21.38.27, 172.67.218.33, 2606:4700:3031::ac43:da21, 2606:4700:3033::6815:261b
Response IP 172.67.218.33
Found Yes
Hash ec0067fb0925e79d7a0158e78266ecc507f27b30733f13fad8df5958bfad6110
SimHash 4814d6008792
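
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a fingerprint of this kind could be computed roughly as follows. The function name `fingerprint` and the sample body are hypothetical and are not taken from the scan.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file from the scan.
sample = b"User-agent: *\nDisallow: /\n"
digest = fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing the digest from one scan with the next is a cheap way to detect that the file changed at all; the SimHash field presumably serves the separate purpose of measuring how similar two versions are.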

Groups

googlebot

Rule Path
Disallow /info/
Disallow /search/

mediapartners-google

Rule Path
Disallow /info/
Disallow /search/

yahoo! slurp

Rule Path
Allow /$
Disallow /

bingbot

Rule Path
Allow /$
Disallow /

yandex

Rule Path
Allow /$
Disallow /

baiduspider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow

ips-agent

Rule Path
Disallow /parking.php4

blexbot

Rule Path
Disallow /

pandalytics

Rule Path
Disallow /info/
Disallow /search/

ioncrawl

Rule Path
Disallow /info/
Disallow /search/

*

Rule Path
Disallow /
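
To illustrate how the groups above would typically be evaluated, here is a minimal sketch of longest-match rule resolution in the style of RFC 9309. It transcribes only a subset of the listed groups. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and user-agent selection is simplified to an exact lowercase lookup with a fallback to `*`, which real crawlers may not follow exactly.

```python
import re

# A subset of the groups listed above, transcribed for illustration.
GROUPS = {
    "googlebot": [("disallow", "/info/"), ("disallow", "/search/")],
    "bingbot": [("allow", "/$"), ("disallow", "/")],
    "ia_archiver": [("disallow", "")],  # empty path: nothing is disallowed
    "*": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, trailing '$') to a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` under GROUPS."""
    # Simplification: exact lowercase match, otherwise the '*' group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow) of the winning rule so far
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches no URL
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # The longest pattern wins; on a tie, allow beats disallow.
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


print(is_allowed("bingbot", "/"))           # homepage only: Allow /$ wins
print(is_allowed("bingbot", "/page"))       # falls under Disallow /
print(is_allowed("googlebot", "/search/q"))
```

Under this reading, the `Allow /$` plus `Disallow /` pairs for yahoo! slurp, bingbot, and yandex permit only the homepage, the empty `Disallow` for ia_archiver blocks nothing, and any crawler without its own group falls through to the `*` group and is blocked entirely.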