gogakuru.com
robots.txt

Robots Exclusion Standard data for gogakuru.com

Resource Scan

Scan Details

Site Domain gogakuru.com
Base Domain gogakuru.com
Scan Status Ok
Last Scan 2024-11-09T14:29:51+00:00
Next Scan 2024-11-16T14:29:51+00:00

Last Scan

Scanned 2024-11-09T14:29:51+00:00
URL https://gogakuru.com/robots.txt
Domain IPs 13.35.210.112, 13.35.210.23, 13.35.210.73, 13.35.210.91
Response IP 13.35.210.112
Found Yes
Hash 4188667fd84bbe34d6d694cbe8c0664571dcd83501b888613e5579db44698f8d
SimHash c11d1954fd93

Groups

sogou web spider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

googlebot

Rule Path
Disallow /english/special/minitest.html
Disallow /chinese/special/minitest.html
Disallow /hangeul/special/minitest.html
Disallow /mypage/myphrasetest.html
Disallow /mypage/mycollectiontest.html
Disallow /mobile/%E4%BC%9A%E5%93%A1%E7%89%B9%E5%85%B8.html

mediapartners-google

Rule Path
Disallow (empty path, which permits all URLs)

*

Rule Path
Disallow /faq/
Disallow /contact/
Disallow /legal/
Disallow /index.php?flow=enPrint&
Disallow /index.php?flow=resetPassword
Disallow /index.php?flow=entryStep1

Other Records

Field Value
crawl-delay 1800

Other Records

Field Value
sitemap http://gogakuru.com/sitemap.xml
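
The groups above can be checked against sample URLs with Python's standard urllib.robotparser module. The sketch below is only an illustration: the robots.txt text is a reconstruction from the scan fields, not the file as served, so the capitalization of agent names, the placement of crawl-delay inside the * group, and the helper name allowed are all assumptions. Note also that urllib.robotparser uses first-match, substring-based agent matching, which may differ from how individual crawlers interpret the file.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file; the real file may differ in
# ordering, capitalization, comments, and where Crawl-delay is placed.
ROBOTS_TXT = """\
User-agent: sogou web spider
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: baiduspider
Disallow: /

User-agent: sitesucker
Disallow: /

User-agent: googlebot
Disallow: /english/special/minitest.html
Disallow: /chinese/special/minitest.html
Disallow: /hangeul/special/minitest.html
Disallow: /mypage/myphrasetest.html
Disallow: /mypage/mycollectiontest.html
Disallow: /mobile/%E4%BC%9A%E5%93%A1%E7%89%B9%E5%85%B8.html

User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /faq/
Disallow: /contact/
Disallow: /legal/
Disallow: /index.php?flow=enPrint&
Disallow: /index.php?flow=resetPassword
Disallow: /index.php?flow=entryStep1
Crawl-delay: 1800

Sitemap: http://gogakuru.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on gogakuru.com?"""
    return parser.can_fetch(agent, "https://gogakuru.com" + path)


if __name__ == "__main__":
    for agent, path in [
        ("AhrefsBot", "/"),
        ("Googlebot", "/english/special/minitest.html"),
        ("Googlebot", "/faq/"),
        ("Mediapartners-Google", "/faq/"),
        ("ExampleBot", "/faq/"),
        ("ExampleBot", "/"),
    ]:
        print(agent, path, allowed(agent, path))
    print("crawl delay for ExampleBot:", parser.crawl_delay("ExampleBot"))
    print("sitemaps:", parser.site_maps())  # site_maps() needs Python 3.8+
```

Under this reconstruction, a crawler with its own group (for example googlebot) is evaluated only against that group, so /faq/ is blocked for unnamed agents via the * group but not for googlebot. ExampleBot is a made-up agent name used to exercise the * group.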