teratail.com
robots.txt

Robots Exclusion Standard data for teratail.com

Resource Scan

Scan Details

Site Domain teratail.com
Base Domain teratail.com
Scan Status Ok
Last Scan2024-10-31T01:46:33+00:00
Next Scan 2024-11-07T01:46:33+00:00

Last Scan

Scanned2024-10-31T01:46:33+00:00
URL https://teratail.com/robots.txt
Domain IPs 13.112.92.95, 35.73.232.219, 57.180.182.237
Response IP 35.73.232.219
Found Yes
Hash 5ef0a27f578fc32f60713a6c668710abeeff3c9753b4680ce2c5ca15f0de0774
SimHash 150cfb30dcf3

Groups

*

Rule Path
Disallow /questions/search
Disallow /questions/input
Disallow /questions/complete
Disallow /users/
Disallow /*connections
Disallow /*badges
Disallow /login
Disallow /signup
Disallow /register
Disallow /notifications
Disallow /lp/
Disallow /sakura-cloud/certification/
Disallow /search
Disallow /rss/
Disallow /api/
Allow /users

twitterbot
facebookexternalhit
facebot

Rule Path
Allow /sakura-cloud/certification/

baiduspider

Rule Path
Disallow /

baiduimagespider

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

baidumobaider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://teratail.com/sitemap-index.xml

Comments

  • robotstxt.org/