schooljoin.com
robots.txt

Robots Exclusion Standard data for schooljoin.com

Resource Scan

Scan Details

Site Domain schooljoin.com
Base Domain schooljoin.com
Scan Status Ok
Last Scan 2024-11-11T00:42:27+00:00
Next Scan 2024-11-18T00:42:27+00:00

Last Scan

Scanned 2024-11-11T00:42:27+00:00
URL https://schooljoin.com/robots.txt
Domain IPs 104.21.88.158, 172.67.185.201, 2606:4700:3034::ac43:b9c9, 2606:4700:3035::6815:589e
Response IP 104.21.88.158
Found Yes
Hash 3330393c61f89bc74a8e9f2c694b66cabafa2e7df66cc0841d38d2e3d903606b
SimHash fd34ded747a3
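
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a change check over the fetched robots.txt body might look like the following. The function name `content_hash` is hypothetical, and the exact bytes the scanner hashes (raw body, normalized text, and so on) are not stated in the report.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the body with SHA-256; the report
    only shows a 64-character hex string and does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_hash(body) != previous_hash
```

Comparing digests between the last scan and the next scan is a cheap way to tell whether the file changed at all. A SimHash, by contrast, is normally used to judge how similar two versions are.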

Groups

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /
Disallow /404.shtml
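
The groups above can be evaluated with a short sketch. It assumes the longest-match precedence described in RFC 9309, where the most specific matching path wins and a tie goes to Allow. The `GROUPS` table is transcribed from this report, and `is_allowed` is a hypothetical helper. Matching the user agent by exact lowercase name and ignoring wildcards (`*`, `$`) are simplifications.

```python
# Rules as listed in the Groups section of this report:
# user agent -> list of (rule, path prefix).
GROUPS = {
    "googlebot": [("allow", "/")],
    "googlebot-mobile": [("allow", "/")],
    "googlebot-image": [("allow", "/")],
    "mediapartners-google": [("allow", "/"), ("disallow", "/404.shtml")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Decide whether user_agent may fetch path under the listed rules."""
    rules = GROUPS.get(user_agent.lower())
    if rules is None:
        # No matching group and no "*" group is listed in the report,
        # so nothing restricts this crawler.
        return True
    best_len, allowed = -1, True
    for rule, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_prefers_allow = len(prefix) == best_len and rule == "allow"
        if longer or tie_prefers_allow:
            best_len, allowed = len(prefix), rule == "allow"
    return allowed
```

Under these assumptions, only `mediapartners-google` is blocked from `/404.shtml`. Every other path, and every other crawler, is allowed. A parser that applies rules in file order instead of by longest match could reach a different result for that one path.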

Other Records

Field Value
sitemap https://schooljoin.com/sitemap.xml
sitemap https://schooljoin.com/sitemap_images.xml
sitemap https://schooljoin.com/feed_rss.xml

Warnings

  • 3 invalid lines.
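
The report does not say which lines were counted as invalid or what rule the scanner applies. As a rough sketch of how such a count could be produced, the following treats a line as invalid when it is not blank, not a comment, and not a `field: value` pair with a recognized field name. The `KNOWN_FIELDS` set and the function name `count_invalid_lines` are assumptions. Other parsers may accept more fields, such as `crawl-delay`.

```python
# Field names this sketch accepts; an assumption, not the scanner's list.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def count_invalid_lines(text: str) -> int:
    """Count lines that are neither blank, comments, nor known records."""
    invalid = 0
    for raw in text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue  # blank or comment-only lines are fine
        field, sep, _value = line.partition(":")
        if not sep or field.strip().lower() not in KNOWN_FIELDS:
            invalid += 1
    return invalid
```

For instance, a free-text line with no colon, or a record with an unrecognized field name, would each add one to the count.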