wwwcache.highschoolot.com
robots.txt

Robots Exclusion Standard data for wwwcache.highschoolot.com

Resource Scan

Scan Details

Site Domain	wwwcache.highschoolot.com
Base Domain	highschoolot.com
Scan Status	Ok
Last Scan	2024-09-18T04:29:09+00:00
Next Scan	2024-09-25T04:29:09+00:00

Last Scan

Scanned	2024-09-18T04:29:09+00:00
URL	https://wwwcache.highschoolot.com/robots.txt
Domain IPs	3.164.85.105, 3.164.85.120, 3.164.85.128, 3.164.85.17
Response IP	18.165.140.110
Found	Yes
Hash	ff1bade7a3e6f00eb0f1919ebc23db1711f7193c0f1ecdf8d6e08e01bff17025
SimHash	f0175851a5c0

Groups

grapeshot
ia_archiver
bingbot
bing
facebot
facebookexternalhit
googlebot
google
mediapartners-google
slurp
twitterbot
ubersuggest

Rule	Path
Disallow	/*?print_friendly
Disallow	/search/
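
The `/*?print_friendly` rule relies on wildcard matching, where `*` stands for any run of characters. The sketch below shows one way a crawler might evaluate the two rules in this group, assuming it follows the RFC 9309 special characters (`*` for any sequence, a trailing `$` for end of path). The helper names and the sample paths are hypothetical and are not taken from the scanned site.

```python
import re

# The two Disallow paths listed for this group in the table above.
RULES = ["/*?print_friendly", "/search/"]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumes RFC 9309 semantics: '*' matches any sequence of characters
    and a trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


# Hypothetical paths, used only to exercise the two rules.
print(is_disallowed("/story/123?print_friendly=1", RULES))
print(is_disallowed("/search/teams", RULES))
print(is_disallowed("/story/123", RULES))
```

Note that the path passed in includes the query string, since the first rule can only match when `?print_friendly` is part of the text being compared.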

amazonbot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

awariorssbot
awariosmartbot

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

claude-web

Rule	Path
Disallow	/

cohere-ai

Rule	Path
Disallow	/

dataforseobot

Rule	Path
Disallow	/

diffbot

Rule	Path
Disallow	/

facebookbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

magpie-crawler

Rule	Path
Disallow	/

newsnow

Rule	Path
Disallow	/

news-please

Rule	Path
Disallow	/

omgili

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

peer39_crawler
peer39_crawler/1.0

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/

scrapy

Rule	Path
Disallow	/

turnitinbot

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/
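
To see how these groups interact, the sketch below rebuilds a small part of the file from the tables above and queries it with Python's standard `urllib.robotparser`. It is a partial reconstruction under stated assumptions: only three groups are included, the live file may order or word its records differently, and the agent name `examplebot` and the paths are hypothetical. `urllib.robotparser` matches rule paths by prefix and does not treat `*` inside a path as a wildcard, so the `/*?print_friendly` rule is parsed here but not exercised.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the scan tables; not the verbatim live file.
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: bingbot
Disallow: /*?print_friendly
Disallow: /search/

User-agent: gptbot
Disallow: /

User-agent: *
Disallow: /
"""

BASE = "https://wwwcache.highschoolot.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A named search crawler is blocked only under /search/ (by prefix match).
print(parser.can_fetch("googlebot", BASE + "/search/results"))
print(parser.can_fetch("googlebot", BASE + "/football/"))

# A crawler with its own "Disallow: /" group is blocked everywhere.
print(parser.can_fetch("gptbot", BASE + "/football/"))

# Any agent without its own group falls through to the "*" group.
print(parser.can_fetch("examplebot", BASE + "/football/"))
```

The effect of the layout is an allowlist: the crawlers named in the first group may fetch everything except print-friendly and search URLs, while every other agent, whether listed individually or caught by `*`, is disallowed from the whole site.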

Other Records

Field	Value
sitemap	http://www.highschoolot.com/sitemap_index.xml

Comments

HighSchoolOT.com robots.txt
