Robots Exclusion Standard data for togesmp.pages.dev

Resource Scan

Scan Details

Site Domain togesmp.pages.dev
Base Domain togesmp.pages.dev
Scan Status Ok
Last Scan 2025-10-15T17:35:53+00:00
Next Scan 2025-11-14T17:35:53+00:00

Last Scan

Scanned 2025-10-15T17:35:53+00:00
URL https://togesmp.pages.dev/robots.txt
Domain IPs 172.66.44.252, 172.66.47.4, 2606:4700:310c::ac42:2cfc, 2606:4700:310c::ac42:2f04
Response IP 172.66.44.252
Found Yes
Hash f929e1cb87ea7e1949a487e1ae38a1c91951ebe94c827a64538eb4101514f74c
SimHash 091dd941e1c3

Groups

*

Rule Path
Disallow /video/*
Disallow /?s=*
Disallow /?q=*
Disallow /search/*
Disallow /?page=*
Allow /
Allow /category/
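
The `*` group mixes a broad `Allow /` with narrower `Disallow` patterns, so the outcome for a given path depends on which rule takes precedence. The sketch below is one way to evaluate the rules listed above, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and `Allow` wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and do not come from the scan, and real crawlers may differ in details.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/video/*"),
    ("disallow", "/?s=*"),
    ("disallow", "/?q=*"),
    ("disallow", "/search/*"),
    ("disallow", "/?page=*"),
    ("allow", "/"),
    ("allow", "/category/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins; Allow wins on equal length.
    A path that matches no rule is treated as allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            length = len(pattern)
            if length > best_len or (length == best_len and allow and not best_allow):
                best_len, best_allow = length, allow
    return best_allow
```

Under these assumptions, `is_allowed("/video/clip")` and `is_allowed("/?s=term")` are false because the longer `Disallow` patterns outrank `Allow /`, while `is_allowed("/category/news")` and `is_allowed("/about")` are true.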

googlebot

Rule Path
Allow /video/*

bingbot

Rule Path
Allow /video/*

yandexbot

Rule Path
Allow /video/*

baiduspider

Rule Path
Allow /video/*

duckduckbot

Rule Path
Allow /video/*

applebot

Rule Path
Allow /video/*

sogou spider

Rule Path
Allow /video/*

yahoo slurp

Rule Path
Allow /video/*

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /
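
A crawler that follows RFC 9309 applies only the group whose user-agent token matches its own name (compared case-insensitively) and falls back to the `*` group when no named group matches. One consequence for the groups above is that a named crawler such as googlebot would be governed by its own `Allow /video/*` rule alone and would not inherit the `Disallow` rules from `*`. The sketch below illustrates that selection step. `GROUPS` and `select_group` are hypothetical names, and the group contents are transcribed from the listing above.

```python
# Rules for the catch-all group, copied from the "*" listing.
WILDCARD_RULES = [
    ("disallow", "/video/*"),
    ("disallow", "/?s=*"),
    ("disallow", "/?q=*"),
    ("disallow", "/search/*"),
    ("disallow", "/?page=*"),
    ("allow", "/"),
    ("allow", "/category/"),
]

# Crawlers that each have a single "Allow /video/*" rule.
VIDEO_ALLOWED = [
    "googlebot", "bingbot", "yandexbot", "baiduspider",
    "duckduckbot", "applebot", "sogou spider", "yahoo slurp",
]

# Crawlers that each have a single "Disallow /" rule.
FULLY_BLOCKED = [
    "gptbot", "amazonbot", "applebot-extended", "meta-externalagent",
    "bytespider", "ccbot", "claudebot",
]

GROUPS = {"*": WILDCARD_RULES}
GROUPS.update({name: [("allow", "/video/*")] for name in VIDEO_ALLOWED})
GROUPS.update({name: [("disallow", "/")] for name in FULLY_BLOCKED})


def select_group(user_agent_token, groups=GROUPS):
    """Return the rule list for a crawler token.

    Assumption: exact, case-insensitive token match, else the "*" group.
    """
    return groups.get(user_agent_token.lower(), groups["*"])
```

For example, `select_group("Googlebot")` returns only the `Allow /video/*` rule, `select_group("ClaudeBot")` returns `Disallow /`, and an unlisted token such as `"examplebot"` falls back to the `*` rules.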

Other Records

Field Value
sitemap https://togesmp.pages.dev/sitemap.xml