smpcolmek.pages.dev
robots.txt

Robots Exclusion Standard data for smpcolmek.pages.dev

Resource Scan

Scan Details

Site Domain smpcolmek.pages.dev
Base Domain smpcolmek.pages.dev
Scan Status Ok
Last Scan2026-02-26T12:34:30+00:00
Next Scan 2026-03-28T12:34:30+00:00

Last Scan

Scanned2026-02-26T12:34:30+00:00
URL https://smpcolmek.pages.dev/robots.txt
Domain IPs 172.66.45.18, 172.66.46.238, 2606:4700:310c::ac42:2d12, 2606:4700:310c::ac42:2eee
Response IP 172.66.45.18
Found Yes
Hash 33a62994c0eb7f97368f23a29cbc0d09e36f03fbcdfbdf66d239dde3f710a739
SimHash 491dd941e543

Groups

*

Rule Path
Disallow /video/*
Disallow /?s=*
Disallow /?q=*
Disallow /search/*
Disallow /?page=*
Allow /
Allow /category/

googlebot

Rule Path
Allow /
Allow /video/*

bingbot

Rule Path
Allow /
Allow /video/*

yandexbot

Rule Path
Allow /
Allow /video/*

baiduspider

Rule Path
Allow /
Allow /video/*

duckduckbot

Rule Path
Allow /
Allow /video/*

applebot

Rule Path
Allow /
Allow /video/*

sogou spider

Rule Path
Allow /
Allow /video/*

yahoo slurp

Rule Path
Allow /
Allow /video/*

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://smpcolmek.pages.dev/sitemap.xml