ngentotcewekcantik.pages.dev
robots.txt

Robots Exclusion Standard data for ngentotcewekcantik.pages.dev

Resource Scan

Scan Details

Site Domain ngentotcewekcantik.pages.dev
Base Domain ngentotcewekcantik.pages.dev
Scan Status Ok
Last Scan 2025-12-29T16:04:03+00:00
Next Scan 2026-01-28T16:04:03+00:00

Last Scan

Scanned 2025-12-29T16:04:03+00:00
URL https://ngentotcewekcantik.pages.dev/robots.txt
Domain IPs 172.66.44.118, 172.66.47.138, 2606:4700:310c::ac42:2c76, 2606:4700:310c::ac42:2f8a
Response IP 172.66.44.118
Found Yes
Hash 43ccaed17d6eea5f38056b7598a440cb8ae1aed047eaa3ebcb25c38718e0cd87
SimHash 491dd941e5c3

Groups

*

Rule Path
Disallow /video/*
Disallow /?s=*
Disallow /?q=*
Disallow /search/*
Disallow /?page=*
Allow /
Allow /category/
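
The rules in the * group above can be evaluated with a small matcher. The sketch below assumes RFC 9309 semantics: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names STAR_GROUP, pattern_to_regex, and is_allowed are illustrative and not part of the scan; the scanner's own evaluation logic is not shown in this report.

```python
import re

# Rules of the "*" group, copied from the table above as (rule, path) pairs.
STAR_GROUP = [
    ("disallow", "/video/*"),
    ("disallow", "/?s=*"),
    ("disallow", "/?q=*"),
    ("disallow", "/search/*"),
    ("disallow", "/?page=*"),
    ("allow", "/"),
    ("allow", "/category/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules` (longest match wins)."""
    best_kind, best_len = "allow", -1  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Prefer the longer pattern; on equal length prefer Allow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_kind, best_len = kind, length
    return best_kind == "allow"


if __name__ == "__main__":
    for sample in ["/", "/category/news", "/video/abc", "/?s=term", "/search/x"]:
        print(sample, is_allowed(STAR_GROUP, sample))
```

Under these assumptions, Allow / does not override the more specific Disallow lines, because the pattern "/" is shorter than "/video/*" or "/?s=*". It only covers paths that no Disallow pattern matches.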

googlebot

Rule Path
Allow /video/*

bingbot

Rule Path
Allow /video/*

yandexbot

Rule Path
Allow /video/*

baiduspider

Rule Path
Allow /video/*

duckduckbot

Rule Path
Allow /video/*

applebot

Rule Path
Allow /video/*

sogou spider

Rule Path
Allow /video/*

yahoo slurp

Rule Path
Allow /video/*

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /
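
How a crawler picks one of the groups listed above can be sketched as a lookup. This assumes RFC 9309 behaviour: the product token is compared case-insensitively, a crawler obeys only the group that names it, and the * group applies only when no named group matches. The names GROUPS, select_group, and is_fully_blocked are hypothetical helpers for illustration. Exact-token matching is a simplification, since real crawlers may match tokens differently.

```python
# Groups transcribed from the tables above; each value is a list of (rule, path).
ALLOW_VIDEO_AGENTS = [
    "googlebot", "bingbot", "yandexbot", "baiduspider",
    "duckduckbot", "applebot", "sogou spider", "yahoo slurp",
]
BLOCKED_AGENTS = [
    "gptbot", "amazonbot", "applebot-extended", "meta-externalagent",
    "bytespider", "ccbot", "claudebot",
]

GROUPS = {
    "*": [
        ("disallow", "/video/*"),
        ("disallow", "/?s=*"),
        ("disallow", "/?q=*"),
        ("disallow", "/search/*"),
        ("disallow", "/?page=*"),
        ("allow", "/"),
        ("allow", "/category/"),
    ],
}
GROUPS.update({agent: [("allow", "/video/*")] for agent in ALLOW_VIDEO_AGENTS})
GROUPS.update({agent: [("disallow", "/")] for agent in BLOCKED_AGENTS})


def select_group(product_token):
    """Return the rules for a crawler: its own group if present, else the "*" group."""
    return GROUPS.get(product_token.lower(), GROUPS["*"])


def is_fully_blocked(product_token):
    """True when the selected group consists only of a site-wide Disallow."""
    return select_group(product_token) == [("disallow", "/")]


if __name__ == "__main__":
    for token in ["Googlebot", "GPTBot", "Applebot-Extended", "SomeOtherBot"]:
        print(token, select_group(token))
```

One consequence under these assumptions: the crawlers with their own Allow /video/* group are not bound by the Disallow lines in the * group, because rules from different groups are not merged. With no Disallow rule of their own, nothing on the site is off-limits to them.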

Other Records

Field Value
sitemap https://ngentotcewekcantik.pages.dev/sitemap.xml