webtwodirectory.com
robots.txt

Robots Exclusion Standard data for webtwodirectory.com

Resource Scan

Scan Details

Site Domain webtwodirectory.com
Base Domain webtwodirectory.com
Scan Status Ok
Last Scan2025-07-13T09:25:49+00:00
Next Scan 2025-07-20T09:25:49+00:00

Last Scan

Scanned2025-07-13T09:25:49+00:00
URL https://webtwodirectory.com/robots.txt
Domain IPs 192.250.231.20
Response IP 192.250.231.20
Found Yes
Hash 526800f8869586e53dcd1a95b48426186eeb25c8a566945f14456d8e01dd31cf
SimHash 7d1ed940e493

Groups

*

Rule Path
Allow /
Disallow /*/manage
Disallow /Identity/
Disallow /portal/

googlebot

Rule Path
Allow /

google-extended

Rule Path
Allow /

perplexitybot

Rule Path
Allow /en-us/blog/
Disallow /

obot

Rule Path
Disallow /

nbot

Rule Path
Disallow /

facebot

Rule Path
Allow /en-us/blog/
Disallow /

claudebot

Rule Path
Allow /en-us/blog/
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack

Rule Path
Disallow /

wget

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://webtwodirectory.com/sitemap_index.xml