webtwodirectory.com
robots.txt

Robots Exclusion Standard data for webtwodirectory.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	webtwodirectory.com
Base Domain	webtwodirectory.com
Scan Status	Ok
Last Scan	2025-07-13T09:25:49+00:00
Next Scan	2025-07-20T09:25:49+00:00

Last Scan

Scanned	2025-07-13T09:25:49+00:00
URL	https://webtwodirectory.com/robots.txt
Domain IPs	192.250.231.20
Response IP	192.250.231.20
Found	Yes
Hash	526800f8869586e53dcd1a95b48426186eeb25c8a566945f14456d8e01dd31cf
SimHash	7d1ed940e493

Groups

*

Rule	Path
Allow	/
Disallow	/*/manage
Disallow	/Identity/
Disallow	/portal/

Rule

Path

Allow

Disallow

/*/manage

Disallow

/Identity/

Disallow

/portal/

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

perplexitybot

Rule	Path
Allow	/en-us/blog/
Disallow	/

Rule

Path

Allow

/en-us/blog/

Disallow

obot

Rule	Path
Disallow	/

Rule

Path

Disallow

nbot

Rule	Path
Disallow	/

Rule

Path

Disallow

facebot

Rule	Path
Allow	/en-us/blog/
Disallow	/

Rule

Path

Allow

/en-us/blog/

Disallow

claudebot

Rule	Path
Allow	/en-us/blog/
Disallow	/

Rule

Path

Allow

/en-us/blog/

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

wget

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://webtwodirectory.com/sitemap_index.xml

Field

Value

sitemap

https://webtwodirectory.com/sitemap_index.xml

webtwodirectory.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot

google-extended

perplexitybot

obot

nbot

facebot

claudebot

ccbot

gptbot

chatgpt-user

anthropic-ai

claude-web

httrack

httrack

wget

mj12bot

seznambot

dotbot

blexbot

Other Records

webtwodirectory.com
robots.txt