webtreeonline.com
robots.txt

Robots Exclusion Standard data for webtreeonline.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	webtreeonline.com
Base Domain	webtreeonline.com
Scan Status	Ok
Last Scan	2025-11-03T17:38:49+00:00
Next Scan	2025-12-03T17:38:49+00:00

Last Scan

Scanned	2025-11-03T17:38:49+00:00
URL	https://webtreeonline.com/robots.txt
Domain IPs	104.21.95.159, 172.67.145.233, 2606:4700:3036::ac43:91e9, 2606:4700:3037::6815:5f9f
Response IP	104.21.95.159
Found	Yes
Hash	da91ef874a0fa3288a38eb385ecb8918d2d41d42fd05eeedbbb2e7d411e60e28
SimHash	70389e02c732

Groups

*

Rule	Path
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

oai-searchbot

Rule	Path
Allow	/

Rule

Path

Allow

/

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

/

claudebot

Rule	Path
Allow	/

Rule

Path

Allow

/

perplexitybot

Rule	Path
Allow	/

Rule

Path

Allow

/

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

/

ccbot

Rule	Path
Allow	/

Rule

Path

Allow

/

facebookbot

Rule	Path
Allow	/

Rule

Path

Allow

/

applebot

Rule	Path
Allow	/

Rule

Path

Allow

/

bytespider

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	https://www.webtreeonline.com/sitemap_index.xml

Field

Value

sitemap

https://www.webtreeonline.com/sitemap_index.xml

Back to top

Comments

Default WordPress rules
Allow all major chatbot and AI crawlers
OpenAI (Search + Training)
Anthropic (Claude)
Perplexity
Google AI (Bard/Gemini)
Common AI research crawler
Facebook / Meta AI
AppleBot (used in Apple AI features & Siri)
Bytedance / TikTok AI

Back to top

webtreeonline.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

oai-searchbot

gptbot

claudebot

perplexitybot

google-extended

ccbot

facebookbot

applebot

bytespider

Other Records

Comments

webtreeonline.com
robots.txt