tweecampus.com
robots.txt

Robots Exclusion Standard data for tweecampus.com

Resource Scan

Scan Details

Site Domain	tweecampus.com
Base Domain	tweecampus.com
Scan Status	Ok
Last Scan	2025-12-12T14:50:02+00:00
Next Scan	2026-01-11T14:50:02+00:00

Last Scan

Scanned	2025-12-12T14:50:02+00:00
URL	https://tweecampus.com/robots.txt
Domain IPs	104.21.38.187, 172.67.137.94, 2606:4700:3032::ac43:895e, 2606:4700:3037::6815:26bb
Response IP	104.21.38.187
Found	Yes
Hash	1da3cef70c61248c901740e6ff82af75049ba7240bd74981fbb2ec8b95b122e5
SimHash	6686c631e5a3
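
The report does not name the algorithm behind the `Hash` field. Its 64 hexadecimal digits are consistent with a SHA-256 digest of the fetched response body, so the following is a minimal sketch under that assumption. `content_fingerprint` and `matches_recorded` are hypothetical helper names, not part of the scanning service.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded_hash: str) -> bool:
    """Compare a freshly fetched body against a hash recorded by an earlier scan."""
    return content_fingerprint(body) == recorded_hash.strip().lower()


# Hash value copied from the scan details above.
RECORDED_HASH = "1da3cef70c61248c901740e6ff82af75049ba7240bd74981fbb2ec8b95b122e5"
```

Passing the raw bytes of a new fetch to `matches_recorded` together with `RECORDED_HASH` would indicate whether the file has changed since the scan, provided the assumption about the algorithm, and about hashing the unmodified body, actually holds.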

Groups

*

Rule	Path
Allow	/

googlebot

Rule	Path
Allow	/

bingbot

Rule	Path
Allow	/

slurp

Rule	Path
Allow	/

duckduckbot

Rule	Path
Allow	/

baiduspider

Rule	Path
Allow	/

yandexbot

Rule	Path
Allow	/

facebookexternalhit

Rule	Path
Allow	/

twitterbot

Rule	Path
Allow	/

linkedinbot

Rule	Path
Allow	/

whatsapp

Rule	Path
Allow	/

applebot

Rule	Path
Allow	/

Other Records

Field	Value
crawl-delay	1
sitemap	https://tweecampus.com/sitemap.xml

Comments

Allow all search engines to crawl the site
Sitemap location
Crawl-delay for respectful crawling
Host directive

Warnings

`host` is not a known field.
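
To see how a standard parser treats records like these, the sketch below feeds a reconstructed file to Python's `urllib.robotparser`. The file text is an assumption pieced together from the groups, other records, and comments listed above: only two of the twelve groups are shown, and the `Host` value and the ordering of lines are guesses. `SAMPLE_ROBOTS_TXT` and `summarize` are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction for illustration, NOT the actual file served by the site.
SAMPLE_ROBOTS_TXT = """\
# Allow all search engines to crawl the site
User-agent: *
Allow: /

User-agent: Googlebot
Allow: /

# Sitemap location
Sitemap: https://tweecampus.com/sitemap.xml

# Crawl-delay for respectful crawling
Crawl-delay: 1

# Host directive
Host: tweecampus.com
"""


def summarize(robots_text: str) -> dict:
    """Parse robots.txt text and report what the parser concludes."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    url = "https://tweecampus.com/"
    return {
        # A crawler with its own group.
        "googlebot_allowed": parser.can_fetch("Googlebot", url),
        # A crawler that falls back to the "*" group.
        "other_bot_allowed": parser.can_fetch("ExampleBot", url),
        # site_maps() is available from Python 3.8.
        "sitemaps": parser.site_maps(),
        # Delay attached to the Googlebot group, if any.
        "crawl_delay": parser.crawl_delay("Googlebot"),
    }


if __name__ == "__main__":
    print(summarize(SAMPLE_ROBOTS_TXT))
```

In this assumed layout the `Crawl-delay` line sits outside any group, so `urllib.robotparser` attaches it to no user agent and `crawl_delay` returns `None`. That would be consistent with the report listing the field under Other Records rather than under a group. The `Host` line is silently skipped by this parser, in line with the warning above.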
