joycolumbus.com
robots.txt

Robots Exclusion Standard data for joycolumbus.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	joycolumbus.com
Base Domain	joycolumbus.com
Scan Status	Ok
Last Scan	2024-11-08T12:42:35+00:00
Next Scan	2024-11-15T12:42:35+00:00

Last Scan

Scanned	2024-11-08T12:42:35+00:00
URL	https://joycolumbus.com/robots.txt
Domain IPs	192.0.66.208
Response IP	192.0.66.208
Found	Yes
Hash	b1c20e2f9cbca21aacb86abae85d7ad2922cc9eeacded6cecd8d1d137fda0edd
SimHash	701c5940a113

Groups

*

Rule	Path
Disallow
Disallow	/?s=*

Rule

Path

Disallow

/?s=*

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

diffbot

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

perplexitybot

Rule	Path
Disallow	/

Rule

Path

Disallow

youbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://joycolumbus.com/sitemap.xml
sitemap	https://joycolumbus.com/news-sitemap.xml

Field

Value

sitemap

https://joycolumbus.com/sitemap.xml

sitemap

https://joycolumbus.com/news-sitemap.xml

joycolumbus.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

gptbot

google-extended

amazonbot

applebot

anthropic-ai

bytespider

ccbot

chatgpt-user

claudebot

claude-web

diffbot

facebookbot

imagesiftbot

omgilibot

omgili

perplexitybot

youbot

Other Records

joycolumbus.com
robots.txt