mrcrickethockey.com
robots.txt

Robots Exclusion Standard data for mrcrickethockey.com

Resource Scan

Scan Details

Site Domain	mrcrickethockey.com
Base Domain	mrcrickethockey.com
Scan Status	Ok
Last Scan	2024-09-16T08:04:47+00:00
Next Scan	2024-10-16T08:04:47+00:00

Last Scan

Scanned	2024-09-16T08:04:47+00:00
URL	https://mrcrickethockey.com/robots.txt
Redirect	https://www.mrcrickethockey.com/robots.txt
Redirect Domain	www.mrcrickethockey.com
Redirect Base	mrcrickethockey.com
Domain IPs	192.124.249.188
Redirect IPs	192.124.249.188
Response IP	192.124.249.188
Found	Yes
Hash	86468b205d6cf519e8a0a0e8cb7a2f8c69f53739ba6bb42b0176c37355ada27a
SimHash	cce6c94176a1

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/cgi-bin/
Disallow	/trackback/
Disallow	/xmlrpc.php
Disallow	/wp-login.php
Disallow	/wp-signup.php
Disallow	/?s=*
Disallow	/search/
Disallow	/search/*
Disallow	/wp-json/
Disallow	/*/trackback/
Disallow	/*/comments/
Disallow	/*?add-to-cart=*
Disallow	/*?orderby=*
Disallow	/*?filter_*
Disallow	/cdn-cgi/bm/cv/
Disallow	/cdn-cgi/challenge-platform/
Allow	/wp-content/uploads/
Allow	/wp-content/cache/

ahrefsbot
semrushbot
mj12bot
dotbot
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule	Path
Disallow	/
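
The grouping above can be sketched as a lookup from a crawler's product token to the group it would follow. This is a minimal sketch, assuming case-insensitive exact matching of the token with a fallback to the `*` group; the names `BLOCKED_AGENTS`, `FEED_RESTRICTED`, and `select_group` are hypothetical and are not part of the scan data.

```python
# Agent tokens copied from the groups in this report.
BLOCKED_AGENTS = {
    "ahrefsbot", "semrushbot", "mj12bot", "dotbot", "nuclei", "wikido",
    "riddler", "petalbot", "zoominfobot", "go-http-client",
    "node/simplecrawler", "cazoodlebot", "dotbot/1.0", "gigabot",
    "barkrowler", "blexbot", "magpie-crawler",
}
FEED_RESTRICTED = {"googlebot", "bingbot", "slurp"}


def select_group(product_token: str) -> str:
    """Return a label for the group a crawler with this token would follow.

    Assumption: tokens are compared case-insensitively and must match exactly;
    a crawler that is not named anywhere falls back to the `*` group.
    """
    token = product_token.lower()
    if token in BLOCKED_AGENTS:
        return "blocked"          # group with `Disallow: /`
    if token in FEED_RESTRICTED:
        return "feed-restricted"  # group with `Disallow: /*/feed/`
    return "*"


if __name__ == "__main__":
    for name in ("AhrefsBot", "Googlebot", "DuckDuckBot"):
        print(name, "->", select_group(name))
```

Real crawlers may match tokens more loosely (for example by prefix), so the exact-match rule here is a simplification.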

googlebot

Rule	Path
Disallow	/*/feed/

bingbot

Rule	Path
Disallow	/*/feed/

slurp

Rule	Path
Disallow	/*/feed/

*

Rule	Path
Allow	/*/feed/
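
How the wildcard paths in these groups interact can be illustrated with a small matcher. This is a sketch under stated assumptions, not the scanner's or any crawler's actual implementation: it assumes `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie. The rule lists are a hand-picked subset of the groups above, and the names `pattern_to_regex`, `is_allowed`, `GOOGLEBOT_RULES`, and `STAR_RULES` are hypothetical.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled prefix regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# Subset of the rules listed in this report (assumed grouping).
GOOGLEBOT_RULES = [("disallow", "/*/feed/")]
STAR_RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/search/"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/*/feed/"),  # from the second `*` group
]

if __name__ == "__main__":
    print(is_allowed("/news/feed/", GOOGLEBOT_RULES))       # feed blocked for googlebot
    print(is_allowed("/news/feed/", STAR_RULES))            # feed allowed for other agents
    print(is_allowed("/wp-admin/options.php", STAR_RULES))  # admin blocked under `*`
```

If a crawler follows only the group that names it, as this sketch assumes, the global `*` disallows would not apply to googlebot, bingbot, or slurp; whether that is intended cannot be determined from the scan data.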

Other Records

Field	Value
sitemap	https://www.mrcrickethockey.com/sitemap_index.xml
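
A record like the one above can be read out of a robots.txt body with Python's standard library. This is a minimal sketch: the `SAMPLE` text is an abbreviated, assumed stand-in for the real file (only the sitemap URL is taken from this report), and `sitemaps_from_text` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated, assumed sample; not the full file served by the site.
SAMPLE = """\
User-agent: *
Disallow: /wp-admin/
Sitemap: https://www.mrcrickethockey.com/sitemap_index.xml
"""


def sitemaps_from_text(text: str) -> list[str]:
    """Return the Sitemap URLs declared in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    # site_maps() returns None when no Sitemap line is present (Python 3.8+).
    return parser.site_maps() or []


if __name__ == "__main__":
    print(sitemaps_from_text(SAMPLE))
```

Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so it is used here only for the sitemap record.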

Comments

Global rules
-----------------
Prevent crawling CF challenge URLs
Allow access to necessary assets
Sitemap
-----------------
Ban bots that don't benefit us.
--------------------------------
Block feeds for search engines to reduce server load
-------------------------------
Allow feeds for other user agents
-----------------------------------
