helpcrunch.com
robots.txt

Robots Exclusion Standard data for helpcrunch.com

Resource Scan

Scan Details

Site Domain helpcrunch.com
Base Domain helpcrunch.com
Scan Status Ok
Last Scan2024-06-03T21:10:23+00:00
Next Scan 2024-06-17T21:10:23+00:00

Last Scan

Scanned2024-06-03T21:10:23+00:00
URL https://helpcrunch.com/robots.txt
Domain IPs 104.26.0.83, 104.26.1.83, 172.67.72.244, 2606:4700:20::681a:153, 2606:4700:20::681a:53, 2606:4700:20::ac43:48f4
Response IP 172.67.72.244
Found Yes
Hash 985fa39169e1b3b00e485505fb411a6566a7f1d7d51a2a64d092c8f4fd9b482c
SimHash 4d612f5107b1

Groups

adsbot-google

Rule Path
Allow /lp/
Allow /uk/lp/
Allow /ru/lp/

adsbot-google-mobile

Rule Path
Allow /lp/
Allow /uk/lp/
Allow /ru/lp/

*

Rule Path
Allow /wp-content/uploads/
Disallow /blog/wp-content/plugins/
Disallow /blog/wp-admin/
Disallow /blog/readme.html
Disallow /blog/license.txt
Disallow /blog/.gitignore
Disallow /blog/composer*
Disallow /case/wp-content/plugins/
Disallow /case/wp-admin/
Disallow /case/readme.html
Disallow /case/license.txt
Disallow /case/.gitignore
Disallow /case/composer*
Disallow /lp/
Disallow /uk/lp/
Disallow /ru/lp/

Other Records

Field Value
sitemap https://helpcrunch.com/sitemap.xml

Warnings

  • `host` is not a known field.