numberock.com
robots.txt

Robots Exclusion Standard data for numberock.com

Resource Scan

Scan Details

Site Domain numberock.com
Base Domain numberock.com
Scan Status Ok
Last Scan2024-08-31T09:07:23+00:00
Next Scan 2024-09-30T09:07:23+00:00

Last Scan

Scanned2024-08-31T09:07:23+00:00
URL https://numberock.com/robots.txt
Domain IPs 104.26.14.136, 104.26.15.136, 172.67.73.125, 2606:4700:20::681a:e88, 2606:4700:20::681a:f88, 2606:4700:20::ac43:497d
Response IP 172.67.73.125
Found Yes
Hash 4ebde58e38d61560b0a121887a990e2af2a96e7653f7d0fff9dd923db192ada2
SimHash 2bb00fa567a0

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /wp-json/
Disallow /blog/
Disallow /xmlrpc.php

googlebot

Rule Path
Allow /

ninjabot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
sitemap https://numberock.com/sitemap.xml
sitemap https://numberock.com/ror.xml
sitemap https://numberock.com/sitemap_images.xml

Comments

  • ****************************************************************************
  • robots.txt
  • : Robots, spiders, and search engines use this file to determine which
  • content they should *not* crawl while indexing your website.
  • : This system is called "The Standard Robots Exclusion."
  • ****************************************************************************