Robots Exclusion Standard data for smgtechreviews.wpcomstaging.com

Resource Scan

Scan Details

Site Domain smgtechreviews.wpcomstaging.com
Base Domain wpcomstaging.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-05-06T11:08:55+00:00
Next Scan 2024-08-04T11:08:55+00:00

Last Successful Scan

Scanned 2023-06-19T10:37:05+00:00
URL https://smgtechreviews.wpcomstaging.com/robots.txt
Domain IPs 192.0.78.20
Response IP 192.0.78.20
Found Yes
Hash 15f863e862ec834525b0fe6c08dc68b9b29306cfa76b49d30103c9b280a53c8d
SimHash a39e11600fb2

Groups

*

Rule Path
Allow /
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /readme.html$
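
The four rules in the `*` group overlap, so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers for illustration and are not part of the scan data; individual crawlers may resolve these rules differently.

```python
import re

# Rules of the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/readme.html$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the pattern to the end of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/wp-admin/", "/wp-admin/admin-ajax.php",
              "/readme.html", "/readme.html?x=1"]:
        print(p, is_allowed(p))
```

Under this assumption, `/wp-admin/admin-ajax.php` stays reachable even though `/wp-admin/` is disallowed, and `/readme.html` is blocked only when the path ends exactly there, because of the trailing `$`.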

mediapartners-google

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

adsbot-google

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

proximic

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

criteobot/0.1

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

serpstatbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

dataforseobot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

omgilibot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

barkrowler

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
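
The groups above define no path rules and only set `crawl-delay`, which is a non-standard field that some crawlers interpret as a minimum number of seconds between requests. As a rough sketch of how a polite client might honor these values, assuming case-insensitive substring matching of the group token against the User-Agent string (an assumption, since crawlers differ on matching), the hypothetical helpers `crawl_delay` and `next_fetch_time` below look up the delay and compute the earliest time for the next request.

```python
# Delays in seconds, copied from the crawl-delay records above.
CRAWL_DELAYS = {
    "mediapartners-google": 1,
    "adsbot-google": 1,
    "proximic": 1,
    "criteobot/0.1": 1,
    "grapeshot": 1,
    "ahrefsbot": 5,
    "semrushbot": 5,
    "mj12bot": 5,
    "dotbot": 5,
    "blexbot": 5,
    "serpstatbot": 5,
    "dataforseobot": 5,
    "omgilibot": 5,
    "barkrowler": 5,
}


def crawl_delay(user_agent, default=0.0):
    """Return the delay in seconds for a User-Agent, or `default` if unlisted.

    Assumption: a group applies when its token appears anywhere in the
    lowercased User-Agent string.
    """
    ua = user_agent.lower()
    for token, delay in CRAWL_DELAYS.items():
        if token in ua:
            return float(delay)
    return default


def next_fetch_time(last_fetch, user_agent):
    """Earliest timestamp (seconds) at which the next request may be sent."""
    return last_fetch + crawl_delay(user_agent)


if __name__ == "__main__":
    print(crawl_delay("Mozilla/5.0 (compatible; AhrefsBot/7.0)"))
    print(next_fetch_time(100.0, "Mediapartners-Google"))
```

The ad-network crawlers are given a 1-second delay and the link analyzers a 5-second delay, which matches the "Ad networks" and "Link analyzers" grouping named in the file's comments.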

webreaper

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

awariobot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

Comments

  • BEGIN Magic robots.txt
  • ---------------------------
  • General
  • Ad networks
  • Link analyzers
  • Downloaders
  • ---------------------------
  • END Magic robots.txt