howtolovecomics.com
robots.txt

Robots Exclusion Standard data for howtolovecomics.com

Resource Scan

Scan Details

Site Domain howtolovecomics.com
Base Domain howtolovecomics.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-12-02T06:20:15+00:00
Next Scan 2026-01-01T06:20:15+00:00

Last Successful Scan

Scanned2025-11-02T00:48:42+00:00
URL https://howtolovecomics.com/robots.txt
Domain IPs 104.21.33.110, 172.67.161.225, 2606:4700:3035::6815:216e, 2606:4700:3037::ac43:a1e1
Response IP 104.21.33.110
Found Yes
Hash d6b3d040f14320094f7201683a2c9e612fc0875617d5dfe02b0235a898f07ae5
SimHash 420c4988e13f

Groups

*

Rule Path
Disallow /wp-content/uploads/wpo-plugins-tables-list.json

criteobot/0.1

Rule Path
Disallow /

ias-or/3.3

Rule Path
Disallow /

ias-va/3.3

Rule Path
Disallow /

ias-sg/3.3

Rule Path
Disallow /

ias-ie/3.3

Rule Path
Disallow /

ias-jp/3.3

Rule Path
Disallow /

ias-au/3.3

Rule Path
Disallow /

ahc/2.1

Rule Path
Disallow /

omgili/0.5

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

proximic

Rule Path
Disallow /

um-ln

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

mozlila/5.0

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

trafficbot.live

Rule Path
Disallow /

bot-traffic.icu

Rule Path
Disallow /

bottraffic.live

Rule Path
Disallow /

gammatraffic.com

Rule Path
Disallow /

trafficmarket.me

Rule Path
Disallow /

extratraffic.shop

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.howtolovecomics.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK