blissfulglutton.com
robots.txt

Robots Exclusion Standard data for blissfulglutton.com

Resource Scan

Scan Details

Site Domain blissfulglutton.com
Base Domain blissfulglutton.com
Scan Status Ok
Last Scan2024-11-14T23:54:39+00:00
Next Scan 2024-12-14T23:54:39+00:00

Last Scan

Scanned2024-11-14T23:54:39+00:00
URL https://blissfulglutton.com/robots.txt
Redirect https://xoilactv3.asia/robots.txt
Redirect Domain xoilactv3.asia
Redirect Base xoilactv3.asia
Domain IPs 104.21.95.99, 172.67.144.26, 2606:4700:3033::ac43:901a, 2606:4700:3034::6815:5f63
Redirect IPs 104.18.8.109, 104.18.9.109, 2606:4700::6812:86d, 2606:4700::6812:96d
Response IP 104.18.9.109
Found Yes
Hash 49ef553ccfc93c91a7feb95dba03e97934836d1cb78d6296b68c1808b5de90f3
SimHash 9a4f70c26911

Groups

ia_archiver

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/