hauterrfly.com
robots.txt

Robots Exclusion Standard data for hauterrfly.com

Resource Scan

Scan Details

Site Domain hauterrfly.com
Base Domain hauterrfly.com
Scan Status Ok
Last Scan2024-05-25T18:09:13+00:00
Next Scan 2024-06-01T18:09:13+00:00

Last Scan

Scanned2024-05-25T18:09:13+00:00
URL https://hauterrfly.com/robots.txt
Domain IPs 13.35.18.123, 13.35.18.128, 13.35.18.46, 13.35.18.85
Response IP 13.35.18.123
Found Yes
Hash a8dfedb6acefb2e7e77fab601235d5567acddfd4b631265cdcb770bbde5912d5
SimHash 48246249a233

Groups

*

Rule Path
Allow /
Disallow */feed$
Disallow */?s=*
Disallow /wp-admin/
Disallow /wp-content/plugins/
Disallow /events/
Disallow /wp-content/sitemaps/pagination-sitemap.xml
Disallow /page/
Disallow */2018/*
Disallow */2019/*
Disallow */2015/*
Disallow */2017/*
Disallow */2016/*
Disallow */?q=%2F*
Disallow */webp-express/*
Disallow */embed/?embed=true
Disallow */web-stories/page/*
Disallow */search/*
Disallow */embed/*
Disallow */page/*
Disallow */Kinjal
Disallow */undefined/*
Disallow */about%3Ablank*
Disallow */podcast/*
Disallow /xmlrpc.php
Disallow /*sex*
Disallow /tag/sex/
Disallow /home/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://hauterrfly.com/post-sitemap1.xml
sitemap https://hauterrfly.com/post-sitemap2.xml
sitemap https://hauterrfly.com/post-sitemap3.xml
sitemap https://hauterrfly.com/post-sitemap4.xml
sitemap https://hauterrfly.com/post-sitemap5.xml
sitemap https://hauterrfly.com/page-sitemap.xml
sitemap https://hauterrfly.com/category-sitemap.xml
sitemap https://hauterrfly.com/feed/podcast/the-tits-bits