pupulani.net
robots.txt

Robots Exclusion Standard data for pupulani.net

Resource Scan

Scan Details

Site Domain pupulani.net
Base Domain pupulani.net
Scan Status Ok
Last Scan2026-01-23T01:23:14+00:00
Next Scan 2026-02-06T01:23:14+00:00

Last Scan

Scanned2026-01-23T01:23:14+00:00
URL https://www.pupulani.net/robots.txt
Domain IPs 104.18.12.81, 104.18.13.81
Response IP 104.18.13.81
Found Yes
Hash 9bfd4c2cb245c432e9541bf367f8ac6e815d12ba121e7311ec41df221682b262
SimHash c01ccc00dfd3

Groups

thesis-research-bot
fidget-spinner-bot
my-tiny-bot
semrushbot
ahrefsbot
dotbot
mj12bot
amazonbot
go-http-client
geedoproductsearch
python-requests
blexbot
aiohttp
serankingbacklinksbot

Rule Path
Disallow /

bingbot

Rule Path
Allow /
Disallow /cart/
Disallow /web_cart/
Disallow /shops/
Disallow /en/shops/
Disallow /api/shops/
Disallow /illegal_reports/report/

Other Records

Field Value
crawl-delay 300

*

Rule Path
Allow /
Disallow /cart/
Disallow /web_cart/
Disallow /shops/
Disallow /en/shops/
Disallow /api/shops/
Disallow /illegal_reports/report/

Other Records

Field Value
sitemap https://www.pupulani.net/sitemap.xml