newdag.com
robots.txt

Robots Exclusion Standard data for newdag.com

Resource Scan

Scan Details

Site Domain newdag.com
Base Domain newdag.com
Scan Status Ok
Last Scan2025-11-22T08:51:54+00:00
Next Scan 2025-12-22T08:51:54+00:00

Last Scan

Scanned2025-11-22T08:51:54+00:00
URL https://newdag.com/robots.txt
Domain IPs 172.66.1.12
Response IP 172.66.1.12
Found Yes
Hash dde51f310c76dc466bcf20e7c148e6c64a3dd6dc1cca3b391a4148911f24a945
SimHash 2710de62ca31

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /account
Disallow /account/*
Disallow /_next/data/*

adsbot-google

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /account
Disallow /account/*
Disallow /_next/data/*

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /account
Disallow /account/*
Disallow /_next/data/*

ahrefssiteaudit

Rule Path
Disallow /admin
Disallow /cart
Disallow /checkout
Disallow /account
Disallow /account/*
Disallow /_next/data/*

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://newdag.com/sitemap.xml
sitemap https://newdag.com/sitemap.xml
sitemap https://newdag.com/sitemap.xml