dig-in.com
robots.txt

Robots Exclusion Standard data for dig-in.com

Resource Scan

Scan Details

Site Domain dig-in.com
Base Domain dig-in.com
Scan Status Ok
Last Scan 2025-09-26T06:25:31+00:00
Next Scan 2025-10-03T06:25:31+00:00

Last Scan

Scanned 2025-09-26T06:25:31+00:00
URL https://dig-in.com/robots.txt
Redirect https://www.dig-in.com/robots.txt
Redirect Domain www.dig-in.com
Redirect Base dig-in.com
Domain IPs 15.197.254.45
Redirect IPs 13.33.45.120, 13.33.45.123, 13.33.45.87, 13.33.45.96
Response IP 13.33.45.87
Found Yes
Hash 23e31b9cc83baaa13ddb398d0bb823eb628f50da9ec22448bfe18b8fda4f05bc
SimHash 2905165576e3

Groups

*

Rule Path
Disallow /cms/
Disallow /_debug/
Allow /authors/
Disallow /search
Disallow /auth/
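
The rules in the `*` group can be checked against sample paths with Python's standard-library `urllib.robotparser`. The following is a minimal sketch, not the site's actual file: the `User-agent`/`Disallow`/`Allow` layout is reconstructed from the table above, and the agent name `examplebot` and the test paths are hypothetical. Note that `urllib.robotparser` applies the first matching rule in file order, which may differ from crawlers that use longest-match precedence.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the "*" group in the scan; the file layout is reconstructed.
WILDCARD_GROUP = """\
User-agent: *
Disallow: /cms/
Disallow: /_debug/
Allow: /authors/
Disallow: /search
Disallow: /auth/
"""


def build_parser(text):
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(WILDCARD_GROUP)
BASE = "https://www.dig-in.com"

# Hypothetical paths, chosen only to exercise each kind of rule.
for path in ("/cms/drafts", "/authors/jane", "/search?q=x", "/about"):
    print(path, parser.can_fetch("examplebot", BASE + path))
```

A path that matches no rule, such as the hypothetical `/about`, is treated as allowed. `/authors/` does not fall under the `/auth/` rule because rules are matched as path prefixes and `/authors/` does not begin with `/auth/`.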

page-crawler

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

semrushbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5
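
A crawler that chooses to honor `crawl-delay` could read the value as sketched below. This assumes the group looks like the reconstructed text in `SEMRUSH_GROUP`; the agent string `SemrushBot` passed to the parser is illustrative. `crawl-delay` is a non-standard extension, so whether and how a given crawler interprets it is up to that crawler.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "semrushbot" group in the scan (layout assumed).
SEMRUSH_GROUP = """\
User-agent: semrushbot
Allow: /
Crawl-delay: 5
"""

parser = RobotFileParser()
parser.parse(SEMRUSH_GROUP.splitlines())

# Agent matching in urllib.robotparser is case-insensitive.
delay = parser.crawl_delay("SemrushBot")
allowed = parser.can_fetch("SemrushBot", "https://www.dig-in.com/")

print("crawl delay (seconds):", delay)
print("root allowed:", allowed)
```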

semrushbot-sa

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

semrushbot-si

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

semrushbot-ba

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

blp_bbot/0.1

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

ccbot/2.0

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

sentibot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

ahrefsbot/5.2

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

baiduspider/2.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dig-in.com/sitemap.xml
sitemap https://www.dig-in.com/weekly-sitemap.xml
sitemap https://www.dig-in.com/news-sitemap.xml
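
The scan lists the sitemap records under the last group, but `Sitemap` lines in a robots.txt file are generally treated as independent of any user-agent group. A minimal sketch of extracting them with `urllib.robotparser` (the `site_maps()` method requires Python 3.8 or later; the `SITEMAP_LINES` text is reconstructed from the records above):

```python
from urllib.robotparser import RobotFileParser

# Sitemap records copied from the scan; the "Sitemap:" line layout is assumed.
SITEMAP_LINES = """\
Sitemap: https://www.dig-in.com/sitemap.xml
Sitemap: https://www.dig-in.com/weekly-sitemap.xml
Sitemap: https://www.dig-in.com/news-sitemap.xml
"""

parser = RobotFileParser()
parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns a list of URLs, or None when no Sitemap lines exist.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```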