idgcdn.com.au
robots.txt

Robots Exclusion Standard data for idgcdn.com.au

Resource Scan

Scan Details

Site Domain idgcdn.com.au
Base Domain idgcdn.com.au
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-08-24T16:49:55+00:00
Next Scan 2024-11-22T16:49:55+00:00

Last Successful Scan

Scanned2024-04-27T16:47:19+00:00
URL https://idgcdn.com.au/robots.txt
Redirect https://www.idgcdn.com.au/robots.txt
Redirect Domain www.idgcdn.com.au
Redirect Base idgcdn.com.au
Domain IPs 104.26.12.64, 104.26.13.64, 172.67.74.9, 2606:4700:20::681a:c40, 2606:4700:20::681a:d40, 2606:4700:20::ac43:4a09
Redirect IPs 104.26.12.64, 104.26.13.64, 172.67.74.9, 2606:4700:20::681a:c40, 2606:4700:20::681a:d40, 2606:4700:20::ac43:4a09
Response IP 172.67.74.9
Found Yes
Hash 8db77fc755985348a8659be1b1334ae09676b61f42ba62724ade54a561c437c6
SimHash ea7cde55aa13

Groups

*

Rule Path
Disallow /article/preview/
Disallow /8456/
Disallow /taxonomy/term/
Disallow /user/check_email_exists/
Disallow /shop/widget/

bingbot

Rule Path
Disallow /section/digital_cameras/products/
Disallow /section/mobile_phones/products/

Other Records

Field Value
crawl-delay 2

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

Other Records

Field Value
sitemap https://www.pcworld.idg.com.au/sitemap-index.xml