updates.internshala.com
robots.txt

Robots Exclusion Standard data for updates.internshala.com

Resource Scan

Scan Details

Site Domain updates.internshala.com
Base Domain internshala.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-03-25T06:21:35+00:00
Next Scan 2024-06-23T06:21:35+00:00

Last Successful Scan

Scanned2023-02-05T15:42:42+00:00
URL https://updates.internshala.com/robots.txt
Redirect https://internshala.com/robots.txt
Redirect Domain internshala.com
Redirect Base internshala.com
Domain IPs 15.206.118.4, 52.66.89.9
Redirect IPs 15.206.118.4, 52.66.89.9
Response IP 15.206.118.4
Found Yes
Hash 9fc43a078c1fe4dd871d0b48d491c9ec2adc47e6c93487e999d5d21141520b4a
SimHash 26105bf0e237

Groups

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 1

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

baiduspider-ads

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

adidxbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

bingpreview

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5