workrr.in
robots.txt

Robots Exclusion Standard data for workrr.in

Resource Scan

Scan Details

Site Domain workrr.in
Base Domain workrr.in
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-10-11T10:50:52+00:00
Next Scan 2025-10-18T10:50:52+00:00

Last Successful Scan

Scanned 2025-10-03T10:49:16+00:00
URL https://workrr.in/robots.txt
Domain IPs 2a02:4780:11:1120:0:1310:88c0:2, 89.117.157.237
Response IP 89.117.157.237
Found Yes
Hash ed66c170b8a2d660c69a3f8b03b356b955fde26c9b406ba6ff29c6bf0120274a
SimHash 460049f1e237

Groups

*

Rule Path
Allow /
Disallow /*%2C*
Disallow /feed
Disallow /*%3F*
Disallow /*%3D*
Disallow /*%3F*
Disallow /*%3D*
Disallow /ddos
Disallow /wp-admin/
Disallow /trackback/
Disallow /comments/
Disallow /resume/
Disallow /resume-skill/
Disallow /broken-page-url/
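
The rules in the "*" group mix a blanket Allow with wildcard Disallow patterns, so the outcome for a given path depends on which matching rule is most specific. The following is a minimal sketch of that evaluation, assuming the longest-match precedence and the `*` / `$` wildcard semantics described in RFC 9309. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan report. Paths are compared literally, with no percent-decoding, which is itself an assumption. The repeated %3F and %3D entries are listed once, since duplicates do not change the result.

```python
import re

# Rules copied from the "*" group above as (directive, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/*%2C*"),
    ("disallow", "/feed"),
    ("disallow", "/*%3F*"),
    ("disallow", "/*%3D*"),
    ("disallow", "/ddos"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/trackback/"),
    ("disallow", "/comments/"),
    ("disallow", "/resume/"),
    ("disallow", "/resume-skill/"),
    ("disallow", "/broken-page-url/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule allows the path.

    Ties between an allow and a disallow of equal length go to allow.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and directive == "allow"):
                best_len = length
                allowed = directive == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/jobs/")` is True because only `Allow /` matches, while `is_allowed("/feed")` and `is_allowed("/feedback")` are both False because `Disallow /feed` is a prefix rule and is longer than `/`.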

bingbot
baiduspider
baiduspider-image
baiduspider-video
baiduspider-news
baiduspider-favo
baiduspider-cpro
baiduspider-ads
msnbot
msnbot-media
adidxbot
bingpreview

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
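
The crawl-delay field is not defined in RFC 9309, and support varies by crawler. For a crawler that chooses to honor it, a value of 5 is commonly read as a minimum of five seconds between requests. The sketch below shows one way to enforce that spacing. The class name `CrawlDelayGate` and its injectable `clock` and `sleep` parameters are hypothetical and exist only for illustration and testing.

```python
import time


class CrawlDelayGate:
    """Enforce a minimum interval between successive requests to one host."""

    def __init__(self, delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self._delay = float(delay_seconds)
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # earliest time the next request may start

    def wait(self):
        """Block until the next request is permitted; return seconds waited."""
        now = self._clock()
        waited = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            waited = self._next_allowed - now
            self._sleep(waited)
            # Re-read the clock, but never schedule earlier than planned.
            now = max(self._clock(), self._next_allowed)
        self._next_allowed = now + self._delay
        return waited
```

A crawler would create one gate per host, for example `gate = CrawlDelayGate(5)`, and call `gate.wait()` immediately before each fetch.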

gptbot
google-extended
meta-externalagent
amazonbot
anthropic-ai
googleother
claude-web
perplexity
cohere
cohere-ai
applebot-extended
google-cloudvertexbot

Rule Path
Disallow /

gptbot
google-extended
meta-externalagent
amazonbot
anthropic-ai
googleother
claude-web
perplexity
cohere
cohere-ai
applebot-extended
google-cloudvertexbot

Rule Path
Allow /about-us
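
The same twelve user agents appear in two groups, one with `Disallow /` and one with `Allow /about-us`. RFC 9309 says that multiple groups matching the same user agent are combined into one, and that the longest matching rule wins. Assuming a crawler follows both points, these agents may fetch paths beginning with /about-us and nothing else. The sketch below illustrates that reading; `GROUPS`, `merge_groups`, and `is_allowed` are illustrative names, and only plain prefix matching is implemented because neither rule uses wildcards.

```python
from collections import defaultdict

AI_AGENTS = [
    "gptbot", "google-extended", "meta-externalagent", "amazonbot",
    "anthropic-ai", "googleother", "claude-web", "perplexity",
    "cohere", "cohere-ai", "applebot-extended", "google-cloudvertexbot",
]

# The two groups as listed in the report: (user agents, rules).
GROUPS = [
    (AI_AGENTS, [("disallow", "/")]),
    (AI_AGENTS, [("allow", "/about-us")]),
]


def merge_groups(groups):
    """Combine the rules of every group that names the same user agent."""
    merged = defaultdict(list)
    for agents, rules in groups:
        for agent in agents:
            merged[agent.lower()].extend(rules)
    return dict(merged)


def is_allowed(agent, path, merged):
    """Longest-prefix match; an allow wins a tie; no match means allowed."""
    best_len = -1
    allowed = True
    for directive, prefix in merged.get(agent.lower(), []):
        if path.startswith(prefix):
            if len(prefix) > best_len or (
                len(prefix) == best_len and directive == "allow"
            ):
                best_len = len(prefix)
                allowed = directive == "allow"
    return allowed
```

With `merged = merge_groups(GROUPS)`, the call `is_allowed("GPTBot", "/about-us", merged)` is True and `is_allowed("GPTBot", "/jobs", merged)` is False. A parser that does not merge groups, and instead uses only the first or the last one, would reach a different result, so the actual behavior depends on the crawler.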

dotbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

bytedance

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.workrr.in/sitemap.xml