agcamritsar.in
robots.txt

Robots Exclusion Standard data for agcamritsar.in

Resource Scan

Scan Details

Site Domain agcamritsar.in
Base Domain agcamritsar.in
Scan Status Ok
Last Scan2024-06-19T08:19:56+00:00
Next Scan 2024-07-19T08:19:56+00:00

Last Scan

Scanned2024-06-19T08:19:56+00:00
URL https://agcamritsar.in/robots.txt
Domain IPs 2a02:4780:15:1f03:97db:5130:ba31:6e64, 84.32.84.106
Response IP 77.37.66.138
Found Yes
Hash f03ad53ed9bc295e8750f4d46375efacffa1d798ef49758e6e54274ec07ae724
SimHash c4454b478013

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /data/

yandex

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

googlebot

Rule Path
Allow .js
Allow .css

Other Records

Field Value
sitemap https://agcamritsar.in/sitemap.xml