ncpc.gov
robots.txt

Robots Exclusion Standard data for ncpc.gov

Resource Scan

Scan Details

Site Domain ncpc.gov
Base Domain ncpc.gov
Scan Status Ok
Last Scan2024-11-19T16:08:46+00:00
Next Scan 2024-12-19T16:08:46+00:00

Last Scan

Scanned2024-11-19T16:08:46+00:00
URL https://ncpc.gov/robots.txt
Redirect https://www.ncpc.gov//robots.txt
Redirect Domain www.ncpc.gov
Redirect Base ncpc.gov
Domain IPs 66.228.35.108
Redirect IPs 66.228.35.108
Response IP 66.228.35.108
Found Yes
Hash d8437c2464fb7ad674b22f54374353c55ba505ed5b86c8949c849ca1a9fed5ef
SimHash 5706d3fbefb9

Groups

ccbot

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

blp_bbot

Rule Path
Disallow /

seznambot/3.2

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

pingdom.com_bot_version_1.4_(http://www.pingdom.com/)

Rule Path
Disallow /

pingdom

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mail.ru_bot/2.0

Rule Path
Disallow /

scooperbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

idmarch

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Disallow /web2.0/
Disallow /json/
Disallow /fcip/
Disallow /app/
Disallow /go/
Disallow /combox/
Disallow /code/
Disallow /db/
Disallow /rsvp/
Disallow /review/archive/

Comments

  • Directories