file.info
robots.txt

Robots Exclusion Standard data for file.info

Resource Scan

Scan Details

Site Domain file.info
Base Domain file.info
Scan Status Ok
Last Scan2024-06-11T16:42:54+00:00
Next Scan 2024-06-18T16:42:54+00:00

Last Scan

Scanned2024-06-11T16:42:54+00:00
URL https://file.info/robots.txt
Domain IPs 104.26.8.194, 104.26.9.194, 172.67.72.72, 2606:4700:20::681a:8c2, 2606:4700:20::681a:9c2, 2606:4700:20::ac43:4848
Response IP 104.26.9.194
Found Yes
Hash 228e00a57f322e49e68f2c008a86fa1a06ad473cb7a7138524b5348feb237ed8
SimHash 522d5840ee9a

Groups

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mediatoolkit

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

proximic

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://file.info/sitemap.gz

Warnings

  • 4 invalid lines.
  • `user-agent` is not a known field.