dinamalar.in
robots.txt

Robots Exclusion Standard data for dinamalar.in

Resource Scan

Scan Details

Site Domain dinamalar.in
Base Domain dinamalar.in
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-10-10T16:53:13+00:00
Next Scan 2025-01-08T16:53:13+00:00

Last Successful Scan

Scanned2024-06-13T16:26:01+00:00
URL http://www.dinamalar.in/robots.txt
Domain IPs 65.17.225.98
Response IP 65.17.225.98
Found Yes
Hash f550d2bb1f13d91a7190182ab904db58f27c015e1221e751395cc53a56859bad
SimHash 3994dc04ec71

Groups

*

Rule Path
Disallow show.asp
Disallow /dinaadmin/
Disallow /admin/
Disallow /testing/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dinamalar.com/sitemap.xml
sitemap https://www.dinamalar.com/sitemap_index.xml
sitemap https://www.dinamalar.com/sitemap_main.xml
sitemap https://www.dinamalar.com/sitemap_cities.xml

Comments

  • Block OpenAI
  • Block Google Bard AI
  • Block Common Crawl