Robots Exclusion Standard data for neerajjaiswal.com

Resource Scan

Scan Details

Site Domain neerajjaiswal.com
Base Domain neerajjaiswal.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-11-13T15:23:39+00:00
Next Scan 2025-11-20T15:23:39+00:00

Last Successful Scan

Scanned 2025-03-03T07:26:10+00:00
URL https://neerajjaiswal.com/robots.txt
Domain IPs 104.21.18.153, 172.67.182.161, 2606:4700:3033::6815:1299, 2606:4700:3037::ac43:b6a1
Response IP 172.67.182.161
Found Yes
Hash 1928ac1f343d133baadce54c60e691a2552346d6f9a6aa699b00fc2999a3db28
SimHash 21459b6955d3

Groups

*

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://neerajjaiswal.com/sitemap.xml
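
To illustrate how the groups and sitemap record above would be interpreted by a crawler, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is a reconstruction from the tables in this report, not the original file: directive order, casing, and comments are assumptions, so it will not reproduce the hash listed above. The helper name build_parser and the agent "ExampleBot" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables in this report.
# Assumption: the original file's ordering, casing, and comments may differ.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: adsbot-google
Allow: /

User-agent: gptbot
Disallow: /

Sitemap: https://neerajjaiswal.com/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
home = "https://neerajjaiswal.com/"

# "ExampleBot" is a made-up agent that falls through to the "*" group.
for agent in ("GPTBot", "AdsBot-Google", "ExampleBot"):
    print(agent, parser.can_fetch(agent, home))

# Sitemap URLs declared in the file (requires Python 3.8+).
print(parser.site_maps())
```

Under this reconstruction, GPTBot is denied the site root, while AdsBot-Google and any unnamed agent are allowed. Note that urllib.robotparser is only one interpretation of the Robots Exclusion Standard; individual crawlers may match agents and paths differently.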

Comments

  • Google adsbot ignores robots.txt unless specifically named!