manoramayearbook.in
robots.txt

Robots Exclusion Standard data for manoramayearbook.in

Resource Scan

Scan Details

Site Domain manoramayearbook.in
Base Domain manoramayearbook.in
Scan Status Ok
Last Scan2024-10-13T08:09:10+00:00
Next Scan 2024-11-12T08:09:10+00:00

Last Scan

Scanned2024-10-13T08:09:10+00:00
URL https://www.manoramayearbook.in/robots.txt
Domain IPs 184.51.96.46, 2600:1413:b000:382::4a9, 2600:1413:b000:389::4a9
Response IP 23.54.56.229
Found Yes
Hash f79d1aa79ee9e02568ecad0482fa6379c82293b8a50e340f723a2351d0811099
SimHash 5934c8c4f773

Groups

*

Rule Path
Disallow /international-relations/*
Disallow /polity-and-constitution/*
Disallow /schemes/*
Disallow /awards/*
Disallow /getmentored/*
Disallow /getmentored/*
Disallow /evaluate/*
Disallow /referback/*
Disallow /notification.html*
Disallow /notification.html*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /