folkpunjab.org
robots.txt

Robots Exclusion Standard data for folkpunjab.org

Resource Scan

Scan Details

Site Domain folkpunjab.org
Base Domain folkpunjab.org
Scan Status Ok
Last Scan2024-09-26T10:37:08+00:00
Next Scan 2024-10-03T10:37:08+00:00

Last Scan

Scanned2024-09-26T10:37:08+00:00
URL https://folkpunjab.org/robots.txt
Domain IPs 104.21.94.154, 172.67.137.201, 2606:4700:3030::6815:5e9a, 2606:4700:3032::ac43:89c9
Response IP 172.67.137.201
Found Yes
Hash 6ae2d81644966ec493f27fedda7aa5beaaa931c7c2717f5bcc74c51e19a7587d
SimHash 721d19c1a0e1

Groups

*

Rule Path
Disallow /fonts/

ahrefsbot
semrushbot
grapeshot
dotbot
petalbot

Rule Path
Disallow /

amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
imagesiftbot
img2dataset
meltwaternews
meltwaternews www.meltwater.com
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
pip|bot
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://folkpunjab.org/sitemap/pages/
sitemap https://folkpunjab.org/sitemap/poets/
sitemap https://folkpunjab.org/sitemap/poets/shahmukhi/
sitemap https://folkpunjab.org/sitemap/poets/gurmukhi/
sitemap https://folkpunjab.org/sitemap/poems/
sitemap https://folkpunjab.org/sitemap/poems/shahmukhi/
sitemap https://folkpunjab.org/sitemap/poems/gurmukhi/