pixai.art
robots.txt

Robots Exclusion Standard data for pixai.art

Resource Scan

Scan Details

Site Domain pixai.art
Base Domain pixai.art
Scan Status Ok
Last Scan2025-09-14T18:38:44+00:00
Next Scan 2025-09-21T18:38:44+00:00

Last Scan

Scanned2025-09-14T18:38:44+00:00
URL https://pixai.art/robots.txt
Domain IPs 104.20.25.191, 172.66.155.89, 2606:4700:10::6814:19bf, 2606:4700:10::ac42:9b59
Response IP 104.20.25.191
Found Yes
Hash 4587ae8f7c763ac1ef73fe21288bfa80361cecd9ce21c79a443c1bc4a7a45c20
SimHash f31c9940c6b4

Groups

*

Rule Path
Disallow
Disallow *utm_source%3D
Disallow /generator/image?task=*
Disallow /generator/image?initialValues=*
Disallow /profile/*
Disallow /submit/upload
Disallow /market/submit
Disallow /*/followers
Disallow /*/following

adsbot-google
adsidxbot
google-inspectiontool
facebookexternalhit
perplexitybot
perplexity-user
canva-slackapp-linkexpanding

Rule Path
Allow /

applebot-extended
gptbot
google-extended
bytespider
claudebot
ccbot
meta-externalagent
imagesiftbot
timpibot
scrapy
ai2bot
diffbot
img2dataset
icc-crawler
facebookbot
anthropic-ai
velenpublicwebcrawler
friendlycrawler
claude-web
omgili
webzio-extended
ai2bot-dolma

Rule Path
Disallow /

twitterbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://cdn.pixai.art/seo/sitemap_index.xml

Comments

  • DaumWebMasterTool:147d6ebbf8344d67bc1a67ae7024c9ed71f4663bee9df65f26a850da1bfcbe23:SQz8q8gaEwm8DvTv0cIRow==