lightpdf.cn
robots.txt

Robots Exclusion Standard data for lightpdf.cn

Resource Scan

Scan Details

Site Domain lightpdf.cn
Base Domain lightpdf.cn
Scan Status Ok
Last Scan 2025-10-10T15:27:17+00:00
Next Scan 2025-10-17T15:27:17+00:00

Last Scan

Scanned 2025-10-10T15:27:17+00:00
URL https://lightpdf.cn/robots.txt
Domain IPs 119.23.148.22
Response IP 119.23.148.22
Found Yes
Hash 6241f5fe3cff33a7d878dd1f63ce6fa27bae78136785ba038d4d5b13c5270a91
SimHash 4e04d9525a90
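
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched robots.txt body, although the report does not name the algorithm. The following is a minimal Python sketch under that assumption. The names body_hash and matches_report are hypothetical, and hashing the raw response bytes is also an assumption.

```python
import hashlib

# Hash value copied from the scan report.
REPORTED_HASH = "6241f5fe3cff33a7d878dd1f63ce6fa27bae78136785ba038d4d5b13c5270a91"


def body_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw response bytes with SHA-256;
    the report itself does not state the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Return True only if the body hashes to the value recorded in the scan."""
    return body_hash(body) == REPORTED_HASH
```

If the assumption holds, a freshly fetched copy of https://lightpdf.cn/robots.txt that still matches the reported hash has not changed since the scan. A mismatch could mean either that the file changed or that the scanner uses a different algorithm or normalization.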

Groups

twitterbot
*

Rule Path
Disallow /sitemapGenerator/
Disallow /pad/
Disallow /buy-vip/
Disallow /wp-admin/
Disallow /wp-content/
Disallow /contact/
Disallow /author/
Disallow /wp-images/
Disallow /docs/

googlebot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver-web.archive.org

Rule Path
Allow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://lightpdf.cn/sitemap.xml
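
The groups and records above can be checked against specific crawlers with Python's standard urllib.robotparser module. The sketch below rebuilds a robots.txt from the parsed groups in this report. The exact text, ordering, and letter case of the live file are assumptions, since only the parsed form is shown here. The names ROBOTS_TXT and build_parser are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the parsed groups in the report; the live file's exact
# text and formatting are an assumption.
ROBOTS_TXT = """\
User-agent: twitterbot
User-agent: *
Disallow: /sitemapGenerator/
Disallow: /pad/
Disallow: /buy-vip/
Disallow: /wp-admin/
Disallow: /wp-content/
Disallow: /contact/
Disallow: /author/
Disallow: /wp-images/
Disallow: /docs/

User-agent: googlebot
Allow: /

User-agent: mediapartners-google
Allow: /

User-agent: adsbot-google
Allow: /

User-agent: googlebot-image
Allow: /

User-agent: googlebot-mobile
Allow: /

User-agent: ia_archiver-web.archive.org
Allow: /

User-agent: *
Allow: /

Sitemap: https://lightpdf.cn/sitemap.xml
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rp = build_parser()

# Compare how a few crawlers are treated for a disallowed path.
for agent in ("googlebot", "twitterbot", "SomeOtherBot"):
    print(agent, rp.can_fetch(agent, "https://lightpdf.cn/docs/"))

# Sitemap records are exposed separately (Python 3.8+).
print(rp.site_maps())
```

The file names the `*` user agent in two groups, one with Disallow rules and one with `Allow /`. CPython's parser treats the first group containing `*` as the default, so unlisted crawlers are held to the Disallow list. Other parsers may merge the two groups instead. For a path such as /docs/ the outcome should be the same under longest-match precedence, but this is worth verifying against the parser a given crawler actually uses.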