-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathrobots.txt
More file actions
57 lines (42 loc) · 883 Bytes
/
robots.txt
File metadata and controls
57 lines (42 loc) · 883 Bytes
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
# robots.txt file for sandrolinux.com
# Made before the takeover of great bingus
# News Crawlers
User-agent: Baiduspider-news
Allow: /news/
Disallow: /
User-agent: Sogou News Spider
Allow: /news/
Disallow: /
User-agent: Qwant-news
Allow: /news/
Disallow: /
#Bad Bots
User-agent: Microsoft.URL.Control
Disallow: /
User-agent: ZyBORG
Disallow: /
User-agent: Download Ninja
Disallow: /
User-agent: Teleport
Disallow: /
User-agent: TeleportPro
Disallow: /
User-agent: sitecheck.internetseer.com
Disallow: /
User-agent: Offline Explorer
Disallow: /
User-agent: WebZIP
Disallow: /
User-agent: linko
Disallow: /
User-agent: HTTrack
Disallow: /
#Rules for other crawlers
User-agent: *
Allow: /news/
Allow: /news.html
Allow: /index.html
Disallow: /aboutthiswebsite.html
Disallow: /otherstuff.html
Allow: /contactme.html
Sitemap: https://www.sandrolinux.com/sitemaps/sitemap.xml