Commit 677a2a1

Update crawler filtering example
1 parent bfd6f83 commit 677a2a1

1 file changed: +17 -0 lines changed

README.md

Lines changed: 17 additions & 0 deletions
@@ -64,6 +64,7 @@ Nginx version: `1.11.5`
- [Force redirect from http to https](#强制将http重定向到https)
- [Two virtual hosts](#两个虚拟主机)
- [Standard virtual host configuration](#虚拟主机标准配置)
- [Crawler filtering](#爬虫过滤)
- [Hotlink protection](#防盗链)
- [Virtual directory configuration](#虚拟目录配置)
- [Image hotlink protection](#防盗图配置)
@@ -1233,6 +1234,22 @@ http {
}
```

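### Crawler filtering

Filter requests by their `User-Agent` header. A simple regular expression is enough to reject unwanted requests from basic crawlers, that is, clients that send the default `User-Agent` of an HTTP library or command-line tool.

> `~*` performs a case-insensitive regular expression match.
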
```nginx
location / {
    # Reject requests whose User-Agent matches a common scripted client
    if ($http_user_agent ~* "python|curl|java|wget|httpclient|okhttp") {
        return 503;
    }
    # Normal request handling
    # ...
}
```
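
The same filter can also be written with a `map` block, which keeps the pattern in one place when several `location` blocks need it. The following is a minimal sketch and not part of the original README: the variable name `$blocked_agent` and the `listen 80` server are illustrative assumptions, and the fragment omits the `events` block that a complete `nginx.conf` also needs.

```nginx
http {
    # $blocked_agent is a hypothetical variable name chosen for this sketch.
    # The "~*" prefix makes the map pattern a case-insensitive regex.
    map $http_user_agent $blocked_agent {
        default 0;
        "~*(python|curl|java|wget|httpclient|okhttp)" 1;
    }

    server {
        listen 80;  # assumed port, for illustration only

        location / {
            # "if" treats an empty string or "0" as false
            if ($blocked_agent) {
                return 503;
            }
            # Normal request handling
            # ...
        }
    }
}
```

One way to check either variant is to send a request with `curl`, whose default `User-Agent` starts with `curl/` and should therefore receive the 503 response, and then repeat it with a browser-like value passed through `curl -A`.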

### Hotlink protection

```nginx
