-
Notifications
You must be signed in to change notification settings - Fork 13
/
pa-qu-xiao-zhao-xin-xi.html
164 lines (127 loc) · 6.28 KB
/
pa-qu-xiao-zhao-xin-xi.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>lizherui的程序世界</title>
<meta name="description" content="">
<meta name="author" content="lizherui">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<!-- Le HTML5 shim, for IE6-8 support of HTML elements -->
<!--[if lt IE 9]>
<script src="./theme/html5.js"></script>
<![endif]-->
<!-- Le styles -->
<link href="./theme/bootstrap.min.css" rel="stylesheet">
<link href="./theme/bootstrap.min.responsive.css" rel="stylesheet">
<link href="./theme/local.css" rel="stylesheet">
<link href="./theme/pygments.css" rel="stylesheet">
</head>
<body>
<div class="navbar navbar-inverse">
<div class="navbar-inner">
<div class="container">
<button type="button" class="btn btn-navbar" data-toggle="collapse" data-target=".nav-collapse">
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a class="brand" href=".">lizherui的程序世界</a>
<div class="nav-collapse collapse">
<ul class="nav">
<li><a href="./pages/about.html">About</a></li>
</ul>
<form class="navbar-search pull-right" action="/search.html">
<input type="text" class="search-query" placeholder="Search" name="q" id="s">
</form>
</div>
</div>
</div>
</div>
<div class="container">
<div class="content">
<div class="row">
<div class="span9">
<div class='article'>
<div class="content-title">
<h1>爬取校招信息</h1>
2013-07-31
by <a class="url fn" href="./author/lizherui.html">lizherui</a>
</div>
<div><p>抓取北邮人论坛和水木社区校招信息的爬虫程序, 直接运行main.py即可,非常简洁,可以扩展。</p>
<p>爬虫根据自定义关键字先对校招信息进行过滤,然后存储到本机redis中。本机若有lamp环境,可直接从redis读取信息到web页面上即可。</p>
<p>Talk is cheap, show you the code:<a href="https://github.com/lizherui/spider_python">https://github.com/lizherui/spider_python</a>.</p>
<p>Enjoy it.</p></div>
<hr>
<h2>Comments</h2>
<div id="disqus_thread"></div>
<script type="text/javascript">
var disqus_shortname = 'lizheruisworld';
var disqus_title = '爬取校招信息';
(function() {
var dsq = document.createElement('script'); dsq.type = 'text/javascript'; dsq.async = true;
dsq.src = 'http://' + disqus_shortname + '.disqus.com/embed.js';
(document.getElementsByTagName('head')[0] || document.getElementsByTagName('body')[0]).appendChild(dsq);
})();
</script>
<noscript>Please enable JavaScript to view the <a href="http://disqus.com/?ref_noscript">comments powered by Disqus.</a></noscript>
</div>
</div>
<div class="span3">
<div class="well" style="padding: 8px 0; background-color: #FBFBFB;">
<ul class="nav nav-list">
<li class="nav-header">
Site
</li>
<li><a href="./archives.html">Archives</a>
<li><a href="./tags.html">Tags</a>
<li><a href="http://www.lizherui.com/feeds/all.rss.xml" rel="alternate">RSS</a></li>
</ul>
</div>
<div class="well" style="padding: 8px 0; background-color: #FBFBFB;">
<ul class="nav nav-list">
<li class="nav-header">
Categories
</li>
<li><a href="./category/life.html">Life</a></li>
<li><a href="./category/tech.html">Tech</a></li>
<li><a href="./category/work.html">Work</a></li>
</ul>
</div>
<div class="social">
<div class="well" style="padding: 8px 0; background-color: #FBFBFB;">
<ul class="nav nav-list">
<li class="nav-header">
Social
</li>
<li><a href="https://github.com/lizherui">Github</a></li>
<li><a href="https://twitter.com/lzrak47">Twitter</a></li>
<li><a href="https://www.facebook.com/profile.php?id=100004875786021">Facebook</a></li>
<li><a href="http://www.linkedin.com/profile/view?id=232391796">Linkedin</a></li>
<li><a href="http://weibo.com/lzrm4a1">Weibo</a></li>
<li><a href="http://www.zhihu.com/people/li-zhe-rui">Zhihu</a></li>
</ul>
</div>
</div>
<div class="well" style="padding: 8px 0; background-color: #FBFBFB;">
<ul class="nav nav-list">
<li class="nav-header">
Links
</li>
<li><a href="https://www.google.com/ncr">Google</a></li>
<li><a href="http://python.org/">Python</a></li>
<li><a href="http://docs.getpelican.com/">Pelican</a></li>
</ul>
</div>
</div>
</div> </div>
<footer>
<br />
<p><a href=".">lizherui的程序世界</a> © lizherui 2013</p>
</footer>
</div> <!-- /container -->
<script src="http://ajax.googleapis.com/ajax/libs/jquery/1.7.1/jquery.min.js"></script>
<script src="http://twitter.github.com/bootstrap/assets/js/bootstrap-collapse.js"></script>
<script>var _gaq=[['_setAccount','UA-42648273-1'],['_trackPageview']];(function(d,t){var g=d.createElement(t),s=d.getElementsByTagName(t)[0];g.src='//www.google-analytics.com/ga.js';s.parentNode.insertBefore(g,s)}(document,'script'))</script>
<a href="https://github.com/lizherui"><img style="position: absolute; top: 40px; right: 0; border: 0;" src="http://s3.amazonaws.com/github/ribbons/forkme_right_white_ffffff.png" alt="Fork me on GitHub" /></a>
</body>
</html>