Abstract
In this paper, Python crawler technology is used to capture the information of China railway 12306 website, and the effective electronic ticket is generated by data flow analysis method, so as to provide the best plan for travelers. 12306 website openes the automatic reminder function through multiple channels to push subscription information to customers, which is convenient to travel migration people. However, the disadvantage is that the refresh frequency is fixed, serious fragmentation of tickets. A design system based on Python crawler technology puts forward a good solution. The system architecture mainly includes:invoking the query interface on the 12306 website; collecting and analyzing round trip number, station code table and other information; processing the data stream by means of splicing technology and breaking through the natural frequency to query the residual ticket information in real time; sending real-time information with the help of third-party information push tool Serve Chan. The experimental results show that the optimization algorithm of the system can integrate the fragmented residual tickets to the maximum extent, which is effective.
|