The Architecture Design of MICAPS4 Server System

Cite this article

Wang Ruotong, Wang Jianmin, Huang Xiangdong, Dong Yifeng, Long Mingsheng. The Architecture Design of MICAPS4 Server System[J]. Journal of Applied Meteorological Science, 2018, 29(1): 1-12.

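Wang Ruotong¹, Wang Jianmin², Huang Xiangdong², Dong Yifeng², Long Mingsheng²

1. National Meteorological Center, Beijing 100081;
2. School of Software, Tsinghua University, Beijing 100084

Received 2017-07-25; revised manuscript received 2017-12-01.

Funding: China Meteorological Administration project "2015 Meteorological Support Engineering Construction for Mountain Flood and Geological Disaster Prevention"

Corresponding author: Wang Ruotong, E-mail: wangruo02@mails.thu.edu.cn.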

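Abstract: MICAPS4 adopts a client/server architecture, and the server system is a key part of it. Using distributed storage and distributed computing technologies, the server system is built as a cluster that holds meteorological real-time data on the order of 10² TB, with tens of millions of data items in total, and serves hundreds of concurrent users. The MICAPS4 server system is the first in China to migrate all meteorological real-time data from files to a database and from a centralized system to a distributed system, and it has been deployed nationwide since 2015. Under massive meteorological data volumes and heavy concurrent user access, it shows high stability and excellent read-write performance, and it is easy to scale and maintain. The MICAPS4 server system consists of five subsystems: the distributed storage system, the distributed pre-processing system, the station observation polling system, the query server system and the monitoring system. The distributed storage system provides the MICAPS4 client with high-speed random and sequential reads of near-real-time data; the distributed pre-processing system uses a peer-to-peer distributed architecture to perform stream computing on massive meteorological real-time data; the station observation polling system synchronizes heterogeneous replicas of observation data across systems; the query server system uses multi-threaded server technology to serve real-time computing requests from the MICAPS4 client; and the monitoring system uses probes deployed on every node to report monitoring information actively.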

Keywords: MICAPS4; big data; distributed storage; distributed computing; real-time data

The Architecture Design of MICAPS4 Server System

Wang Ruotong¹, Wang Jianmin², Huang Xiangdong², Dong Yifeng², Long Mingsheng²

1. National Meteorological Center, Beijing 100081;
2. School of Software, Tsinghua University, Beijing 100084

Abstract: Meteorological data are typical unstructured data, amounting to dozens of TB per day. In MICAPS3, data pre-processing, storage and access based on an RDBMS and a file system became the bottleneck. To meet MICAPS4 users' need for fast, timely queries of meteorological real-time data, a high-performance storage system for massive meteorological data and a stable 7×24 distributed data pre-processing system are designed and built on a non-relational key-value distributed DBMS, following the multi-dimensional model of meteorological data and the observed query behavior of users. MICAPS4 uses a client/server architecture, and the high-performance server cluster is its critical component. Using a distributed key-value data model and a P2P infrastructure, the MICAPS4 server system distributes all real-time data, which arrive at a very high rate, across multiple servers through an automatic load-balancing algorithm. All data are first stored in memory and periodically persisted to hard disk, which reduces the number of disk I/O operations and keeps write pressure low even under a heavy read load. To enhance data and system reliability, a distributed system architecture and multiple data replicas are used, which also improves system throughput. According to statistics gathered from the production environment, the performance of the MICAPS4 server system is improved by a factor of 100 over MICAPS3. The MICAPS4 server system moves all meteorological real-time data storage from the file system to a database and from a centralized system to a distributed system. The system became the core production system of the China Meteorological Administration in 2015 and has been deployed nationwide. Under massive meteorological data volumes and concurrent access by many users, it shows high stability and excellent read-write performance, and it is highly scalable and easy to maintain.
The MICAPS4 high-performance server system comprises five subsystems: the distributed storage system, the distributed pre-processing system, the station data polling system, the data query server and the monitoring probe. The distributed storage system provides high-performance access to meteorological real-time data in both random and sequential modes. The distributed pre-processing system performs stream computing on massive meteorological real-time data using a peer-to-peer distributed infrastructure. The station data polling system synchronizes heterogeneous replicas of station observation data across different systems. The data query server serves real-time computing requests from the MICAPS4 client by means of multi-threaded server technology. The monitoring probe is deployed on each server node and periodically reports host health messages. The overall design of the MICAPS4 server system is described, and the motivation, core technologies and design of each subsystem are introduced.

Key words: MICAPS4; big data; distributed storage; distributed computation; real-time data
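
The write path summarized in the abstract (keys spread over several servers, multiple replicas per key, writes held in memory and persisted to disk periodically) can be illustrated with a minimal Python sketch. It is not the MICAPS4 implementation: the class names `BufferedNode` and `Cluster`, the hash-modulo placement rule, the JSON files and the sample keys are all assumptions made for illustration, and the periodic flush is triggered by hand instead of by a timer.

```python
import hashlib
import json
import os
import tempfile


class BufferedNode:
    """Hypothetical storage node: writes land in memory, then are flushed in batches."""

    def __init__(self, name, data_dir):
        self.name = name
        self.path = os.path.join(data_dir, f"{name}.json")
        self.memory = {}      # entries not yet persisted
        self.disk_writes = 0  # number of batch flushes performed

    def put(self, key, value):
        self.memory[key] = value  # no disk I/O on the write path

    def get(self, key):
        if key in self.memory:
            return self.memory[key]
        return self._load().get(key)

    def flush(self):
        """Persist all buffered entries in one disk write (timer-driven in practice)."""
        if not self.memory:
            return
        stored = self._load()
        stored.update(self.memory)
        with open(self.path, "w", encoding="utf-8") as fh:
            json.dump(stored, fh)
        self.memory.clear()
        self.disk_writes += 1

    def _load(self):
        if not os.path.exists(self.path):
            return {}
        with open(self.path, encoding="utf-8") as fh:
            return json.load(fh)


class Cluster:
    """Hypothetical cluster: hash-based placement with a fixed number of replicas."""

    def __init__(self, nodes, replicas=2):
        self.nodes = nodes
        self.replicas = replicas

    def owners(self, key):
        # Assumed placement rule: hash the key, then take consecutive nodes.
        digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
        start = int(digest, 16) % len(self.nodes)
        return [self.nodes[(start + i) % len(self.nodes)]
                for i in range(self.replicas)]

    def put(self, key, value):
        for node in self.owners(key):
            node.put(key, value)

    def get(self, key):
        for node in self.owners(key):  # any replica can answer a read
            value = node.get(key)
            if value is not None:
                return value
        return None

    def flush_all(self):
        for node in self.nodes:
            node.flush()


# Usage: 24 forecast-hour entries under made-up keys, then one flush round.
data_dir = tempfile.mkdtemp()
cluster = Cluster([BufferedNode(f"node{i}", data_dir) for i in range(3)])
for hour in range(24):
    cluster.put(f"MODEL/TMP/850/2017072500.{hour:03d}", f"grid-{hour}")
cluster.flush_all()
print(cluster.get("MODEL/TMP/850/2017072500.003"))
print([node.disk_writes for node in cluster.nodes])
```

In this sketch the 48 replica writes cost at most one disk write per node, which is the effect the abstract attributes to buffering in memory before persisting. A production system would also need durability for unflushed data and rebalancing when nodes join or leave, which the sketch leaves out.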


Fig. 1 CIMISS-MICAPS4 server system architecture

Fig. 2 Data pre-processing system

Fig. 3 Station data polling system

Fig. 4 Data retrieval and real-time computing of data query server

Fig. 5 Data writing of data query server

Fig. 6 Monitoring probe and system integration