{"id":21793,"date":"2018-09-11T11:18:08","date_gmt":"2018-09-11T15:18:08","guid":{"rendered":"https:\/\/www.crim.ca\/blogue\/ce-que-jai-lu-cette-semaine-les-donnees-spatio-temporelles\/"},"modified":"2023-05-25T12:19:25","modified_gmt":"2023-05-25T16:19:25","slug":"what-i-read-this-week-spatiotemporal-data","status":"publish","type":"blogue","link":"https:\/\/www.crim.ca\/en\/blogue\/what-i-read-this-week-spatiotemporal-data\/","title":{"rendered":"What I read this week: Spatiotemporal data"},"content":{"rendered":"<p id=\"8b6f\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">This week, I took an interest in <strong>spatiotemporal data<\/strong>. There are several reasons why I have been looking into this subject. First, the City of Montreal makes available open data related to different spheres of activity: economy and business, education, health, society and culture. The sector that is of particular interest to me for Montreal is <strong>transportation<\/strong>.<\/p>\n<p id=\"58ac\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><a class=\"au ko\" href=\"https:\/\/www.crim.ca\/en\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">CRIM<\/a> has worked with spatiotemporal data to predict the response time of firefighters at the <a class=\"au ko\" href=\"http:\/\/ville.montreal.qc.ca\/sim\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">Service de s\u00e9curit\u00e9 incendie de Montr\u00e9al<\/a>. The article on this can be read on <a class=\"au ko\" href=\"https:\/\/medium.com\/crim\/predicting-the-response-times-of-firefighters-using-data-science-da79f6965f93\" rel=\"noopener\">Medium<\/a>. Given the experience gained during this project, it seemed logical to continue studying the subject. This new knowledge could eventually apply to resolving other <strong>mobility-related<\/strong> problems, such as <strong>travel in Montreal<\/strong>, thanks to the data collected by the <a class=\"au ko\" href=\"https:\/\/ville.montreal.qc.ca\/mtltrajet\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">MTL Trajet<\/a> application or the cameras and detectors scattered around the Island of Montreal.<\/p>\n<p id=\"a699\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Spatiotemporal data also offers interesting potential from a local perspective for <strong>flood prevention in Quebec<\/strong>. In 2018, public security officials at the <em>Minist\u00e8re de la S\u00e9curit\u00e9 publique<\/em> unveiled a civil security plan of action in cases of flooding (the Quebec government release, in French, is found <a class=\"au ko\" href=\"https:\/\/www.securitepublique.gouv.qc.ca\/fileadmin\/Documents\/securite_civile\/inondation\/Plan_action_inondations.pdf\" target=\"_blank\" rel=\"noopener ugc nofollow\">here<\/a>) as a follow-up to the major spring floods of the previous year. Over five years, an envelope of more than $30M was provided to update the mapping of flood-prone areas on Quebec territory. In <strong>climatology<\/strong>, spatiotemporal data have become such an important tool that many data science projects are being developed to organize, analyze and interpret them.<\/p>\n<h2 id=\"925a\" class=\"ln lo jd bn lp lq lr ls lt lu lv lw lx ly lz ma mb mc md me mf mg mh mi mj mk gi\">Spatiotemporal data<\/h2>\n<p id=\"d71a\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Spatiotemporal data are collected in several fields: in climate science, to predict the onset of extreme weather events; in Earth science, to detect anomalous areas within marine environments; or in epidemiology, to study the spread of diseases. Other fields, such as neuroscience, environmental science, social media and traffic dynamics, also use such information in various ways.<\/p>\n<blockquote class=\"mq mr ms\">\n<p id=\"6ea5\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">But what is special about spatiotemporal data? They are characterized by <strong>spatial<\/strong> (distance, direction, position) and <strong>temporal<\/strong> (number of occurrences, changes in time, duration) attributes. In other words, they are data, or measurements, that change in time and space.<\/p>\n<\/blockquote>\n<p id=\"bf94\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">For example, for predicting meteorological events, the spatial representation is defined by a grid of the land area under study, and various parameters and atmospheric measurements characterize it. The temporal representation is then defined by the evolution of these parameters over time, which could allow, for instance, to design a prediction model of the evolution of hurricanes.<\/p>\n<p id=\"e3c6\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">There is ample literature on the subject: the diversity of models, approaches and applications is practically countless. To begin with, I have limited myself to four articles of interest to give me an idea of how to manage and use spatiotemporal data.<\/p>\n<p id=\"bf42\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The first paper presents an approach using a 3D convolutional neural network to learn spatiotemporal features. The second paper presents data mining applications using association rules in the context of hurricane analysis. The third deals with a learning model of hierarchical structures to predict the onset of extreme weather events. The fourth defines a spatiotemporal data mining model analyzing anomalous association structures in a marine environment. The following sections will summarize the essential and relevant ideas and concepts from each paper.<\/p>\n<div class=\"o dz mx my ii mz\" role=\"separator\"><\/div>\n<div class=\"iw ix iy iz ja\">\n<h2 id=\"7f37\" class=\"ln lo jd bn lp lq ne ls lt lu nf lw lx ly ng ma mb mc nh me mf mg ni mi mj mk gi\">Learning Spatiotemporal Features with 3D Convolutional Networks<\/h2>\n<blockquote class=\"mq mr ms\">\n<p id=\"a3aa\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Tran, D., Bourdev, L., Fergus, R., Torresani, L., &amp; Paluri, M. (2015, December). Learning spatiotemporal features with 3d convolutional networks. In Computer Vision (ICCV), 2015 IEEE International Conference on (pp. 4489\u20134497). IEEE.<\/p>\n<\/blockquote>\n<h2 id=\"fc1a\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Objective<\/h2>\n<p id=\"066c\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Design of an approach for learning spatiotemporal features using a <strong>deep 3D convolutional neural network<\/strong> (3D ConvNets) trained on a large-scale supervised video dataset. The goal is to recognize different types of actions (101 actions) and objects (42 types) and to classify pairs of similar actions (432 actions) or individual scenes (14 scenes)<\/p>\n<h2 id=\"a807\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Summary<\/h2>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu nx\"><img fetchpriority=\"high\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*UuLS12dcv8CIfEMgSNw0Rw.png\" alt=\"\" width=\"700\" height=\"139\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"f5b9\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The idea behind 3D convolution operations is to preserve the temporal information of an input signal. Indeed, 2D convolutions applied on one or multiple images generate an image, thus a two-dimensional output. Only 3D convolutions preserve temporal information and generate a volumetric output. The same phenomenon is observed for pooling stages in 2D and 3D, respectively. Thus, using 3D convolutions to encapsulate the information related to objects, scenes and actions.<\/p>\n<h2 id=\"1dbc\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Method<\/h2>\n<p id=\"c854\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Architecture<\/strong><\/p>\n<p id=\"2b4e\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The basic architecture of the convolution kernel is a fixed receptive field of dimension {d x 3 x 3} where only the time dimension is modified for experimentation purposes. This limitation is due to the considerable training time that would result from numerous experiments. The notation for the dimensions of the video clips and kernels is as follows:<\/p>\n<p id=\"a5b0\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Video clip\u00a0(c\u00a0x\u00a0l\u00a0x\u00a0h\u00a0x\u00a0w)<\/em><\/p>\n<ul class=\"\">\n<li id=\"d3ce\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\"><em>c<\/em> is the number of channels (different perspectives of an image, number of matrices defining it),<em> l<\/em> is the number of images, <em>h<\/em> is the height,<em> w<\/em> is the width<\/li>\n<\/ul>\n<p id=\"ed9f\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Kernel\u00a0(d\u00a0x\u00a0k\u00a0x\u00a0k)<\/em><\/p>\n<ul class=\"\">\n<li id=\"cd0a\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\"><em>d<\/em>\u00a0is the temporal dimension,\u00a0<em>k\u00a0<\/em>is the spatial dimension (identical height and width)<\/li>\n<\/ul>\n<p id=\"31fd\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Basic parameters<\/strong><\/p>\n<p id=\"d4e2\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Video clips are taken as input to subsequently establish their recognition or classification. The videos are 128 x 171 and are separated into clips of 16 images, with each frame divided into three channels (RGB). Several parameters can be modified to obtain the best possible results: number of convolution layers, pooling layers, fully connected layers, loss layers, filters per layer, and filling, stride, pooling size, batch size, learning rate and number of iterations.<\/p>\n<p id=\"e980\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Variations in parameters<\/strong><\/p>\n<p id=\"fb5d\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Several parameter configurations are tested to optimize the encapsulation of the temporal information. In order to save time, only the temporal depth d of the kernel is modified. Two approaches are used:<\/p>\n<p id=\"06e6\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Homogeneous temporal depth<\/em><\/p>\n<ul class=\"\">\n<li id=\"a173\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">All convolution layers use kernels of identical temporal dimensions. Four configurations are tested, with depths of {1, 3, 5, 7}. For example, 1-1-1-1-1 for a five-layer convolutional architecture, where each kernel at each layer has a depth of 1.<\/li>\n<\/ul>\n<p id=\"8afb\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Variable temporal depth<\/em><\/p>\n<ul class=\"\">\n<li id=\"328a\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The temporal depth of the kernel varies according to the convolution layer. Two configurations are tested, increasing and decreasing, respectively, with the following form: 3-3-5-5-7 and 7-5-5-3-3, for five-convolutional-layered architectures.<\/li>\n<\/ul>\n<p id=\"e1eb\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">For homogeneous architectures, ones with a depth of three (3-3-3-3-3) provide the best results, and these perform better than the two heterogeneous architectures. It is also shown that increasing the spatial field does not improve the results. Therefore, the 3 x 3 x 3 architecture is retained when dimensioning the kernels.<\/p>\n<p id=\"5c3c\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Final architecture<\/strong><\/p>\n<p id=\"0e1f\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Considering the previous experiments and given the computational and memory limits of the available hardware, the 3D convolutional neural network is presented as follows:<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu ol\"><img decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*f4fJp5jv_o1_RkumEkUjyg.png\" alt=\"\" width=\"700\" height=\"116\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"d2ca\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The 3D ConvNets contains eight convolution layers, each with, respectively {64, 128, 256, 256, 512, 512, 512} filters. These convolution layers allow images to be transformed and learn different features. Each kernel is of dimension {3 x 3 x 3} and has a stride of {1 x 1 x 1}.<\/p>\n<p id=\"c462\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The network contains five layers of max-pooling, an operation where only the maximum value of a region bounded by a kernel is retained to reduce the dimensionality of the input matrix. The kernels for the layers {pool2, pool3, pool4, pool5} are of dimension {2 x 2 x 2} and {1 x 2 x 2} for the pool1 layer. Each has a stride of {1 x 2 x 2}.<\/p>\n<p id=\"0e88\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Two fully connected layers with 4096 output units follow convolution and pooling operations. They are necessary to classify the images from the high-level features provided by the convolution steps.<\/p>\n<h2 id=\"7218\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Results<\/h2>\n<p id=\"5f21\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">The 3D ConvNets is tested on human action classification, action pair similarity, and scene and object recognition.<\/p>\n<ul class=\"\">\n<li id=\"fbb3\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The 3D ConvNets is better suited for spatiotemporal data than a 2D ConvNet, and is better at detecting information related to appearance and motion.<\/li>\n<li id=\"f0d4\" class=\"oc od jd kr b ks om kw on la oo le op li oq lm oh oi oj ok gi\" data-selectable-paragraph=\"\">A homogeneous {3 x 3 x 3} architecture of convolution kernels for each layer delivers the best performance.<\/li>\n<li id=\"fd47\" class=\"oc od jd kr b ks om kw on la oo le op li oq lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The features learned by the network, with a simple linear classifier, outperform or are comparable to those learned by current methods when the network is tested in four different application scenarios.<\/li>\n<li data-selectable-paragraph=\"\">The network is very user-friendly<\/li>\n<li id=\"1866\" class=\"oc od jd kr b ks om kw on la oo le op li oq lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The 3D ConvNet initially focuses on appearance in the first few images, then tracks motion in the others.<\/li>\n<\/ul>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu or\"><img decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*0tYpznD2hqLvhG-6yS_Z1A.png\" alt=\"\" width=\"700\" height=\"204\" \/><\/div>\n<\/div>\n<\/figure>\n<ul class=\"\">\n<li id=\"ae2b\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The features learned by the 3D ConvNets are compact and descriptive, as shown by the results following a dimensionality reduction using a Principal Component Analysis (PCA).<\/li>\n<li id=\"149b\" class=\"oc od jd kr b ks om kw on la oo le op li oq lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The features show a good capacity for generalization.<\/li>\n<\/ul>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu os\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/890\/1*LxaXmcvXpT20wu7Ogn8nRQ.png\" alt=\"\" width=\"445\" height=\"349\" \/><\/div>\n<\/figure>\n<\/div>\n<div class=\"o dz mx my ii mz\" role=\"separator\"><\/div>\n<div class=\"iw ix iy iz ja\">\n<h2 id=\"7dbd\" class=\"ln lo jd bn lp lq ne ls lt lu nf lw lx ly ng ma mb mc nh me mf mg ni mi mj mk gi\">Association Rule Data Mining Applications for Atlantic Tropical Cyclone Intensity Changes<\/h2>\n<blockquote class=\"mq mr ms\">\n<p id=\"e9c1\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Yang, R., Tang, J., &amp; Sun, D. (2011). Association rule data mining applications for Atlantic tropical cyclone intensity changes.\u00a0Weather and Forecasting,\u00a026(3), 337\u2013353.<\/p>\n<\/blockquote>\n<h2 id=\"1068\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Objective<\/h2>\n<p id=\"419c\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Application of a data mining technique, Association Rule Data Mining, to analyze tropical cyclone (TC) intensity changes. The paper provides a user guide for this mining technique and a method for overcoming the low number of occurrences of some extracted weather conditions to improve the predictive capacity of the intensity of tropical storms.<\/p>\n<h2 id=\"fe6e\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Overview<\/h2>\n<p id=\"1fc9\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">The association rule mining technique provides a detailed picture of the dataset and allows the detection of relationships among multiple conditions that may be missed in a theoretical analysis. The study aims to apply this data mining technique automatically and without supervision. This mining provides &#8220;multiple to one&#8221; associations from various geophysical features describing intensifying, weakening or stable cyclones. The data mining results can shed light on the underlying physical mechanisms that influence changes in the intensity of tropical storms.<\/p>\n<h2 id=\"f911\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Data and methodology<\/h2>\n<p id=\"4cf9\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Datasets<\/strong><\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu ot\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*PfB2sqyxju2iflSwFgBO-A.png\" alt=\"\" width=\"700\" height=\"470\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"eb72\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The HURDAT (NHC&#8217;s North Atlantic Hurricane Database) dataset was used to classify cyclone intensity and position. The SHIPS 2003 dataset obtained various parameters concerning tropical storms (21 features). The association rule implemented by <a class=\"au ko\" href=\"http:\/\/www.borgelt.net\/apriori.html\" target=\"_blank\" rel=\"noopener ugc nofollow\">Borgelt<\/a> was subsequently applied to the dataset.<\/p>\n<p id=\"ca16\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Definitions<\/strong><\/p>\n<p id=\"71b3\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">An <strong>association rule<\/strong> takes on the following form: Z \u2190 X, Y. In a business context, an example of a rule would be to say that a customer buying items X and Y is also likely to buy item Z. Items X and Y are called <strong>antecedents<\/strong> and Z is called the <strong>consequent<\/strong>.<\/p>\n<p id=\"5ea6\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">In the context of the paper, the antecedents are the geophysical conditions of the CTs represented by an interval of values, and the consequent is a category of change in intensity (intensifying, weakening, etc.)<\/p>\n<p id=\"3d2c\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Three parameters are typically used for association rule exploration. <strong>Support<\/strong> estimates the probability P( { X, Y, Z } ), which is, for example, the frequency with which particular conditions for tropical storms (high\/low wind speed, high\/low pressure, etc.) occur in the dataset. <strong>Confidence<\/strong> estimates the probability P( Z | { X, Y } ), which is the frequency with which the particular tropical storm conditions caused a change in intensity Z. An association rule is strong if it has high support and confidence<\/p>\n<p id=\"1341\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The <strong>lift<\/strong> estimates the probability P( { X, Y, Z } ) \/ [ P( { X, Y } ) x P( Z ) ], that is, the ratio of the support computed above to that which would be expected if the conditions X and Y were independent. In other words, the lift gives the ratio of the actual probability that a set of items contains the antecedent and the consequent, divided by the product of the individual probabilities of the antecedent and the consequent. This is the ratio of confidence to expected confidence.<\/p>\n<p id=\"ea96\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A lift of 1 would imply that the probability of occurrence of the antecedents and the consequent is independent. A lift greater than 1 would imply that the probability of occurrence of the antecedents and their consequent is dependent.<\/p>\n<p id=\"a75f\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">An example of an association rule will be explained hereafter to facilitate comprehension of the concept.<\/p>\n<h2 id=\"757d\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Stratification of cyclones and data pretreatment<\/h2>\n<p id=\"80f5\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Stratification<\/strong><\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu ou\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/924\/1*8gMzPF40MquvRxeJmAFjMg.png\" alt=\"\" width=\"462\" height=\"354\" \/><\/div>\n<\/figure>\n<p id=\"ee17\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Since changes in the intensity of tropical cyclones depend on the initial intensity of these cyclones, the data set is divided into different groups. However, before <strong>stratifying<\/strong> the dataset, it is necessary to remove cyclones that do not have a complete set of parameters. The cyclones can then be grouped according to their initial intensity. These groups include:<\/p>\n<ul class=\"\">\n<li id=\"67ba\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Tropical Depressions (TD)<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Tropical Storms (TS)<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Hurricanes (H1, H2, H3, H4, H5)<\/li>\n<\/ul>\n<p id=\"6104\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Two more categories are added, as class 5 hurricanes are infrequent. It is difficult to define rules with such a small sample, so a group containing hurricanes of class 1-2 and another containing classes 3-5 are introduced.<\/p>\n<ul class=\"\">\n<li id=\"3816\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Minor hurricane group (HR)<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Major hurricane group (MH)<\/li>\n<\/ul>\n<p id=\"ca66\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">To explore the combinations of factors influencing changes in intensity, cyclones are separated according to whether they intensify, weaken or remain stable (see Table 2).<\/p>\n<p id=\"23c6\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Pretreatment<\/strong><\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu ov\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*75RdkfLBULIle-UPpqzKCA.png\" alt=\"\" width=\"700\" height=\"372\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"f68d\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The association rule mining algorithm is initially designed to handle Boolean attribute datasets. In this case, the attributes of cyclones are numerical and continuous, so it is essential to transform them into <strong>binary conditions<\/strong>. Therefore, the spectrum of values is divided into two groups: low values and high values, according to a predefined threshold.<\/p>\n<p id=\"050d\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The threshold for each parameter is derived by taking the average of the respective averages of intensifying and weakening cyclones for each cyclone category. The relationship <strong>I &gt; W<\/strong> (in bold) indicates that the average of the intensifying cases is considerably larger than that of the weakening cases and vice versa. This expression already gives an idea of the role of each parameter in the intensification or weakening of cyclones.<\/p>\n<h2 id=\"662c\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Association Rules<\/h2>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu ow\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*zSQDRYk5Ay673a9jvcl5zQ.png\" alt=\"\" width=\"700\" height=\"201\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"1b7c\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The rule mining association algorithm implemented by <a class=\"au ko\" href=\"http:\/\/www.borgelt.net\/apriori.html\" target=\"_blank\" rel=\"noopener ugc nofollow\">Borgelt<\/a> was applied with the transformed binary parameters as priors to find combinations related to intensifying, weakening and stable cyclones. The abbreviations H and L represent high and low parameter values. For example, U200 = L in the TS case signifies that the 200-hPa zonal wind is less than or equal to 11.1 kt.<\/p>\n<p id=\"8970\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A priori, the parameters for controlling the weight of the rules are set to 33% for support and confidence and 100% for lift. Here is how to interpret an association rule.<\/p>\n<blockquote class=\"mq mr ms\">\n<p id=\"227b\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A rule typically takes the following form:<\/p>\n<p id=\"3d6c\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">INTENS \u2190 U200 = L, SHRD = L, SRLA = L (38,7, 75,4, 138,9)<\/p>\n<p id=\"a434\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">When all three conditions (U200 = L, SHRD = L, SRLA = L) are satisfied, a storm will intensify with 38.7% support, 57.4% confidence, and 138.6% lift.<\/p>\n<p id=\"6021\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">In other words, for cyclones in the Tropical Storms (TS) category, 38.7% of the cases satisfy the conditions (U200 = L, SHRD = L, SRLA = L). Of these cyclones, 57.4% intensified, compared to the average sample of 41.4% intensifying. The lift represents the ratio of the confidence to the confidence of the average intensifying sample (57.4\/41.4 = 138.6%) (57.4\/41.4 = 138.6%)<\/p>\n<\/blockquote>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu ox\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/938\/1*y1_5OwKWm2hwm1drHH03pQ.png\" alt=\"\" width=\"469\" height=\"431\" \/><\/div>\n<\/figure>\n<p id=\"e818\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">However, once the algorithm is applied, it is necessary to remove the <strong>redundant rules<\/strong> to consider only the <strong>concise rules<\/strong>. A rule is concise if it cannot be derived by a subset of priors from another rule with higher confidence. In other words, <strong>a concise rule contains the smallest number of priors<\/strong>. Any rule that contains the same antecedents and additional antecedents, but does not have a higher confidence than a concise rule, is redundant.<\/p>\n<p id=\"1b28\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">For example, rule 4 (see Table 5) is redundant because it has the same confidence as rules 1 and 2. Rule 11 is redundant because it contains a combination of conditions appearing in rule 8 and has a lower confidence than rule 8.<\/p>\n<h2 id=\"6fc7\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Intramural Binding<\/h2>\n<p id=\"d1ba\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Some parameters cover the same physical process: for example, IV12 and VV quantify a cyclone&#8217;s past intensity change. Others are all associated with vertical shear: U200, SHRD, SRV0 and SRLA. When the values of one parameter can be used to predict the values of another, they are said to be <strong>intramurally bound<\/strong>.<\/p>\n<p id=\"494a\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Association hyper-edges were generated using the same <a class=\"au ko\" href=\"https:\/\/pdfs.semanticscholar.org\/f141\/ed5068df835e2090358e8e2d3250149d2444.pdf\" target=\"_blank\" rel=\"noopener ugc nofollow\">association rule mining algorithm<\/a>\u00a0to reveal these links. The results showed a perfect link between IV12 and VV for all categories except the MH group and a strong link between U200, SHRD, SRV0 and SRLA. This is another interesting result of this data mining technique.<\/p>\n<h2 id=\"3750\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Results<\/h2>\n<p id=\"928c\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">This study showed that the rule mining association can be used successfully in geoscience.<\/p>\n<ul class=\"\">\n<li id=\"765f\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Ease of interpretation of results.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Combinations of parameters were found, allowing to group conditions favouring the intensification or weakening of cyclones.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">A faster northward movement of cyclones favours the intensification of tropical storms but not hurricanes.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Intensifying tropical storms are more strongly associated with high convergence in the upper atmosphere (200-hPa relative eddy momentum flux convergence) than weakening ones while intensifying hurricanes are more strongly associated with lower convergence values.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The combinations of conditions identified provide higher probabilities of intensification\/weakening than those based on a single condition (typical of traditional statistical studies).<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">This study will lead to improved cyclone intensity prediction (TC).<\/li>\n<\/ul>\n<\/div>\n<div class=\"o dz mx my ii mz\" role=\"separator\"><\/div>\n<div class=\"iw ix iy iz ja\">\n<h2 id=\"4c5d\" class=\"ln lo jd bn lp lq ne ls lt lu nf lw lx ly ng ma mb mc nh me mf mg ni mi mj mk gi\">A Hierarchical Pattern Learning Framework for Forecasting Extreme Weather Events<\/h2>\n<blockquote class=\"mq mr ms\">\n<p id=\"cca6\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Wang, D., &amp; Ding, W. (2015, November). A hierarchical pattern learning framework for forecasting extreme weather events. In\u00a0Data Mining (ICDM), 2015 IEEE International Conference on\u00a0(pp. 1021\u20131026). IEEE.<\/p>\n<\/blockquote>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu oy\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/850\/1*ZzaaG3G9kZHJB2TBJduFOg.png\" alt=\"\" width=\"425\" height=\"551\" \/><\/div>\n<\/figure>\n<h2 id=\"b0a3\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Objective<\/h2>\n<p id=\"5561\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Design a model to discover structures (patterns) within a spatiotemporal system to predict extreme weather events. The structures are discovered hierarchically, i.e. at each level of learning, new contextual features are learned and used by the next level. Several difficulties must be overcome to deal with such a system:<\/p>\n<ul class=\"\">\n<li id=\"a70a\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The massive size of the feature space.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The presence of complex structures within the spatiotemporal system.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The strict requirements for the interpretability of the model: We want to understand, not just predict.<\/li>\n<\/ul>\n<h2 id=\"5ba4\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Summary<\/h2>\n<ol class=\"\">\n<li id=\"8bab\" class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm pc oi oj ok gi\" data-selectable-paragraph=\"\">Summarize the temporal evolution of individual variables. At each position, the temporal changes of a parameter are generalized into a single feature.<\/li>\n<li class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm pc oi oj ok gi\" data-selectable-paragraph=\"\">Summarize spatial relationships to assemble singular features into groupings.<\/li>\n<li class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm pc oi oj ok gi\" data-selectable-paragraph=\"\">Summarize intervariable relationships to predict extreme events.<\/li>\n<\/ol>\n<h2 id=\"8e32\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Definitions<\/h2>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pd\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1134\/1*xGyAn9pH6UABBuL8fFZKcw.png\" alt=\"\" width=\"567\" height=\"192\" \/><\/div>\n<\/figure>\n<p id=\"173c\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Features and feature sets<\/strong><\/p>\n<p id=\"999c\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A <strong>feature<\/strong> is a tuple <em>{L, T, V}. V<\/em> is a domain variable. <em>L<\/em> indicates the position<em> (x, y)<\/em>, and <em>T<\/em> indicates the sampling time.<\/p>\n<p id=\"b2d6\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A <strong>feature<\/strong> <strong>set<\/strong> is a variable sampled over a time range. A variable sampled from a time T1 to T4 will be considered a feature set of the form <em>{V1, V2, V3, V4}<\/em>.<\/p>\n<p id=\"87fe\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Patterns and location-based patterns<\/strong><\/p>\n<p id=\"e442\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A <strong>pattern X<\/strong> is a set of feature-value pairs corresponding to a set of features. It is a rule constructed from possible feature values for a certain set. For example,<em> X = {&lt; V1, 1 &gt;, &lt; V2, 0 to 4 &gt;, &lt; V3, 1 or 2 &gt;, &lt; V1, 3 &gt;}<\/em>.<\/p>\n<p id=\"3a6d\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A<strong> location-based<\/strong> <strong>pattern<\/strong> has the form of a tuple <em>{X, L}<\/em>. X represents a pattern, and <em>L<\/em> contains the spatial information of the pattern.<\/p>\n<p id=\"0041\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Support and growth ratio<\/strong><\/p>\n<p id=\"fca5\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A pattern <em>X<\/em> is supported by an instance I from a dataset<em> D<\/em> if the feature values of the instance <em>I<\/em> conform to the rule specified by the pattern<em> X<\/em>. The support of a pattern <em>X<\/em> is the number of instances <em>I<\/em> from a dataset <em>D<\/em> that support it, divided by the total number of instances <em>I<\/em> in <em>D<\/em>. In other words, the support is an indication of how often a pattern <em>X<\/em> appears in a data set <em>D<\/em>.<\/p>\n<p id=\"e530\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">If we divide <em>D<\/em> into two partitions <em>{Dp, Dn}<\/em>, the growth ratio of a pattern <em>X<\/em> is the ratio of the support of <em>X<\/em> in the partition <em>Dp<\/em> to the support of <em>X<\/em> in the partition <em>Dn<\/em>.<\/p>\n<p id=\"1d5a\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Feature of a pattern<\/strong><\/p>\n<p id=\"cff9\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The <strong>feature of a pattern X<\/strong> is a binary variable indicating if the pattern is present or not in an instance I.<\/p>\n<h2 id=\"7bc4\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Method<\/h2>\n<p id=\"f09c\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Learning location-based patterns<\/strong><\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu oy\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/850\/1*NEnI-FQcgPQXjnLdGxUhuw.png\" alt=\"\" width=\"425\" height=\"491\" \/><\/div>\n<\/figure>\n<p id=\"b8a9\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The first algorithm based on contrast pattern mining is used to learn location-based patterns. To begin with, the dataset is partitioned into m subsets, each containing one variable. The location-based patterns are then learned (line 3). Pattern sets are generated separately for each subset and each position within them (lines 4-8). At each position, the learned pattern sets are generalized into a representative singular pattern, which is then transformed (line 9). For example, for a position<em> (x, y)<\/em>, the set of patterns<em> {p1, p2, &#8230;}<\/em> of a variable <em>V<\/em> represents the set of revealing temporal changes that occurred more frequently than others in a dataset partition.<\/p>\n<p id=\"bd18\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Forming spatial groupings<\/strong><\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu oy\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/850\/1*EuRW6-8IVcB4xdZofNWXSg.png\" alt=\"\" width=\"425\" height=\"486\" \/><\/div>\n<\/figure>\n<p id=\"fb01\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">For this second algorithm, the spatial regularities of the system are generalized by expanding the previously learned location-based patterns into spatial clusters. Each variable is treated separately (lines 2-3). For a variable <em>V1<\/em>, a feature <em>F<\/em> is retrieved from its set of location-based patterns (line 5), and a set <em>N<\/em> containing all spatial neighbours of <em>F<\/em> is created (line 6). For all features <em>f<\/em> in <em>N<\/em>, two conditions are tested by linking with the support and growth ratio of the joined structure <em>f \u2229 F<\/em> (line 9). If the conditions are satisfied, the patterns <em>f<\/em> and <em>F<\/em> are combined into a new pattern, and the neighbour set is updated by adding the neighbours of <em>f<\/em> in <em>N<\/em> (lines 10-11).<\/p>\n<p id=\"7797\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Provide for by classification<\/strong><\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu oy\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/850\/1*UdlHflu8Tg0WDzS8epCbGQ.png\" alt=\"\" width=\"425\" height=\"381\" \/><\/div>\n<\/figure>\n<p id=\"e545\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The final algorithm examines the interactions between variables by developing a <strong>S<\/strong>patial cluster <strong>P<\/strong>attern-based <strong>C<\/strong>lassifier (SPC), an instance-based learning algorithm. An instance is classified by analyzing its patterns within spatial clusters (lines 3-4) and calculating the pattern growth ratio (lines 5-9). The instance is positively classified if it is higher than the predefined threshold.<\/p>\n<h2 id=\"a491\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Results<\/h2>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu oy\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/850\/1*qlIwFAAN1FBrvP9NH5S8kQ.png\" alt=\"\" width=\"425\" height=\"617\" \/><\/div>\n<\/figure>\n<ul class=\"\">\n<li id=\"b838\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The patterns learned from the approach in this paper are consistent with knowledge from studies in the same field.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Increasing the support thresholds and growth ratio increased the F1 value.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Predicting by using spatial clustering features (SCFs) yielded better results than simply using location-based pattern features (LPFs).<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm oh oi oj ok gi\" data-selectable-paragraph=\"\">SCFs reduce the risk of overfitting and provide a better growth ratio to the benefit of support.<\/li>\n<\/ul>\n<\/div>\n<div class=\"o dz mx my ii mz\" role=\"separator\"><\/div>\n<div class=\"iw ix iy iz ja\">\n<h2 id=\"b71b\" class=\"ln lo jd bn lp lq ne ls lt lu nf lw lx ly ng ma mb mc nh me mf mg ni mi mj mk gi\">A spatiotemporal mining framework for abnormal association patterns in marine environments with a time series of remote sensing images<\/h2>\n<blockquote class=\"mq mr ms\">\n<p id=\"d350\" class=\"kp kq mt kr b ks kt ku kv kw kx ky kz mu lb lc ld mv lf lg lh mw lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Xue, C., Song, W., Qin, L., Dong, Q., &amp; Wen, X. (2015). A spatiotemporal mining framework for abnormal association patterns in marine environments with a time series of remote sensing images. International Journal of Applied Earth Observation and Geoinformation, 38, 105\u2013114.<\/p>\n<\/blockquote>\n<h2 id=\"2f45\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Objectives<\/h2>\n<p id=\"719a\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Design a spatiotemporal data mining framework for marine association structures using multiple remote sensing images. The goal is to detect anomalous events based on pixel- and object-level models.<\/p>\n<h2 id=\"84c6\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Difficulties<\/h2>\n<p id=\"7f61\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\">Considering the massive size of the feature space, as each pixel has its own spatiotemporal association pattern, finding an efficient way to discover marine spatiotemporal associations on a pixel-by-pixel basis is essential. The second problem is that these pixels must also be grouped to form analyzable objects.<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pd\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1134\/1*_gDzDB1E_shMv2wV5b2bpw.png\" alt=\"\" width=\"567\" height=\"527\" \/><\/div>\n<\/figure>\n<p id=\"0a83\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Two catalogues of association patterns are thus generated: in <strong>the same region<\/strong> (pixel, singular cell) and <strong>between different regions<\/strong> (object, grouping of pixels).<\/p>\n<p id=\"923b\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The spatial relationship of the association patterns in the first catalogue is simply the spatial positioning of the pixels. Despite the small contribution of these patterns to spatial relationships, the use of pixels alone is more suited to representing large-scale spatiotemporal association patterns. In the context of this study, this catalogue allows the analysis of the ocean as a whole.<\/p>\n<p id=\"b26f\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">For the second catalogue, the association patterns between regions with uniform marine properties (marine objects) provide more information on spatial relationships: spatial positioning, distance, direction and topology. This one will focus on specific marine regions compared to the first catalogue. The two catalogues are thus complementary in the search for spatiotemporal association patterns.<\/p>\n<h2 id=\"f384\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Summary<\/h2>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pe\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1350\/1*5A-rh0Wb8-eSZvt6I2rKXQ.png\" alt=\"\" width=\"675\" height=\"536\" \/><\/div>\n<\/figure>\n<ol class=\"\">\n<li id=\"d90b\" class=\"oc od jd kr b ks kt kw kx la oe le of li og lm pc oi oj ok gi\" data-selectable-paragraph=\"\">Pretreatment and representation of data by a transaction table.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm pc oi oj ok gi\" data-selectable-paragraph=\"\">Spatiotemporal data mining algorithm to generate association patterns.<\/li>\n<li class=\"oc od jd kr b ks kt kw kx la oe le of li og lm pc oi oj ok gi\" data-selectable-paragraph=\"\">Knowledge visualization.<\/li>\n<\/ol>\n<h2 id=\"3b99\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Method<\/h2>\n<p id=\"1c2b\" class=\"pw-post-body-paragraph kp kq jd kr b ks ml ku kv kw mm ky kz la mn lc ld le mo lg lh li mp lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Data pretreatment and representation<\/strong><\/p>\n<p id=\"fe78\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Eliminating periodic variability<\/em><\/p>\n<p id=\"b676\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Marine parameters are subject to <strong>seasonal variations<\/strong>: it is, therefore essential to remove this component to normalize the data before identifying abnormal events. To do this, the z-score, calculated monthly, is used. It indicates how many standard deviations an environmental parameter is from the mean and standardizes the value of this parameter accordingly.<\/p>\n<p id=\"0708\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Extracting anomalous objects<\/em><\/p>\n<p id=\"9166\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Regions sensitive to climate change are identified as <strong>anomalous regions<\/strong> and represented as objects in the context of this study. The ENSO (El Ni\u00f1o-Southern Oscillation) signal represents global climate variability, and the parameters influenced by this signal are identified. To do so, it is necessary to analyze whether the time component of a given physical factor is the same as that of ENSO (4-7 years) but also its spatial pattern. Subsequently, these regions sensitive to climate change and having the same associated spatial pattern are grouped as anomalous objects.<\/p>\n<p id=\"2cf2\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Discretizing marine parameters<\/em><\/p>\n<p id=\"0ea0\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Discretization<\/strong> consists in transforming marine parameters in real-number format into categories. The variability of the marine environment generally is Gaussian distributed. Consequently, the parameters are discretized into five classes, from -2 to +2 (severe negative change, weak negative change, no change, weak positive change, and severe positive change).<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pf\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/908\/1*SbSZcRsEAiLE8e2AUK3BHQ.png\" alt=\"\" width=\"454\" height=\"156\" \/><\/div>\n<\/figure>\n<p id=\"0e16\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Where <em>P(i,j)<\/em> and<em> O<\/em> are, respectively, pixel position and instances of anomalous objects,<em> Vi,j<\/em> and <em>fRank(Vi,j)<\/em> are the raw value and class of a pixel, <em>Vo<\/em> and <em>fRank(Vo)<\/em> are the raw value and class of an anomalous object, \ud835\udeff and \ud835\udeffo are the standard deviation of a time series of a pixel and an anomalous object.<\/p>\n<p id=\"1a4f\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Generating transaction tables<\/em><\/p>\n<p id=\"5ae4\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">An <strong>exploration table<\/strong> is generated for analysis at each level of granularity: pixels and objects. The first model can explore spatiotemporal association patterns among marine environmental parameters within pixels.<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pf\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/908\/1*a7cbtEvq4ssGBiVHAwvUsw.png\" alt=\"\" width=\"454\" height=\"239\" \/><\/div>\n<\/figure>\n<p id=\"4374\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The second model allows us to discover the spatiotemporal association patterns between marine regions.<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu pg\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*7BMDv7pn7n8rA7y681GnMg.png\" alt=\"\" width=\"700\" height=\"115\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"9292\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Spatiotemporal data mining algorithm<\/strong><\/p>\n<p id=\"f5d4\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">This algorithm is based on <strong>mutual information<\/strong>, i.e. the amount of information one item provides about another. The items are represented by the environmental parameters and the ENSO index (tables 1 and 2). The ENSO index is used to categorize the marine parameters according to the degree of influence of La Ni\u00f1a (-2) and El Ni\u00f1o (+2) on their evolution.<\/p>\n<p id=\"2b76\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Association pattern mining algorithms find rules that define relationships between items in two steps. First, they discover <strong>frequent item<\/strong> patterns from the transaction tables using minimal <strong>support<\/strong>. Second, they generalize the association rule from a predefined <strong>confidence<\/strong> level. Only<strong> related items<\/strong>, rather than the complete set, are involved in the search for frequent items.<\/p>\n<p id=\"fce4\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Extracting related items<\/em><\/p>\n<p id=\"9322\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Items are considered related or not based on their normalized mutual information. The mathematical approach will be set aside for ease of reading. Related items are extracted to provide candidates for the set of frequent items, which will be used to find association patterns. Not all related items are retained to be frequent; to do so, they must reach a certain relationship threshold. The frequent item set contains all the item patterns that appear frequently enough, according to a threshold, in the data set.<\/p>\n<p id=\"8c0d\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Generating association pattern trees<\/em><\/p>\n<p id=\"8eec\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">A <strong>direct association pattern tree<\/strong> represents the association patterns of two or more marine environmental parameters, thus providing a foundation for implementing a space-time exploration algorithm. A recursive method is applied to build this tree.<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"kf kg dq kh cf ki\" tabindex=\"0\" role=\"button\">\n<div class=\"gt gu ph\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1400\/1*ryjbpIsqaTqEbiKCLQ9gAA.png\" alt=\"\" width=\"700\" height=\"217\" \/><\/div>\n<\/div>\n<\/figure>\n<p id=\"74e8\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><em>Discovering the rules of association<\/em><\/p>\n<p id=\"fad0\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Generating <strong>association rules<\/strong> allows the discovery of frequent items originating from direct association pattern trees. To achieve this, transaction tables and direct association pattern trees are browsed using a recursive exploration algorithm. Specific evaluation indices are used to generalize strong association rules: support, confidence, lift and interest factors.<\/p>\n<p id=\"0679\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Visualizing spatiotemporal association patterns<\/strong><\/p>\n<p id=\"c403\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\"><strong>Spatiotemporal association structures<\/strong> are represented as follows:<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pi\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1112\/1*lh_jn3dJ64_18u_4jE133w.png\" alt=\"\" width=\"556\" height=\"38\" \/><\/div>\n<\/figure>\n<p id=\"8e43\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Where <em>AttrN<\/em> is a marine environmental parameter of a pixel or object, <em>q<\/em> is the quantitative level (-2 to +2) of this parameter, <em>t<\/em> is the time of occurrence of the attribute <em>Attr1<\/em>, <em>{t1, t2 and tn}<\/em> are the time differences from <em>t<\/em> when the other attributes occur, <em>s%<\/em>, <em>c%<\/em> and<em> l%<\/em> are the evaluation indicators used to identify revealing association patterns.<\/p>\n<p id=\"0b4d\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">Although the association structures between objects and pixels are represented similarly, their spatial relationships differ. The spatial relationships between marine objects are implicit; topology, distance, and direction are all obtained from the objects. Spatial relationships between pixels are not as implicit; each pixel may have multiple spatiotemporal association patterns between two or more attributes.<\/p>\n<p id=\"e5ca\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">The spatiotemporal association patterns are represented on the following thematic maps. Note that the white regions define the different continents and the coloured regions the different association patterns in the Pacific Ocean. Map (a) shows the distribution of two-dimensional patterns, and (b) the distribution of three-dimensional patterns, as demonstrated by the association pattern tree.<\/p>\n<p id=\"74d8\" class=\"pw-post-body-paragraph kp kq jd kr b ks kt ku kv kw kx ky kz la lb lc ld le lf lg lh li lj lk ll lm iw gi\" data-selectable-paragraph=\"\">For example, in Figure (a), we can see that when La Ni\u00f1a is strong, the ocean surface temperature (SSTA) drops drastically in the blue region.<\/p>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pd\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1134\/1*9JPQIAoaIpO-5SB6BRpbEA.png\" alt=\"\" width=\"567\" height=\"391\" \/><\/div>\n<\/figure>\n<figure class=\"ny nz oa ob hf ke gt gu paragraph-image\">\n<div class=\"gt gu pd\"><img loading=\"lazy\" decoding=\"async\" class=\"cf kj kk\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/max\/1134\/1*EfL2i9RHu11DnwvhT61pUQ.png\" alt=\"\" width=\"567\" height=\"403\" \/><\/div>\n<\/figure>\n<h2 id=\"923d\" class=\"nj lo jd bn lp nk nl nm lt nn no np lx la nq nr mb le ns nt mf li nu nv mj nw gi\" data-selectable-paragraph=\"\">Results<\/h2>\n<ul class=\"\">\n<li id=\"4e80\" class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm oh oi oj ok gi\" data-selectable-paragraph=\"\">The data mining algorithm based on mutual information is superior to the traditional Apriori algorithm in cases where more parameters and classes are used during discretization.<\/li>\n<li class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Two proposed strategies: one model at pixel level and the other at object level. The two complement each other: the first explores large-scale marine features, and the second focuses on specific regions.<\/li>\n<li class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Many problems were solved: extraction of associated items (attributes or objects), construction of direct association pattern trees, design of the exploration algorithm, optimal support threshold and innovative visualization of association patterns.<\/li>\n<li class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm oh oi oj ok gi\" data-selectable-paragraph=\"\">Compared to traditional spatiotemporal analyses, the information obtained from this exploration framework is much more detailed and precise in space and time.<\/li>\n<li class=\"oc od jd kr b ks ml kw mm la oz le pa li pb lm oh oi oj ok gi\" data-selectable-paragraph=\"\">This framework allows for a better understanding of the variation of marine parameters in different areas: how, when and where specific parameters drive the variation of other parameters or respond to the variation of others.<\/li>\n<\/ul>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>This week, I took an interest in spatiotemporal data. There are several reasons why I have been looking into this subject. First, the City of Montreal makes available open data related to different spheres of activity: economy and business, education, health, society and culture. The sector that is of particular interest to me for Montreal [&hellip;]<\/p>\n","protected":false},"author":409,"featured_media":16700,"menu_order":0,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":"","_links_to":"","_links_to_target":""},"mots_cles":[531,454,530,532],"categorie_blogue":[457],"class_list":["post-21793","blogue","type-blogue","status-publish","format-standard","has-post-thumbnail","hentry","mots_cles-anomaly-detection","mots_cles-data-mining-en","mots_cles-predictive-analysis","mots_cles-spatiotemporal-data","categorie_blogue-data-science"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/blogue\/21793","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/blogue"}],"about":[{"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/types\/blogue"}],"author":[{"embeddable":true,"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/users\/409"}],"version-history":[{"count":6,"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/blogue\/21793\/revisions"}],"predecessor-version":[{"id":21799,"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/blogue\/21793\/revisions\/21799"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/media\/16700"}],"wp:attachment":[{"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/media?parent=21793"}],"wp:term":[{"taxonomy":"mots_cles","embeddable":true,"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/mots_cles?post=21793"},{"taxonomy":"categorie_blogue","embeddable":true,"href":"https:\/\/www.crim.ca\/en\/wp-json\/wp\/v2\/categorie_blogue?post=21793"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}