<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Big Data</journal-id>
<journal-title>Frontiers in Big Data</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Big Data</abbrev-journal-title>
<issn pub-type="epub">2624-909X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fdata.2022.1049198</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Big Data</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Spatial data analysis for intelligent buildings: Awareness of context and data uncertainty</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Li</surname> <given-names>Huan</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1873964/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Liu</surname> <given-names>Tiantian</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2070444/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Chan</surname> <given-names>Harry Kai-Ho</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2070382/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Lu</surname> <given-names>Hua</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2060272/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Computer Science, Aalborg University</institution>, <addr-line>Aalborg</addr-line>, <country>Denmark</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of People and Technology, Roskilde University</institution>, <addr-line>Roskilde</addr-line>, <country>Denmark</country></aff>
<aff id="aff3"><sup>3</sup><institution>Information School, University of Sheffield</institution>, <addr-line>Sheffield</addr-line>, <country>United Kingdom</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Feng Chen, Dallas County, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Xujiang Zhao, NEC Laboratories America Inc., United States; Yanjie Fu, University of Central Florida, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Huan Li <email>lihuan&#x00040;cs.aau.dk</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Data Mining and Management, a section of the journal Frontiers in Big Data</p></fn></author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>11</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>5</volume>
<elocation-id>1049198</elocation-id>
<history>
<date date-type="received">
<day>20</day>
<month>09</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>19</day>
<month>10</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Li, Liu, Chan and Lu.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Li, Liu, Chan and Lu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<p>Intelligent buildings are among the most active Internet-of-Things (IoT) verticals, encompassing various IoT-enabled devices and sensing technologies for digital transformation. Analysis of spatial data, a very common type of data collected in intelligent buildings, offers a lot of insights for many purposes such as facilitating space management and enhancing the utilization efficiency of buildings. In this paper, we recognize two major challenges in spatial data analysis for intelligent buildings (SDAIB): (1) the complicated analytical contexts that are related to the building space and internal entities and (2) the uncertainty of spatial data due to the limitations of positioning and other sensing technologies. To address these challenges, we identify and categorize different kinds of analytical contexts and spatial data uncertainties in SDAIB, and propose a unified modeling framework for handling them. Furthermore, we showcase how the proposed framework and the associated modeling techniques are used to enable context-aware and uncertainty-aware SDAIB, in the tasks of hotspot discovery, path planning, semantic trajectory generation, and distance monitoring. Finally, we offer several research directions of SDAIB, in line with the emerging trends of the IoT.</p></abstract>
<kwd-group>
<kwd>spatial data uncertainty</kwd>
<kwd>context-aware computing</kwd>
<kwd>indoor spaces</kwd>
<kwd>smart buildings</kwd>
<kwd>mobility analysis</kwd>
<kwd>IoT data quality</kwd>
</kwd-group>
<contract-sponsor id="cn001">HORIZON EUROPE Marie Sklodowska-Curie Actions<named-content content-type="fundref-id">10.13039/100018694</named-content></contract-sponsor>
<contract-sponsor id="cn002">Danmarks Frie Forskningsfond<named-content content-type="fundref-id">10.13039/501100011958</named-content></contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="1"/>
<equation-count count="1"/>
<ref-count count="88"/>
<page-count count="20"/>
<word-count count="14015"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Driven by the popularity of the Internet of Things (IoT) infrastructure, the digital transformation in the architecture industry is experiencing rapid growth. As a well-recognized notation, <italic>intelligent buildings</italic> (or smart buildings) (Qolomany et al., <xref ref-type="bibr" rid="B68">2019</xref>; Daissaoui et al., <xref ref-type="bibr" rid="B20">2020</xref>) formulate a paradigm that integrates various IoT-enabled devices, sensing technologies, and big data analysis technologies for achieving improved efficiency of building uses and enhanced comfort of residents. According to a report<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref> by the research firm MarketsandMarkets, the global market of intelligent buildings is expected to grow from 66.3 billion in 2020 to 108.9 billion in 2025, at a compound annual growth rate of 10.5%.</p>
<p>Intelligent buildings are thought of as an ecosystem made up of both hardware and software (Daissaoui et al., <xref ref-type="bibr" rid="B20">2020</xref>). Based on this, <xref ref-type="fig" rid="F1">Figure 1</xref> presents an overall picture of intelligent buildings, which includes three key components, namely the IoT-enhanced buildings (lower left corner), the data analysis platform (lower right corner), and the intelligent building applications (upper side). In particular, the <italic>IoT-enhanced buildings</italic> encapsulate a wide variety of IoT-enabled devices such as sensors, actuators, and robots that bridge the physical building space and the cyber space. As the underlying infrastructure, the IoT-enhanced buildings employ various sensing technologies to collect data, which are then sent to the data analysis platform. The <italic>data analysis platform</italic> plays the central role in the ecosystem, extracting useful information and generating control messages from the collected data, which in turn facilitate the sensing and interactions within the IoT-enhanced buildings. In the data analysis platform, the fundamental data management module is responsible for data preprocessing, modeling, and storage, on which various data analysis tasks are built. Finally, on top of the IoT-enhanced buildings and the data analysis platform, <italic>intelligent building applications</italic> are designed and tailored for many different building types like hospitals, office buildings, and transportation stations. Some typical applications are asset tracing (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>; Oztekin et al., <xref ref-type="bibr" rid="B62">2010</xref>; Kim et al., <xref ref-type="bibr" rid="B34">2014</xref>), HVAC (heating, ventilation, and air conditioning) control (Capozzoli et al., <xref ref-type="bibr" rid="B11">2017</xref>), and crowd evacuation (Chen and Feng, <xref ref-type="bibr" rid="B16">2009</xref>; Kamkarian and Hexmoor, <xref ref-type="bibr" rid="B31">2012</xref>; Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>An overall picture of intelligent buildings: IoT devices, data analysis, and applications.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fdata-05-1049198-g0001.tif"/>
</fig>
<p>The management of an intelligent building is often required to be aware of the statuses of many key places of the building. As a result, the data collected in intelligent buildings are often associated with spatial attributes. For example, a smart meter record contains the ID of the room that consumes energy (Ni et al., <xref ref-type="bibr" rid="B61">2016</xref>), while indoor positioning data usually captures a target object&#x00027;s whereabouts as a specific two- or three-dimensional geometric coordinate in the building (Liu et al., <xref ref-type="bibr" rid="B50">2007</xref>; Li et al., <xref ref-type="bibr" rid="B42">2020b</xref>). Correspondingly, a good number of fundamental data analyses for intelligent buildings are oriented to such spatial data, making efforts on facilitating space management and enhancing the utilization efficiency of the building. In this study, we pay particular attention to such <italic>spatial data analysis for intelligent buildings</italic> (SDAIB). Referring to the data analysis platform in <xref ref-type="fig" rid="F1">Figure 1</xref>, general data analysis tasks include anomaly detection, frequent pattern mining, and forecasting, while typical SDAIB tasks are hotspot discovery (Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>,<xref ref-type="bibr" rid="B45">c</xref>), path planning (Feng et al., <xref ref-type="bibr" rid="B22">2020</xref>; Chan et al., <xref ref-type="bibr" rid="B13">2021</xref>; Liu et al., <xref ref-type="bibr" rid="B51">2021a</xref>,<xref ref-type="bibr" rid="B53">c</xref>), semantic trajectory generation (Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>,<xref ref-type="bibr" rid="B42">b</xref>), and distance monitoring (Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>).</p>
<p>The past two decades have seen a lot of spatial data analysis research for different building types and different application scenarios (cf. literature reviews Cheema, <xref ref-type="bibr" rid="B14">2018</xref>; Qolomany et al., <xref ref-type="bibr" rid="B68">2019</xref>; Daissaoui et al., <xref ref-type="bibr" rid="B20">2020</xref>). Setting aside those application-specific technical difficulties, we identify two general challenges faced by SDAIB in regard to the fundamental data management, as presented in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<list list-type="bullet">
<list-item><p><bold>Extracting and exploiting complicated analytical contexts</bold>. In most SDAIB tasks, algorithms are required to be aware of the analytical context for extracting accurate knowledge and making correct decisions accordingly. For example, in the distance monitoring task, physical contexts such as windows, obstacles, and floors must be taken into account in the distance estimation. Otherwise, the obtained results are inaccurate or even useless (Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>; Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>). For another example, in the path planning task, if the algorithm can obtain real-time and accurate contextual information about the populations and flows in each room, it can find for users or robots the routes that avoid potential congestion and collisions&#x02014;such routes are practically more useful than those routes with the shortest distances (Liu et al., <xref ref-type="bibr" rid="B53">2021c</xref>). However, the analytical contexts in SDAIB are complicated and diverse, involving many aspects related to the building environment and the internal entities like human beings and sensory devices. Many existing works (Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>; Kamkarian and Hexmoor, <xref ref-type="bibr" rid="B31">2012</xref>; Lin et al., <xref ref-type="bibr" rid="B47">2016</xref>; Teng et al., <xref ref-type="bibr" rid="B72">2017</xref>; Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>, <xref ref-type="bibr" rid="B42">2020b</xref>; Liu et al., <xref ref-type="bibr" rid="B53">2021c</xref>) find relevant analytical contexts according to the need of the analytical task and design specific data structures for them. In this light, it is desirable to have a general mechanism for efficiently and effectively modeling different types of analytical contexts, in order to facilitate the data modeling and data storage for SDAIB tasks.</p></list-item>
<list-item><p><bold>Modeling and mitigating the uncertainty of spatial data</bold>. Indoor positioning technologies and other sensing technologies (such as Bluetooth- and RFID-based tracking) typically offer limited accuracy and low sampling rates, resulting in low quality of spatial data used for analyses (Liu et al., <xref ref-type="bibr" rid="B50">2007</xref>; Xie et al., <xref ref-type="bibr" rid="B79">2013</xref>; Lu et al., <xref ref-type="bibr" rid="B55">2016</xref>). In particular, the limited accuracy makes an object&#x00027;s observed location deviate, while the low sampling rate makes the object often unobserved. Both two issues enlarge the uncertainty of spatial data and thus lower the effectiveness of downstream SDAIB tasks. To make matters worse, buildings involve much more complicated analytical contexts in relatively small spaces compared to outdoors, further amplifying the negative effects of spatial data uncertainty. For example, in the semantic trajectory generation task, multiple indoor regions, such as shops in a mall, are close to each other and are only segmented by walls, and a positioning deviation of merely a few meters will likely cause a mis-annotation of the target object&#x00027;s located region (Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>,<xref ref-type="bibr" rid="B42">b</xref>). In the hotspot discovery task, the unique indoor topology and the low-sampling issue incur multiple possible paths for a moving object, which complicate the computation and reduce the accuracy of the computation results (Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>). Modeling and reducing spatial data uncertainty have been widely studied for Euclidean space and road networks (Z&#x000FC;fle, <xref ref-type="bibr" rid="B87">2021</xref>). However, those data preprocessing techniques fall short in the intelligent building setting that is characterized by the unique indoor topology and indoor positioning issues. Notably, addressing the challenges of spatial data uncertainty relies on appropriate handling of the analytical contexts (see <xref ref-type="fig" rid="F2">Figure 2</xref>). This is because the contexts involved in intelligent buildings provide key prior knowledge for modeling and reducing spatial data uncertainty; some examples have been provided later in Section 2.2.3. Therefore, it is considered a proper routine to model the context first and then the spatial data uncertainty.</p></list-item>
</list>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Two general data management challenges in SDAIB.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fdata-05-1049198-g0002.tif"/>
</fig>
<p>In this paper, we will introduce the recent progress in addressing these two general challenges in SDAIB. To be specific, Section 2 will review the recent efforts to handle analytical contexts and spatial data uncertainties for SDAIB, respectively. Based on a classification of the existing works and a summary of their technical highlights, we will propose a unified framework for accommodating modeling techniques for a variety of analytical contexts and spatial data uncertainties in Section 3. Subsequently, Section 4 will select four representative SDAIB tasks and demonstrate the usage of the proposed modeling framework for context-aware and uncertainty-aware SDAIB. The four tasks are hotspot discovery, path planning, semantic trajectory generation, and distance monitoring. In Section 5, we will discuss the open issues and emerging opportunities of SDAIB in the new technology ecosystem. We will finally conclude the paper in Section 6.</p>
</sec>
<sec id="s2">
<title>2. Modeling techniques in SDAIB</title>
<p>We will revisit and summarize relevant techniques for modeling analytical contexts and spatial data uncertainties in Sections 2.1 and 2.2, respectively.</p>
<sec>
<title>2.1. Modeling analytical contexts</title>
<p>The analytical contexts refer to a set of important data that exists along with the analytical process, describing the states of related objects in processing, the ancillary information, the requirements of the analysis task, etc. <xref ref-type="fig" rid="F3">Figure 3</xref> presents a non-exhaustive taxonomy of the analytical contexts in SDAIB, in which we categorize analytical contexts into the contexts related to the building space and those related to the internal entities in the building. Note that we exclude the human-centered contexts (e.g., user preference and sentiment) because they are considered not specific to spatial data analyses. In each category, we further distinguish the <italic>dynamic (analytical) contexts</italic> from those static contexts. The former evolves with the analytical process while the latter does not. We proceed to go through different kinds of analytical contexts in SDAIB and describe their scope and recent efforts in handling them.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>A non-exhaustive taxonomy of the analytical contexts in SDAIB.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fdata-05-1049198-g0003.tif"/>
</fig>
<sec>
<title>2.1.1. Building space</title>
<sec>
<title>2.1.1.1. Static contexts of building space</title>
<p>The building environment, whose geometric information (e.g., the dimensions of the whole building or a region, and the position of a particular door), topological information (e.g., the connectivity or accessibility between adjacent regions and floors), and semantic information (e.g., the name of a region and the maximum capacity of a region), are all fundamental to SDAIB (Worboys, <xref ref-type="bibr" rid="B77">2011</xref>). Therefore, building space (or indoor space) models are usually hybrid, in which the basic data structure (Afyouni et al., <xref ref-type="bibr" rid="B1">2012</xref>) is constructed based on either of the three aspects whereas the other two are extended on the basic data structure.</p>
<p>To represent two- or three-dimensional <italic>geometric information</italic> of the <italic>building elements</italic> (e.g., doors, windows, rooms, and staircases), the lattices (Elfes, <xref ref-type="bibr" rid="B21">1989</xref>; Li and Lee, <xref ref-type="bibr" rid="B40">2008</xref>; Lin et al., <xref ref-type="bibr" rid="B49">2017</xref>), tetrahedrons (Penninga et al., <xref ref-type="bibr" rid="B65">2006</xref>), polygonal prisms (Kim et al., <xref ref-type="bibr" rid="B33">2009</xref>), Voronoi tessellation (Choset and Burdick, <xref ref-type="bibr" rid="B18">2000</xref>), boundary-based representation (Crowley, <xref ref-type="bibr" rid="B19">1989</xref>) have been exploited, on top of which tree-based (Li and Lee, <xref ref-type="bibr" rid="B40">2008</xref>) or graph-based (Penninga et al., <xref ref-type="bibr" rid="B65">2006</xref>; Kim et al., <xref ref-type="bibr" rid="B33">2009</xref>) indexes can be built to accelerate spatial search.</p>
<p><italic>Topological information</italic> captures the relationship between the building elements, for which the graph structure (Lee, <xref ref-type="bibr" rid="B39">2004</xref>) is the most commonly used. Typically, <italic>building partitions</italic> (e.g., a room or a region) are represented by graph nodes, while their interactions, such as adjacency and intersection (Lee, <xref ref-type="bibr" rid="B39">2004</xref>; Kim et al., <xref ref-type="bibr" rid="B33">2009</xref>; Lin et al., <xref ref-type="bibr" rid="B49">2017</xref>), mutual visibility Franz et al. (<xref ref-type="bibr" rid="B23">2005</xref>), connecting doors (Lu et al., <xref ref-type="bibr" rid="B54">2012</xref>; Xie et al., <xref ref-type="bibr" rid="B80">2014</xref>), passed sensors (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>; Teng et al., <xref ref-type="bibr" rid="B72">2017</xref>), passed positioning reference points (Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>), are recorded at graph edges. In some settings, considering the search efficiency, a dual transformation of the above structure is applied. For example, a <italic>door graph</italic> (Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>) with doors as nodes and spatial information between doors as edges is used to facilitate the calculation of the minimum indoor walking distance; a <italic>deployment graph</italic> (Baba et al., <xref ref-type="bibr" rid="B6">2014</xref>) capturing RFID readers as nodes and passed region(s) between two readers as edges is designed for cleaning RFID reader ID sequences. Recently, some work considers indexing topological information to improve computation efficiency, e.g., VIP-tree (Shao et al., <xref ref-type="bibr" rid="B69">2016</xref>) is a hierarchical model on the connectivity of rooms to speed up the shortest path finding.</p>
<p><italic>Semantic information</italic> is indispensable in many indoor location-based services, such as navigation (Li and Lee, <xref ref-type="bibr" rid="B40">2008</xref>; Lin et al., <xref ref-type="bibr" rid="B48">2013</xref>; Kang and Li, <xref ref-type="bibr" rid="B32">2017</xref>) and door access control (Bhatt et al., <xref ref-type="bibr" rid="B10">2009</xref>). Pure semantic models for buildings are often object-oriented, using UML-like languages [such as CityGML (Kolbe et al., <xref ref-type="bibr" rid="B35">2005</xref>) and indoorGML (Kang and Li, <xref ref-type="bibr" rid="B32">2017</xref>)], task-specific ontologies (Bhatt et al., <xref ref-type="bibr" rid="B10">2009</xref>; Wu et al., <xref ref-type="bibr" rid="B78">2018</xref>), IFC (International Foundation Class) data formats (Lin et al., <xref ref-type="bibr" rid="B48">2013</xref>), etc. It is worth noting that semantic information is often associated with the building elements, e.g., names and functionality of rooms. Therefore, compared to maintaining a purely object-oriented semantic model, it would be more efficient to attach the semantic information to the building&#x00027;s spatial model which can be easily enhanced with a spatial index, as done in many existing works (Li and Lee, <xref ref-type="bibr" rid="B40">2008</xref>; Teng et al., <xref ref-type="bibr" rid="B72">2017</xref>; Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>, <xref ref-type="bibr" rid="B41">2020a</xref>; Feng et al., <xref ref-type="bibr" rid="B22">2020</xref>; Liu et al., <xref ref-type="bibr" rid="B51">2021a</xref>).</p>
</sec>
<sec>
<title>2.1.1.2. Dynamic contexts of building space</title>
<p>Since the building environment is dynamic, all three types of information described above may evolve during the analytical process. Relatively few studies have considered dynamic modeling of building space information:</p>
<list list-type="bullet">
<list-item><p><italic>Change of geometric information</italic>. To capture the change in the geometric information of the building elements, an event-based updating approach is often used, i.e., deleting outdated information and inserting new information. Correspondingly, the spatial index should be updated incrementally or fully (Li and Lee, <xref ref-type="bibr" rid="B40">2008</xref>; Lu et al., <xref ref-type="bibr" rid="B54">2012</xref>; Xie et al., <xref ref-type="bibr" rid="B80">2014</xref>; Liu et al., <xref ref-type="bibr" rid="B52">2021b</xref>). Since the geometric information changes infrequently, event-based updating is acceptable for most applications.</p></list-item>
<list-item><p><italic>Change of topological information</italic>. The topological relationship between building elements should be updated timely when particular events happen, such as the lock and unlock of a door, the combination or division of building partitions, and the block of a region. Otherwise, such changes may invalidate the computation results in services like real-time indoor navigation (Liu et al., <xref ref-type="bibr" rid="B51">2021a</xref>). Walton and Worboys (<xref ref-type="bibr" rid="B75">2012</xref>) model the static topological information using a bi-graphical model. In their topological model, the dynamic change is defined by a reaction rule consisting of a pair of a pattern to be changed and a resulting pattern. Xie et al. (<xref ref-type="bibr" rid="B80">2014</xref>) maintain for each building partition a set of pointers to other partitions, which can be updated dynamically. Also assuming a graph-based model, Liu et al. (<xref ref-type="bibr" rid="B51">2021a</xref>) propose both synchronous and asynchronous mechanisms to check the temporal variations of door&#x00027;s accessibility along the path planning process. This work also utilizes VIP-tree (Shao et al., <xref ref-type="bibr" rid="B69">2016</xref>) to index the topological information. To reflect the topological changes on the index, life span interval information is extended to the internal nodes of the tree.</p></list-item>
<list-item><p><italic>Change of semantic information</italic>. For most models that augment semantic information to a base spatial model, the event-based updating approach and the life span interval definition can handle the dynamic changes of building semantics. Recently, some studies (Teng et al., <xref ref-type="bibr" rid="B73">2018</xref>; Guo et al., <xref ref-type="bibr" rid="B24">2021</xref>) consider extracting updates of semantics from crowdsourced images and videos for building self-updating indoor semantic floorplan models.</p></list-item>
</list>
</sec>
</sec>
<sec>
<title>2.1.2. Internal entities</title>
<p>The internal entities that are interesting to SDAIB include, but are not limited to, stationary objects (e.g., appliances, furniture, obstacles, and indoor POIs), moving objects (e.g., pedestrians, robots, and vehicles), sensors (e.g., smoke detectors and RFID readers), and positioning reference points<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref>.</p>
<sec>
<title>2.1.2.1. Static contexts of internal entities</title>
<p>For internal entities, the approaches to modeling their static geometric, topological, and semantic information can follow those for the building elements presented in Section 2.1.1. In general, object-oriented models can be used to maintain various aspects of information of each internal entity, including UML-like languages and IFC data formats. For example, IFC data format (Lin et al., <xref ref-type="bibr" rid="B48">2013</xref>) can represent the furnishing objects and facilities in a building. Moreover, for search and storage efficiency, the maintained internal entities are often attached to the existing spatial model of the building space, according to the spatial relationship (e.g., intersection and containment) between the internal entity and the building space. To realize this, pointers, linked lists, and hash tables are used. For example, the composite indoor index (Xie et al., <xref ref-type="bibr" rid="B80">2014</xref>) maintains an object bucket for each of its building partitions and an object hashtable for quickly finding the pointer of the host partition of a particular object; the deployment graph (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>) links doors and partitions to the two-dimensional location, activation range, and types of each deployed RFID reader.</p>
</sec>
<sec>
<title>2.1.2.2. Dynamic contexts of internal entities</title>
<p>In practice, internal entities feature dynamic changes. Therefore, a good body of modeling techniques has been proposed to handle different kinds of dynamic contexts related to internal entities. Several important dimensions are introduced below.</p>
<sec>
<title>2.1.2.2.1. Object movement</title>
<p>For the real-time position changes of moving objects, some works (Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>; Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>) consider updating the mappings between objects and the building partitions to which they belong. However, this mechanism is not suitable for analyzing large-scale historical trajectory data as it may lead to high accumulated processing latency. Therefore, indexing techniques are usually applied to historical trajectories generated by indoor moving objects, either based on R-trees (Jensen et al., <xref ref-type="bibr" rid="B28">2009b</xref>; Alamri et al., <xref ref-type="bibr" rid="B3">2013</xref>; Lin et al., <xref ref-type="bibr" rid="B47">2016</xref>) or grid cells (Choi et al., <xref ref-type="bibr" rid="B17">2004</xref>). Such indexes facilitate the range and nearest neighbor queries, which lay the foundation for other complex spatial computations.</p>
</sec>
<sec>
<title>2.1.2.2.2. Behaviors and events</title>
<p>Becker et al. (<xref ref-type="bibr" rid="B7">2009</xref>) propose a multilayered space-event model for indoor navigation, in which the sensor space layer is built on top of the topographic space layer for maintaining the locations and ranges of installed transmitters and sensors. Likewise, Jin et al. (<xref ref-type="bibr" rid="B30">2012</xref>) design a conceptual modeling framework for indoor space, whose moving object layer supports tracking an indoor entity&#x00027;s visiting durations, moving patterns, and other behavioral information. Kim et al. (<xref ref-type="bibr" rid="B34">2014</xref>) pay particular attention to indoor facility management and propose a CityGML extension of modeling indoor facility features. Li et al. (<xref ref-type="bibr" rid="B41">2020a</xref>,<xref ref-type="bibr" rid="B42">b</xref>) model an indoor behavior as a triple [<italic>s, e</italic>, &#x003C4;], which captures an object&#x00027;s event <italic>e</italic>&#x02208;<italic>E</italic> in a semantic region <italic>s</italic>&#x02208;<italic>S</italic> during a particular time period &#x003C4;.</p>
</sec>
<sec>
<title>2.1.2.2.3. Populations and flows</title>
<p>In many tasks, such as path planning (Liu et al., <xref ref-type="bibr" rid="B53">2021c</xref>) and crowd evacuation (Kamkarian and Hexmoor, <xref ref-type="bibr" rid="B31">2012</xref>), algorithms need to know the populations of regions and the flow of doors in a building. To support the planning of pedestrian facilities, Lam et al. (<xref ref-type="bibr" rid="B37">2003</xref>) devise a generalized walking time function considering bi-directional flow distributions, which can be calibrated for various flow conditions. Aiming for finding populated locations using RFID-like tracking data, Ahmed et al. (<xref ref-type="bibr" rid="B2">2017</xref>) propose a hierarchical location-time index that maintains the number of distinct objects entering, exiting, and presence at a particular location at discrete timestamps or during time intervals. To enable finding a path with the least encountered objects or a path with the shortest travel time in the presence of crowds, Liu et al. (<xref ref-type="bibr" rid="B53">2021c</xref>) capture the flows of doors using a Poisson distribution based function.</p>
</sec>
</sec>
</sec>
<sec>
<title>2.1.3. Summary of modeling techniques for analytical contexts</title>
<p>To sum up, multiple aspects of analytical contexts have been considered and modeled in existing studies. In <xref ref-type="table" rid="T1">Table 1</xref>, we select representative studies on modeling contexts for SDAIB. We can find that these studies accommodate different modeling techniques for different application goals. However, these works do not consider a common methodology to guide analysts in the design and adoption of different types of modeling techniques, which is indeed the main objective of our work (as to be detailed in Section 3).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Featural comparison of representative studies on intelligent building context modeling.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>References</bold></th>
<th valign="top" align="center" colspan="6" style="border-bottom: thin solid #000000;"><bold>Building space</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>Internal entities</bold></th>
<th valign="top" align="center"><bold>Remarks</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>SG</bold></th>
<th valign="top" align="center"><bold>ST</bold></th>
<th valign="top" align="center"><bold>SS</bold></th>
<th valign="top" align="center"><bold>DG</bold></th>
<th valign="top" align="center"><bold>DT</bold></th>
<th valign="top" align="center"><bold>DS</bold></th>
<th valign="top" align="center"><bold>Static</bold></th>
<th valign="top" align="center"><bold>Dynamic</bold></th>
<th/>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Li and Lee (<xref ref-type="bibr" rid="B40">2008</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">Basic semantic relationships like containment and overlap</td>
</tr>
<tr>
<td valign="top" align="left">Worboys (<xref ref-type="bibr" rid="B77">2011</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">Formal models for building space</td>
</tr>
<tr>
<td valign="top" align="left">Jin et al. (<xref ref-type="bibr" rid="B30">2012</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="left">Supports sensors and static/moving objects</td>
</tr>
<tr>
<td valign="top" align="left">Lu et al. (<xref ref-type="bibr" rid="B54">2012</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">Focuses on distance-related computations in building spaces</td>
</tr>
<tr>
<td valign="top" align="left">Lin et al. (<xref ref-type="bibr" rid="B48">2013</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="left">Support dynamic changes <italic>via</italic> database update</td>
</tr>
<tr>
<td valign="top" align="left">Alamri et al. (<xref ref-type="bibr" rid="B3">2013</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="left">Support dynamic changes <italic>via</italic> indexing</td>
</tr>
<tr>
<td valign="top" align="left">Xie et al. (<xref ref-type="bibr" rid="B80">2014</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="left">Support dynamic changes <italic>via</italic> indexing and pointers</td>
</tr>
<tr>
<td valign="top" align="left">Kang and Li (<xref ref-type="bibr" rid="B32">2017</xref>)</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">Semantic representations of building spaces</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Objective of this work</bold></td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="center">&#x02713;</td>
<td valign="top" align="left">Methodology on context modeling</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><bold>SG</bold>, static geometric information; <bold>ST</bold>, static topological information; <bold>SS</bold>, static semantic information; <bold>DG</bold>, dynamic geometric information; <bold>DT</bold>, dynamic topological information; <bold>DS</bold>, dynamic semantic information.</p>
</table-wrap-foot>
</table-wrap>
<p>Aiming for formulating the methodology of modeling analytical contexts in SDAIB, we draw the following findings from the aforementioned studies. First, spatial models for buildings, whether based on geometric information or topological information, are suitable as the <italic>base</italic> model for organizing the analytical contexts, with which semantic information can be efficiently associated. When discussing a base model here, we mean a basic data structure that can be extended with advanced models/functions for handling specific types of analytical contexts. Second, for dynamic contexts of the building space, amendment and updating to the base model are often needed; in contrast, dynamic changes of the internal entities can be modeled by some external, complex models/functions that are flexibly linked to the base model.</p>
</sec>
</sec>
<sec>
<title>2.2. Modeling spatial data uncertainty</title>
<p>The spatial data uncertainty in SDAIB is mainly caused by two technical limitations of indoor positioning (tracking) systems, namely (1) the inaccuracy of the positioning algorithm and (2) the discrete sampling scheme of the system. The former is mainly reflected in the errors of estimated locations observed at a specific time point, while the latter results in incomplete and insufficient overall information for analysis. An example is shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. Suppose <italic>p</italic> and <italic>q</italic> are two locations consecutively observed by the positioning system at timestamps <italic>t</italic><sub><italic>p</italic></sub> and <italic>t</italic><sub><italic>q</italic></sub>, respectively. Due to the positioning inaccuracy, there are many possibilities for the actual locations at <italic>t</italic><sub><italic>p</italic></sub> and <italic>t</italic><sub><italic>q</italic></sub>. In <xref ref-type="fig" rid="F4">Figure 4</xref>, two small circles in pink are used to indicate the spatial uncertainties caused by positioning inaccuracy. On the other hand, the object whereabouts between <italic>t</italic><sub><italic>p</italic></sub> and <italic>t</italic><sub><italic>q</italic></sub> also have many possibilities as there are no observations in-between: First, the snapshot location at time <italic>t</italic> &#x02208; (<italic>t</italic><sub><italic>p</italic></sub>, <italic>t</italic><sub><italic>q</italic></sub>) is undetermined (called <italic>snapshot uncertainty</italic>) but should be constrained within a geometric region, e.g., as indicated by the shaded portions in <xref ref-type="fig" rid="F4">Figure 4</xref>, the location at <italic>t</italic> should be within an indoor distance <italic>r</italic> to the location at <italic>t</italic><sub><italic>p</italic></sub> such that <italic>r</italic> is the maximum distance one can reach from <italic>t</italic><sub><italic>p</italic></sub> to <italic>t</italic>; Second, the overall movement from <italic>t</italic><sub><italic>p</italic></sub> to <italic>t</italic><sub><italic>q</italic></sub> is also unknown (called <italic>interval uncertainty</italic>), e.g., one can reach <italic>q</italic> from <italic>p</italic> through either the door <italic>d</italic><sub>2</sub> or the doors <italic>d</italic><sub>3</sub> and <italic>d</italic><sub>1</sub>, as illustrated by the two possible paths in <xref ref-type="fig" rid="F4">Figure 4</xref>. In general, the modeling techniques for the uncertainties due to the positioning inaccuracy and those due to low sampling issues are quite different. As such, we present a classification of relevant techniques in <xref ref-type="fig" rid="F5">Figure 5</xref>. Their details are given below.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>An example of different kinds of spatial data uncertainties in SDAIB.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fdata-05-1049198-g0004.tif"/>
</fig>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>The classification of the approaches for handling spatial data uncertainties in SDAIB.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fdata-05-1049198-g0005.tif"/>
</fig>
<sec>
<title>2.2.1. Uncertainty due to the positioning inaccuracy</title>
<p>For a target object at a particular timestamp, the estimated location may deviate from the true location due to the inherent positioning error. This phenomenon is reflected in the uncertainty of the observed spatial data (Hunter, <xref ref-type="bibr" rid="B26">1999</xref>; Pfoser and Jensen, <xref ref-type="bibr" rid="B67">1999</xref>; Jeung et al., <xref ref-type="bibr" rid="B29">2013</xref>; Z&#x000FC;fle et al., <xref ref-type="bibr" rid="B88">2017</xref>), which must be taken into consideration in spatial data analyses, especially for the indoor scenes that usually pose high requirements on the precision of the spatial data.</p>
<p>Different modeling approaches have been employed, depending on the technologies (e.g., Wi-Fi, RFID, or Bluetooth) and positioning algorithms (e.g., proximity-based, geometry-based, or learning-based) employed in the indoor positioning system (Liu et al., <xref ref-type="bibr" rid="B50">2007</xref>).</p>
<p>In the proximity-based positioning that is commonly seen in RFID- and Bluetooth-based tracking solutions, the <italic>activation range based representation</italic> (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>; Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>; Lu et al., <xref ref-type="bibr" rid="B56">2011</xref>, <xref ref-type="bibr" rid="B55">2016</xref>) is employed to model the uncertainty of the object location reported in a tracking record. Specifically, a tracking record (<italic>o</italic><sub><italic>i</italic></sub>, <italic>r</italic><sub><italic>j</italic></sub>, <italic>t</italic>) indicates that an object (e.g., an RFID tag) <italic>o</italic><sub><italic>i</italic></sub> has been detected by a device (e.g., an RFID reader) <italic>r</italic><sub><italic>j</italic></sub> at time <italic>t</italic>. As the proximity-based tracking cannot give exact geometric coordinates, the possible position of <italic>o</italic><sub><italic>i</italic></sub> at <italic>t</italic> is simply considered to be within the activation range of the device <italic>r</italic><sub><italic>j</italic></sub>, which is usually modeled as a small circle &#x000B0;(<italic>l</italic><sub><italic>j</italic></sub>, <italic>mrd</italic><sub><italic>j</italic></sub>), centered at <italic>r</italic><sub><italic>j</italic></sub>&#x00027;s deployment position <italic>l</italic><sub><italic>j</italic></sub> with the maximum reader distance <italic>mrd</italic><sub><italic>j</italic></sub> of <italic>r</italic><sub><italic>j</italic></sub> as the radius.</p>
<p>The location result as a geometric coordinate is more widely used in practice (Liu et al., <xref ref-type="bibr" rid="B50">2007</xref>). Its form of (<italic>o</italic><sub><italic>i</italic></sub>, <italic>l</italic><sub><italic>j</italic></sub>, <italic>t</italic>) indicates that object <italic>o</italic><sub><italic>i</italic></sub> is considered to be at a specific indoor location <italic>l</italic><sub><italic>j</italic></sub> at time <italic>t</italic>. Usually, the indoor location is a two-dimensional plane point on a particular floor. In this study, for the sake of presentation simplicity, we focus on the uncertainty of the estimated location on a determined floor as the floor determination has proven highly accurate (&#x0003E;99%; Wang and Luo, <xref ref-type="bibr" rid="B76">2017</xref>). For geometric positioning results, the two following kinds of representations are often used.</p>
<list list-type="bullet">
<list-item><p>The <italic>instance-based representation</italic> (Xie et al., <xref ref-type="bibr" rid="B79">2013</xref>, <xref ref-type="bibr" rid="B80">2014</xref>; Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>) models the possible locations of the target object as a set of location samples, each being a pair of a location and an existence probability. Formally, the sample set is denoted as {(<italic>l</italic><sub><italic>i</italic></sub>, &#x003C1;<sub><italic>i</italic></sub>)} where <italic>l</italic><sub><italic>i</italic></sub> is the sample location and &#x003C1;<sub><italic>i</italic></sub> is the corresponding probability such that <inline-formula><mml:math id="M1"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:msub><mml:mrow><mml:mi>&#x003C1;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. The instance-based representation fits well with typical localization algorithms, such as <italic>k</italic>NN and naive Bayes (Liu et al., <xref ref-type="bibr" rid="B50">2007</xref>), which produce multiple possible locations along with their corresponding likelihoods. Moreover, instances can be used to approximate arbitrary distributions, providing high flexibility.</p></list-item>
<list-item><p>The <italic>circle-based representation</italic> (Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>) captures the possible location of the target object <italic>o</italic><sub><italic>i</italic></sub> as a circle &#x000B0;(<italic>l</italic><sub><italic>j</italic></sub>, &#x003F5;) where the circle center <italic>l</italic><sub><italic>j</italic></sub> is the observed location and the radius &#x003F5; is the pre-known average positioning error of the underlying positioning system. The circle-based representation can be modified and extended with prior knowledge, such as using different radii at different locations (Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>). It can also be converted into the instance-based representation (Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>). However, it is nontrivial to determine the radii at different building locations.</p></list-item>
</list>
</sec>
<sec>
<title>2.2.2. Uncertainty due to low sampling issues</title>
<p>For the uncertainty caused by low sampling issues, we further differentiate the <italic>snapshot uncertainty</italic> and the <italic>interval uncertainty</italic>. Their main difference is that the snapshot uncertainty discusses the object&#x00027;s whereabouts at a particular time point, while the interval uncertainty concerns the object&#x00027;s whereabouts during a time interval between two observations (Lu et al., <xref ref-type="bibr" rid="B55">2016</xref>).</p>
<sec>
<title>2.2.2.1. Snapshot uncertainty</title>
<p>Snapshot uncertainty is caused when the last observation has expired while the current object location has not been observed. Given the latest positioning record (<italic>o</italic><sub><italic>i</italic></sub>, <italic>x</italic><sub><italic>j</italic></sub>, <italic>t</italic><sub><italic>l</italic></sub>), where <italic>x</italic><sub><italic>j</italic></sub> is the observed location at the latest sampling time <italic>t</italic><sub><italic>l</italic></sub>, the possible locations of <italic>o</italic><sub><italic>i</italic></sub> at the current time <italic>t</italic><sub><italic>c</italic></sub>(&#x0003E;<italic>t</italic><sub><italic>l</italic></sub>) can be derived based on <italic>geometric approaches</italic> (Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>; Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>; Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>), as a space- and speed-constrained region. To be specific, given the maximum speed <italic>V</italic><sub><italic>max</italic></sub> of all indoor moving objects<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref>, the possible locations of the object <italic>o</italic><sub><italic>i</italic></sub> at time <italic>t</italic><sub><italic>c</italic></sub> are covered by an <italic>uncertainty region</italic> <italic>UR</italic>(<italic>x</italic><sub><italic>j</italic></sub>, <italic>t</italic><sub><italic>l</italic></sub>, <italic>t</italic><sub><italic>c</italic></sub>), which is computed as follows.</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M2"><mml:mrow><mml:mi>U</mml:mi><mml:mi>R</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>l</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign='left'><mml:mtr><mml:mtd><mml:msub><mml:mtext>Range</mml:mtext><mml:mi>I</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>&#x003B4;</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>,</mml:mo><mml:mtext>if</mml:mtext><mml:mo>&#x000A0;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x000A0;</mml:mo><mml:mtext>is&#x000A0;a&#x000A0;specific&#x000A0;location</mml:mtext><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle='true'><mml:msub><mml:mo>&#x0222A;</mml:mo><mml:mrow><mml:mi>p</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>.</mml:mo><mml:mtext>boundary</mml:mtext></mml:mrow></mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mtext>Range</mml:mtext></mml:mrow><mml:mi>I</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>p</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003B4;</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x02216;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mtext>if</mml:mtext><mml:mo>&#x000A0;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x000A0;</mml:mo><mml:mtext>is&#x000A0;an</mml:mtext></mml:mrow></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>activation&#x000A0;range</mml:mtext><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle='true'><mml:msub><mml:mo>&#x0222A;</mml:mo><mml:mrow><mml:mi>e</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mtext>Range</mml:mtext></mml:mrow><mml:mi>I</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003B4;</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>,</mml:mo><mml:mtext>if</mml:mtext><mml:mo>&#x000A0;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x000A0;</mml:mo><mml:mtext>is&#x000A0;an&#x000A0;instance&#x000A0;set</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle='true'><mml:msub><mml:mo>&#x0222A;</mml:mo><mml:mrow><mml:mi>p</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>.</mml:mo><mml:mtext>boundary</mml:mtext></mml:mrow></mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mtext>Range</mml:mtext></mml:mrow><mml:mi>I</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>p</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003B4;</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0222A;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mtext>if</mml:mtext><mml:mo>&#x000A0;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x000A0;</mml:mo><mml:mtext>is&#x000A0;a</mml:mtext></mml:mrow></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>small&#x000A0;circle</mml:mtext><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where &#x003B4; &#x0003D; <italic>V</italic><sub><italic>max</italic></sub> &#x000B7; (<italic>t</italic><sub><italic>c</italic></sub> &#x02212; <italic>t</italic><sub><italic>l</italic></sub>) is the maximum distance that <italic>o</italic><sub><italic>i</italic></sub> can walk from <italic>t</italic><sub><italic>l</italic></sub> to <italic>t</italic><sub><italic>c</italic></sub>, and the Indoor Range Query function Range<sub><italic>I</italic></sub>(<italic>p, dis</italic>) returns the indoor portions that can be reached within the distance <italic>dis</italic> from the given point <italic>p</italic>. The implementation of the Range<sub><italic>I</italic></sub> function has been covered in existing studies (Lu et al., <xref ref-type="bibr" rid="B54">2012</xref>; Xie et al., <xref ref-type="bibr" rid="B79">2013</xref>; Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>). In Equation (1), Case 1 assumes there is no positioning uncertainty (Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>), while the other three cases correspond to different kinds of representations of positioning uncertainties (see Section 2.2.1). In particular, Case 2 considers the activation range based representation (Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>) and <italic>x</italic><sub><italic>j</italic></sub> is excluded because the object is currently unobserved and must not be within any activation range; Case 3 considers the instance-based representation, and each instance is used to infer an uncertainty region that will be merged then; Case 4 refers to the circle-based representation (Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>) used in geometric positioning and the uncertainty region is modeled as an outwardly extended region of the last observed location <italic>x</italic><sub><italic>j</italic></sub>.</p>
<p>Generally speaking, the distribution of possible locations in the derived uncertainty region is not necessarily uniform. Following the first law of geography (Tobler, <xref ref-type="bibr" rid="B74">2004</xref>), i.e., things that are near are more related than things that are far away, various <italic>distance-decaying functions</italic> (Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>; Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>) have been designed to model the likelihood of different possible locations in an uncertainty region.</p>
</sec>
<sec>
<title>2.2.2.2. Interval uncertainty</title>
<p>Unlike snapshot uncertainty, interval uncertainty happens across the intervals between two consecutively observed locations. We categorize three different types of approaches for modeling interval uncertainty as follows.</p>
<list list-type="bullet">
<list-item><p>The <italic>geometric approaches</italic> (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>; Lu et al., <xref ref-type="bibr" rid="B56">2011</xref>, <xref ref-type="bibr" rid="B55">2016</xref>; Teng et al., <xref ref-type="bibr" rid="B72">2017</xref>; Li et al., <xref ref-type="bibr" rid="B42">2020b</xref>) calculate the possible locations of an object between its two consecutively observed locations based on spatial constraints. Suppose (<italic>x</italic><sub><italic>a</italic></sub>, <italic>t</italic><sub><italic>a</italic></sub>) and (<italic>x</italic><sub><italic>b</italic></sub>, <italic>t</italic><sub><italic>b</italic></sub>) are two consecutively positioning records, where <italic>x</italic><sub><italic>a</italic></sub> and <italic>x</italic><sub><italic>b</italic></sub> are observed locations and <italic>t</italic><sub><italic>a</italic></sub> and <italic>t</italic><sub><italic>b</italic></sub> are the corresponding timestamps. Referring to Section 2.2.1, an observed location <italic>x</italic><sub><italic>a</italic></sub> in the RFID setting is represented as a small circle &#x000B0;(<italic>l</italic><sub><italic>a</italic></sub>, <italic>mrd</italic><sub><italic>a</italic></sub>) centered at the reader&#x00027;s deployed location <italic>l</italic><sub><italic>a</italic></sub> with the maximum reader distance <italic>mrd</italic><sub><italic>a</italic></sub> as the radius. Jensen et al. (<xref ref-type="bibr" rid="B27">2009a</xref>) and Lu et al. (<xref ref-type="bibr" rid="B55">2016</xref>) point out that the possible locations during the time interval [<italic>t</italic><sub><italic>a</italic></sub>, <italic>t</italic><sub><italic>b</italic></sub>] must be constrained by an extended ellipse, whose two foci are located at <italic>l</italic><sub><italic>a</italic></sub> and <italic>l</italic><sub><italic>b</italic></sub> and eccentricity is defined by the distance [<italic>V</italic><sub><italic>max</italic></sub> &#x000B7; (<italic>t</italic><sub><italic>b</italic></sub> &#x02212; <italic>t</italic><sub><italic>a</italic></sub>) &#x0002B; <italic>mrd</italic><sub><italic>a</italic></sub> &#x0002B; <italic>mrd</italic><sub><italic>b</italic></sub>] where <italic>V</italic><sub><italic>max</italic></sub> is the maximum speed of moving objects. Moreover, the ellipse must exclude those unreachable indoor parts in terms of minimum indoor walking distance, rather than the Euclidean distance. Also in the RFID tracking scenario, Lu et al. (<xref ref-type="bibr" rid="B56">2011</xref>) indicate that the possible location at a particular time point <italic>t</italic><sub><italic>c</italic></sub> &#x02208; [<italic>t</italic><sub><italic>a</italic></sub>, <italic>t</italic><sub><italic>b</italic></sub>] is constrained by two rings ring[<italic>l</italic><sub><italic>a</italic></sub>, <italic>mrd</italic><sub><italic>a</italic></sub>, <italic>V</italic><sub><italic>max</italic></sub> &#x000B7; (<italic>t</italic><sub><italic>c</italic></sub> &#x02212; <italic>t</italic><sub><italic>a</italic></sub>)] and ring[<italic>l</italic><sub><italic>b</italic></sub>, <italic>mrd</italic><sub><italic>b</italic></sub>, <italic>V</italic><sub><italic>max</italic></sub> &#x000B7; (<italic>t</italic><sub><italic>b</italic></sub> &#x02212; <italic>t</italic><sub><italic>c</italic></sub>)], where ring(<italic>p, dia</italic><sub>1</sub>, <italic>dist</italic><sub>2</sub>) refers to a ring centered at point <italic>p</italic> with inside diameter <italic>dia</italic><sub>1</sub> and outside diameter <italic>dia</italic><sub>2</sub>. Finally, the intersection of the two rings excluded those unreachable indoor portions is used to model the uncertainty region at time <italic>t</italic><sub><italic>c</italic></sub>. Similar to the space-constrained intersection of rings (Lu et al., <xref ref-type="bibr" rid="B56">2011</xref>), Li et al. (<xref ref-type="bibr" rid="B42">2020b</xref>) employ the space-constrained intersection of circles for geometric positioning results. Assuming grid-based partitioning of the target indoor space, Teng et al. (<xref ref-type="bibr" rid="B72">2017</xref>) use a set of reachable cells to represent possible object locations. Each grid satisfies that any moving object can reach the previously observed location <italic>l</italic><sub><italic>a</italic></sub> and next observed location <italic>l</italic><sub><italic>b</italic></sub> within the limited traveling time.</p></list-item>
<list-item><p>The <italic>probabilistic approaches</italic> (Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>, <xref ref-type="bibr" rid="B42">2020b</xref>) represent the possible indoor movement between two observed locations in a probabilistic form. Representing the indoor space as a graph with rooms as nodes and doors as edges, Li et al. (<xref ref-type="bibr" rid="B42">2020b</xref>) compute the transition probability on each directed edge using historical data, and analyze the movement between two observation locations (rooms) based on conditional probabilities with the computed transition probabilities as prior knowledge. Assuming that the location at each observed time point is a set of instances with probabilities, another work (Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>) uses the Cartesian product of the instances at continuous time points to generate a set of possible indoor paths, and the probability of each path is calculated as the product of the probabilities of the instances it goes through. Those indoor paths violating indoor topology will be removed and the probabilities of the rest paths will be normalized.</p></list-item>
<list-item><p>The <italic>model-based approaches</italic> (Hightower and Borriello, <xref ref-type="bibr" rid="B25">2004</xref>; Petzold et al., <xref ref-type="bibr" rid="B66">2005</xref>; Bekkali et al., <xref ref-type="bibr" rid="B8">2007</xref>; Asahara et al., <xref ref-type="bibr" rid="B4">2011</xref>; Laursen et al., <xref ref-type="bibr" rid="B38">2012</xref>; Patel and Thakore, <xref ref-type="bibr" rid="B64">2013</xref>; Yu et al., <xref ref-type="bibr" rid="B84">2013</xref>; Baba et al., <xref ref-type="bibr" rid="B5">2016</xref>; Belmonte-Hernandez et al., <xref ref-type="bibr" rid="B9">2019</xref>; Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>; Tariq et al., <xref ref-type="bibr" rid="B71">2021</xref>) capture the underlying data generation mechanism of the observed data in a model, which is then used to recover the unsampled time points. Three representative techniques, namely Bayes filters, probabilistic graphical models, and neural networks, have been explored, mainly depending on capturing the temporal dependencies of the sequential positioning records. The Bayes filter sequentially estimates the optimal location of the target object by mapping the noisy location observations at each time point to a fuzzy representation, such as samples in particle filters (Hightower and Borriello, <xref ref-type="bibr" rid="B25">2004</xref>; Yu et al., <xref ref-type="bibr" rid="B84">2013</xref>) and Gaussian distributions in Kalman filters (Bekkali et al., <xref ref-type="bibr" rid="B8">2007</xref>; Patel and Thakore, <xref ref-type="bibr" rid="B64">2013</xref>). Probabilistic graphical models represent the observed object positions as discrete and piecewise constant states and learn dependencies between those states using historical data. Different models have been explored to incorporate different kinds of application semantics, such as hidden Markov models (Laursen et al., <xref ref-type="bibr" rid="B38">2012</xref>; Baba et al., <xref ref-type="bibr" rid="B5">2016</xref>), conditional random fields (Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>), dynamic Bayesian networks (Petzold et al., <xref ref-type="bibr" rid="B66">2005</xref>), and mixed Markov-chain models (Asahara et al., <xref ref-type="bibr" rid="B4">2011</xref>). Recently, several works propose using deep neural networks to learn intrinsic dependencies among sequential positioning records, such as recurrent neural networks (Belmonte-Hernandez et al., <xref ref-type="bibr" rid="B9">2019</xref>) and 1-D convolutional networks (Tariq et al., <xref ref-type="bibr" rid="B71">2021</xref>).</p></list-item>
</list>
</sec>
</sec>
<sec>
<title>2.2.3. Summary of modeling techniques for spatial data uncertainties</title>
<p>First, to handle the spatial data uncertainty, the characteristics of the indoor positioning/tracking system, such as the used technologies and positioning algorithms, must be taken into account. Second, the uncertainty due to the positioning inaccuracy should be handled before the uncertainty due to low sampling issues. In the latter case, the handling techniques usually utilize spatial observations to derive the uncertainty regions. Such spatial observations, obtained from an indoor positioning system, often carry positioning errors (see Section 2.2.1). As these positioning errors take effect in the analyses, they should be handled before deriving the uncertainty regions with respect to low sampling issues. Referring to the example in <xref ref-type="fig" rid="F4">Figure 4</xref>, the positioning errors of the observed location <italic>p</italic> are considered in modeling the snapshot uncertainty at a time <italic>t</italic> &#x02208; (<italic>t</italic><sub><italic>p</italic></sub>, <italic>t</italic><sub><italic>q</italic></sub>) (the shaded regions). Last, in handling uncertainties caused by low sampling, the analytical contexts are very useful. For example, the Indoor Range Query function used in snapshot uncertainty models (Xie et al., <xref ref-type="bibr" rid="B79">2013</xref>; Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>) heavily relies on the geometric and topological information of the building space: To return the indoor portions that one can reach within a limited time budget, the function needs to know the positions of doors to access adjacent rooms as well as the size of rooms. In <xref ref-type="fig" rid="F4">Figure 4</xref>, deriving the uncertainty region (the shaded portions) considers the position of door <italic>d</italic><sub>3</sub> and geometries of partitions <italic>v</italic><sub>1</sub> and <italic>v</italic><sub>3</sub>. For another example, some model-based approaches for interval uncertainties (Laursen et al., <xref ref-type="bibr" rid="B38">2012</xref>; Yu et al., <xref ref-type="bibr" rid="B84">2013</xref>; Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>) require both spatial and semantic information to build a probabilistic graphical model. In particular, semantics like sensor deployment and geometries of indoor space are used as prior knowledge for designing the corresponding state space.</p>
</sec>
</sec>
</sec>
<sec id="s3">
<title>3. A unified modeling framework</title>
<p>Given that various kinds of analytical contexts and spatial data uncertainties should be considered in different SDAIB tasks, we propose a unified modeling framework based on which data analysts can incorporate relevant contexts and spatial data uncertainties into their specific tasks through a standard workflow. The framework is illustrated in <xref ref-type="fig" rid="F6">Figure 6</xref>, which consists of six steps for modeling analytical contexts followed by two steps for modeling spatial data uncertainties. We model the analytical contexts first and then spatial data uncertainties because handling data uncertainties usually rely on the contexts.</p>
<list list-type="order">
<list-item><p><bold>Handling Static Contexts of Building Space</bold>. According to the summary in Section 2.1.3, we first use the spatial information (including geometric and topological information) to build a base model called <italic>connectivity base graph</italic> (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>)<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref>. We use a graph-based model due to its flexibility in handling dynamic changes in both nodes and edges. Specifically, we divide the target building space into a set of basic topological units according to the geometric decomposition algorithm (Xie et al., <xref ref-type="bibr" rid="B80">2014</xref>). Each such unit is called a partition and represented as a graph node. Then, each two topologically connected partitions are represented by an edge, on which the connecting door is maintained. Using directed edges is an alternative solution if the door directionality is critical (see <italic>accessibility graph</italic>; Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>). Afterwards, we capture the geometric and semantic properties of each building element using object-oriented data models and link those data models to the components of the connectivity base graph (i.e., doors and partitions).</p></list-item>
<list-item><p><bold>Handling Static Contexts of Internal Entities</bold>. Similar to linking building elements to the connectivity base graph, internal entities such as sensors and positioning reference points are associated with the graph nodes and edges after being modeled with geometric and semantic properties. Some analyses require knowing topological relationships between specific internal entities. For example, the asset tracing task may need to know whether two specific RFID readers&#x00027; activation ranges overlap with each other (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>; Yang et al., <xref ref-type="bibr" rid="B82">2010</xref>; Lu et al., <xref ref-type="bibr" rid="B56">2011</xref>, <xref ref-type="bibr" rid="B55">2016</xref>). Thus, we derive topological relationships of internal entities based on their geometric and semantic information, and add them to the base graph.</p></list-item>
<list-item><p><bold>Specific Optimization to Static Context Modeling</bold>. To improve data access efficiency for specific tasks, facilitator data structures are designed. Firstly, if the analysis task focuses on the data captured as IDs of sensors (or doors, or positioning reference points), a dual-form transformation of the original connectivity base graph will be beneficial. In particular, the dual-form graph captures the sensors (or doors, or positioning reference points) of interest as graph nodes and their relationships as edges. Secondly, materialized mappings between different kinds of objects can be pre-computed to facilitate search. Moreover, spatial indexes would speed up spatial computations (Liu et al., <xref ref-type="bibr" rid="B52">2021b</xref>), e.g., R-trees and grids for geometric computations (Choi et al., <xref ref-type="bibr" rid="B17">2004</xref>; Lu et al., <xref ref-type="bibr" rid="B54">2012</xref>) and VIP-trees for topology-relevant operations (Shao et al., <xref ref-type="bibr" rid="B69">2016</xref>). Nevertheless, building these additional facilitator data structures incurs more storage space.</p></list-item>
<list-item><p><bold>Handling Dynamic Contexts of Internal Entities</bold>. Steps 4 and 5 can be performed simultaneously, while Step 4 for internal entities is more straightforward. Specifically, for the dynamic changes of an internal entity, we update its corresponding object-oriented model directly. The update order is geometric information first, semantic information second, and topological information last, considering that the update of one kind of information will affect another kind of information.</p></list-item>
<list-item><p><bold>Handling Dynamic Contexts of Building Space</bold>. Updating building elements in this step is almost the same as updating internal entities in Step 4, except for the topological information. Since topological relationships between building elements are maintained directly in the base graph, frequently updating the base graph will lead to chained changes. An optimization strategy is to establish checkpoints of the base graph (or its dual-form graph) along the time and maintain incremental updates of graph edges. We refer interested readers to materials (Mondal and Deshpande, <xref ref-type="bibr" rid="B60">2012</xref>; Zaki et al., <xref ref-type="bibr" rid="B85">2016</xref>) on dynamic graph modeling.</p></list-item>
<list-item><p><bold>Update Mappings/Indexes Related to Dynamic Contexts</bold>. Mappings and indexes should be updated accordingly if there are dynamic information changes in building elements and internal entities. Updating such facilitator data structures will potentially incur a large time overhead, which should be considered carefully in their design. We refer interested readers to an empirical study on indoor spatial indexes (Liu et al., <xref ref-type="bibr" rid="B52">2021b</xref>).</p></list-item>
<list-item><p><bold>Handling Uncertainty Caused by the Positioning Inaccuracy</bold>. The summary in Section 2.2.3 discloses that positioning uncertainty must be handled before the low sampling uncertainty. To model positioning uncertainty, the representation choice depends on the technology and positioning algorithm employed by the underlying indoor positioning system. Usually, activation range based representation is used in symbolic tracking using RFID and infrared, instance-based representation fits well with the probabilistic positioning algorithms, and circle-based representation is a default choice for geometric positioning results.</p></list-item>
<list-item><p><bold>Handling Uncertainty Caused by Low Sampling Issues</bold>. Finally, we deal with low sampling uncertainty. Before modeling, we need to introduce relevant analytical contexts as prior knowledge, such as topology and geometric information of the building. If the data observations before and after the analysis time are available, modeling interval uncertainty is considered. Otherwise, we model the snapshot uncertainty. The choice of the modeling approach for low sampling uncertainties (see Section 2.2.2) should consider the characteristics of the indoor positioning system, as well as the availability of prior knowledge and historical positioning data.</p></list-item>
</list>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>A unified framework for modeling analytical contexts and spatial data uncertainties.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fdata-05-1049198-g0006.tif"/>
</fig>
<p>The last two steps of the proposed framework focus on modeling data uncertainties related to data analyses, which is considered a fundamental process for reducing data uncertainties in downstream applications. On the one hand, interpolation and sampling techniques on top of a data uncertainty model can be employed to derive or recover the unobserved data (Li et al., <xref ref-type="bibr" rid="B42">2020b</xref>), thus reducing the data uncertainty explicitly. On the other hand, data uncertainty models can be incorporated into some analytical processes [e.g., graphical models Li et al. (<xref ref-type="bibr" rid="B41">2020a</xref>) and probabilistic queries Li et al. (<xref ref-type="bibr" rid="B45">2018c</xref>)] in a probabilistic way, mitigating the negative impact of spatial data uncertainty in analyses. Examples of reducing data uncertainty based on the modeling techniques will be provided in Section 4.</p>
</sec>
<sec id="s4">
<title>4. Context-aware and uncertainty-aware data analyses</title>
<p>We will describe how the unified modeling framework proposed in Section 3 enables context-aware and uncertainty-aware data analyses. Sections 4.1&#x02013;4.4 cover hotspots discovery, path planning, semantic trajectory generation, and distance monitoring, respectively. For each task, we will introduce its problem background, modeling process, and analysis algorithms based on the aforementioned framework. We refer interested readers to the original papers for detailed empirical studies.</p>
<sec>
<title>4.1. Hotspots discovery</title>
<p>Public indoor venues, such as shopping malls, subway stations, and airports, are often crowded during peak times. For example, as early as 2004, the weekend traffic at Hong Kong New Town Plaza<xref ref-type="fn" rid="fn0005"><sup>5</sup></xref> reached 320,000 (Xie et al., <xref ref-type="bibr" rid="B80">2014</xref>). Crowd gatherings can easily lead to congestion, safety incidents, and public health concerns. Therefore, it becomes fundamentally important to find out and monitor those indoor hotspots with high populations and flows. Below, we present our techniques for finding out the <italic>indoor dense regions</italic> with the highest populations (Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>) and the <italic>indoor popular semantic locations</italic> with the highest flows (Li et al., <xref ref-type="bibr" rid="B44">2018b</xref>), respectively.</p>
<sec>
<title>4.1.1. Indoor dense regions</title>
<p>The previous study (Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>) aims to timely find the <italic>k</italic> most dense regions using only the latest geometric positioning result of each observed object in a Wi-Fi based indoor positioning system. The candidate regions are arbitrary rectangles customized by users. As the latest observed locations may be outdated, the counting of objects in a region is based on the objects&#x00027; uncertainty regions derived from their latest observations. Specifically, for each object <italic>o</italic><sub><italic>i</italic></sub> whose uncertainty region at the current query time intersects with the candidate region <italic>r</italic><sub><italic>j</italic></sub>, the proportion of the intersecting area occupying <italic>o</italic><sub><italic>i</italic></sub>&#x00027;s uncertainty region is added to the density of <italic>r</italic><sub><italic>j</italic></sub> as the probability that <italic>o</italic><sub><italic>i</italic></sub> currently presents in <italic>r</italic><sub><italic>j</italic></sub>.</p>
<p>The modeling for analytical contexts and spatial data uncertainties are as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. For static contexts, a connectivity base graph is built with geometric and topological information of rooms, hallways, and staircases. The indoor moving objects, as internal entities, are maintained in each partition&#x00027;s associated bucket. An R-tree and the mappings between moving objects and partitions are used to speed up locating objects. There is no dynamic change of the building space but the object movements are updated in the base graph and facilitator structures.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. For simplicity, positioning uncertainty is not considered in the latest positioning records&#x02014;nevertheless, it can be extended to instance-based or circle-based representation. Subsequently, to infer the possible location of an object at a particular query time, the snapshot uncertainty model is introduced, corresponding to Case 1 in Equation (1).</p></list-item>
</list>
<p>Given the snapshot uncertainty region, it is however time-consuming to compute the probability-based object presence for each pair of a moving object and a candidate region. We briefly introduce the basic idea for efficiently finding the top-<italic>k</italic> densest regions. In particular, instead of calculating the intersecting areas in the indoor space, a simple counting of those certainly and possibly intersected objects is used to derive the lower bound and upper bound of a region&#x00027;s density. In this way, some unpromising regions can be pruned before concrete density computations. The search further utilizes spatial loose bounds enabled by Euclidean distances and temporal loose bounds enabled by the oldest observed timestamps for even more aggressive pruning.</p>
</sec>
<sec>
<title>4.1.2. Indoor popular semantic locations</title>
<p>The study (Li et al., <xref ref-type="bibr" rid="B45">2018c</xref>) focuses on identifying those region-like semantic locations with the highest flows during a past time interval. The available historical positioning data is in the instance-based representation and each instance corresponds to a pre-selected reference point in the Wi-Fi based positioning system. Due to low sampling uncertainty, it is necessary to obtain all possible indoor paths of each object that appeared in the given time interval. To this end, the Cartesian product is used to connect instances of any two consecutive reporting times, and the probability of a resultant sequence is the product of the probabilities of all instances on that sequence. Among all generated sequences, those having two consecutive instances not topologically connected are invalid and should be removed, and those remaining sequences are assigned normalized probabilities eventually. As a result, we consider the flow of each candidate semantic location as the summed probabilities of all valid indoor paths that have passed that semantic location.</p>
<p>The summary of its modeling of contexts and data uncertainties is as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. First, a dual-form graph is generated from the original connectivity base graph, in which reference points are regarded as nodes and the passed semantic locations between two reference points are maintained at edges. In this way, it is easy to find relevant semantic locations given any two consecutive instances (reference points). Second, R-trees are used to index both semantic locations and positioning data, which enable efficient spatial joins between semantic locations and positioning instances in the search.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. As mentioned above, positioning uncertainty is modeled using the instance-based representation while low sampling uncertainty is modeled using an interval uncertainty model, i.e., a set of possible instance sequences.</p></list-item>
</list>
<p>To speed up the search, several supporting techniques are proposed. First, to avoid the explosive instance sequences generated by the Cartesian product, indoor topology based pruning and semantic information based instance grouping are used for data reduction. Second, spatial joins over the semantic location R-tree and the positioning instance R-tree are utilized, prioritizing those semantic locations (and their parent regions) that potentially cover more passing moving objects.</p>
</sec>
</sec>
<sec>
<title>4.2. Path planning</title>
<p>Modern buildings are becoming increasingly large and complex, e.g., the Dubai Mall covers an area of over 12 million square feet (equivalent to more than 50 soccer fields)<xref ref-type="fn" rid="fn0006"><sup>6</sup></xref>, the King Fahd International Airport is 3.5 million square feet<xref ref-type="fn" rid="fn0007"><sup>7</sup></xref>, and the Louvre museum covers a total area of 782,910 square feet of galleries space, as big as 280 tennis courts<xref ref-type="fn" rid="fn0008"><sup>8</sup></xref>. It is difficult for people to find an appropriate path in such a large and complex building, especially when they are new visitors. Moreover, to better cater to the user&#x00027;s navigation needs, some important contextual information, such as semantic information of POIs, temporal variation of doors, and dynamic crowds, should be considered in the pathfinding, rather than simply returning the shortest path between the source point and the target point. Correspondingly, we present the following techniques for <italic>keyword-aware path planning</italic> (Feng et al., <xref ref-type="bibr" rid="B22">2020</xref>), <italic>temporal-variation aware path planning</italic> (Liu et al., <xref ref-type="bibr" rid="B51">2021a</xref>), and <italic>crowd-aware path planning</italic> (Liu et al., <xref ref-type="bibr" rid="B53">2021c</xref>).</p>
<sec>
<title>4.2.1. Keyword-aware path planning</title>
<p>In the previous study (Feng et al., <xref ref-type="bibr" rid="B22">2020</xref>), we focus on finding indoor paths that can cover user-specific keywords (e.g., &#x0201C;cola&#x0201D; and &#x0201C;toilet&#x0201D;). Specifically, an <italic>indoor top-</italic><italic>k</italic> <italic>keyword-aware routing query</italic> is defined, which returns the <italic>k</italic> best routes from the start point to the target point that are not longer than a distance constraint and have the highest ranking scores defined as a linear combination of the route&#x00027;s keyword relevance and its length.</p>
<p>According to the workflow defined in the proposed framework, the modeling is as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. A connectivity base graph is built to maintain the geometric and topological information used for routing. For semantic information, two types of keywords are considered. An identity word (i-word) is an indoor partition&#x00027;s unique semantic name such as <italic>McDonald&#x00027;s</italic> and <italic>toilet</italic>, while a thematic word (t-word) is the description of an i-word such as <italic>hamburger</italic> and <italic>cola</italic> for <italic>McDonald&#x00027;s</italic>. Two mappings are used to organize i-words and t-words, i.e., finding a group of relevant t-words for a given i-word and finding relevant i-words for a given t-word. Two other mappings are used to maintain the one-to-many relationships between an i-word and a set of associated partitions. The mappings allow for updating in case of changes of semantic information.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. As the analysis does not involve any internal entities and positioning data, no uncertainty is modeled.</p></list-item>
</list>
<p>The aforementioned context model supports the top-<italic>k</italic> keyword-aware routing query answering in the following aspects. First, the maintained relationships between keywords and partitions are used to compute the keyword relevance score of a given route represented as a sequence of passed partitions. The keyword relevance score is further combined with the route distance in a weighted manner for ranking the most suitable indoor paths for the user query. Second, geometric and topological information of the building space is used to design a set of pruning rules, reducing the search space of candidate paths between the source and the target. Enabled by different pruning rules, two search strategies are proposed. In particular, the topology-oriented expansion strategy expands a partial path to the next graph node<xref ref-type="fn" rid="fn0009"><sup>9</sup></xref>, computes the ranking score of the partial path, and prioritizes processing those partial paths with higher ranking scores until <italic>k</italic> complete paths are found. In contrast, the keyword-oriented expansion strategy first finds all keyword-relevant partitions and then searches for complete paths by orchestrating these keyword-relevant partitions and complementing other in-between partitions based on indoor topology.</p>
<p>A follow-up work (Chan et al., <xref ref-type="bibr" rid="B13">2021</xref>) considers more sophisticated contexts in keyword-aware routing, such as the time cost to spend on a POI and the <italic>category word</italic> of a POI. In the modeling, these static contexts of building space are linked to the partitions in the base graph.</p>
</sec>
<sec>
<title>4.2.2. Temporal-variation aware path planning</title>
<p>The previous work (Liu et al., <xref ref-type="bibr" rid="B51">2021a</xref>) aims to find the shortest indoor paths for users while considering the temporal variations of doors and blocking of rooms during their navigation. Such a pathfinding problem reflects the real-world scenarios where many doors have regular opening and closing times and many rooms may be temporarily occupied and unavailable for routing use.</p>
<p>To solve the problem, we consider the modeling process as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. Static contexts including geometric and topological information are maintained in the base graph, like partitions&#x00027; shapes and locations of doors. Dynamic contexts mainly consist of doors&#x00027; <italic>Active Time Intervals</italic> and types of doors/partitions, where the former records each door&#x00027;s opening and closing times and the latter indicates whether a door/partition is private or public. All these dynamic contexts are linked to the graph and allow for updating. To speed up pathfinding, an indoor temporal-variation index is built upon VIP-tree (Shao et al., <xref ref-type="bibr" rid="B69">2016</xref>). The tree structure is constructed by hierarchically gathering those topologically connected partitions such that each internal tree node corresponds to an interconnected area. Moreover, each internal node links to a distance cube that materializes the time-parameterized shortest path information among the doors covered by the internal node.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. Similar to the work above, no spatial data uncertainty is involved.</p></list-item>
</list>
<p>Based on the constructed context model, synchronous and asynchronous strategies are proposed to deal with temporal variations during the pathfinding over the graph. In particular, synchronous pathfinding checks temporal variations whenever a node expansion is needed whereas asynchronous pathfinding enjoys lazy computations as graph checkpoints are used to indicate the topological changes. An accelerated search using the indoor temporal-variation index is also proposed. This search first finds leaf nodes covering the source and the target, then finds the lowest common ancestor for the two leaf nodes, and finally connects the source to the target <italic>via</italic> their common ancestors with materialized shortest path information.</p>
</sec>
<sec>
<title>4.2.3. Crowd-aware path planning</title>
<p>The work (Liu et al., <xref ref-type="bibr" rid="B53">2021c</xref>) aims for finding proper indoor paths by taking into account the crowded people en route. Two different query types are studied. The <italic>indoor crowd-aware fastest path query</italic> (FPQ) returns the path with the shortest traveling time in the presence of the lag caused by crowds, and the <italic>indoor least crowded path query</italic> (LCPQ) finds a path encountering the least number of objects.</p>
<p>To answer these two queries, contexts and data uncertainties are handled as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. Static topological and geometric information such as partitions&#x00027; areas and shapes are maintained in the base graph. The dynamic contexts of internal entities, such as the populations of partitions at each time interval and the flow <italic>via</italic> a door, are modeled by external models that are linked to the base model. They serve to estimate the traveling time and number of encountered objects at a particular time during en route.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. Positioning data is used to model and predict the time-evolving door flows and partition populations, whose uncertainty will however affect the estimation of traveling time and the number of encountered objects. Without deriving the uncertainty region of each individual object, we aggregate the uncertain positioning data to obtain uncertain flows at different time points, and fit it with a Poisson distribution. To deal with the uncertainty at the flow level, we capture the relationship between inflows and outflows of partitions according to the indoor topology, and exploit it to design a recursive flow rectification scheme for accurate population estimation.</p></list-item>
</list>
<p>The Dijkstra-based search over the graph model along with time-evolving population estimators is used to process FPQ and LCPQ. Two exact and two approximate estimators are provided in the query processing framework. Among the two exact estimators, the global one executes flow rectification over all partitions while the local one focuses on rectifying the inflow/outflow of a relevant partition. The two approximate estimators accelerate the query processing at the cost of accuracy. One skips the outflow rectification of those dependent partitions when deriving the population of a relevant partition, while the other optionally uses probability functions to predict populations instead of the timestamp-by-timestamp derivation of populations.</p>
</sec>
</sec>
<sec>
<title>4.3. Semantic trajectory generation</title>
<p>Translating raw indoor positioning sequences into human-readable, concise indoor semantic trajectories is practically useful. Various downstream applications, such as querying, visualization, and recommendation, can all benefit from such semantics-oriented representation. Following many existing studies on outdoor semantic trajectories (Parent et al., <xref ref-type="bibr" rid="B63">2013</xref>; Yan et al., <xref ref-type="bibr" rid="B81">2013</xref>), an object&#x00027;s indoor semantic trajectory is captured as a sequence of triples in the form of (<italic>s, e</italic>, &#x003C4;), meaning that the object has a mobility event <italic>e</italic> &#x02208; {stay, pass-by} within a semantic region <italic>s</italic> during the time interval &#x003C4;. Semantic trajectory generation is a complex process. A layered framework for constructing indoor semantic trajectories has been introduced in previous studies (Li et al., <xref ref-type="bibr" rid="B43">2018a</xref>, <xref ref-type="bibr" rid="B42">2020b</xref>), encompassing the subtasks of data cleaning, semantic annotation, and sequence completion. A follow-up work (Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>) focuses on improving the performance of semantic annotation. In these studies, sufficient consideration of contexts and data uncertainty is a prerequisite for achieving decent performance in generating indoor semantic trajectories.</p>
<p>Following the proposed framework, the modeling process is executed as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. On top of the connectivity base graph, the information of those semantic regions is maintained by hashtables where each semantic region is mapped to a set of indoor partitions. The topological relationships between semantic regions are also derived and added to the base graph. Indoor partitions are indexed by R-trees such that raw positioning records can be efficiently mapped to the covering indoor partition and further mapped to the covering semantic region.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. To facilitate the cleaning of historical positioning data, an interval uncertainty model is employed, capturing the possible object whereabouts as the space-constrained intersection of two circles (Li et al., <xref ref-type="bibr" rid="B42">2020b</xref>). Subsequently, to complete the missing triples in a generated semantic trajectory, the possible movement across the unobserved time interval is modeled as a set of possible paths on the connectivity base graph. The historical data is used to calculate the prior transition probabilities between two semantic regions such that the missing movement between semantic regions can be modeled and inferred using probabilistic approaches.</p></list-item>
</list>
<p>Finally, we briefly describe how these models for contexts and data uncertainties support the subtasks of semantic trajectory generation:</p>
<list list-type="order">
<list-item><p>For <italic>positioning data cleaning</italic>, the interval uncertainty model is used to identify outliers in the raw data. Particularly, any observed location falling out of the derived uncertainty region is determined as an outlier, and a sample drawn from the uncertainty region will substitute the outlier.</p></list-item>
<list-item><p>For <italic>semantic annotation</italic>, positioning sequence is segmented using a density-based clustering approach. In particular, a segment with low spatiotemporal density is likely to match a pass-by event; otherwise, high spatiotemporal density indicates a stay event. Once events have been determined, the corresponding semantic region is estimated as the one occupying the largest fraction of the segment&#x00027;s combined uncertainty region. This method determines the event and semantic region sequentially, which may bring about chained errors. Therefore, probabilistic graphical models (Li et al., <xref ref-type="bibr" rid="B41">2020a</xref>) have been exploited to jointly model the matching probabilities of semantic regions and those of events as well as learn their dependencies along the time. To model the energy functions in such a graphical model, indoor geometric information, and semantic information are exploited.</p></list-item>
<list-item><p>For <italic>sequence completion</italic>, the missing triples corresponding to the low sampling issue of the raw data are inferred. Specifically, the historical movement patterns among semantic regions are modeled as transition probabilities between graph nodes. Subsequently, the most likely path between two generated triples is found as the one with maximum posterior probability.</p></list-item>
</list>
</sec>
<sec>
<title>4.4. Distance monitoring</title>
<p>The COVID-19 pandemic has endangered people&#x00027;s life since the beginning of 2020. As of August 22, 2022, it had caused more than 6 millions deaths<xref ref-type="fn" rid="fn0010"><sup>10</sup></xref>. One of the effective approaches to contain the spread of the virus is to keep proper social distancing (e.g., at least 1 m apart) between people. This is particularly important in buildings like high-risk workplaces (e.g., people in nursing homes and quarantine hotels). In the following, we introduce <italic>continuous social distance monitoring</italic> (SDM) (Chan et al., <xref ref-type="bibr" rid="B12">2022</xref>), which aims to monitor and predict the pairwise distances between moving objects (people) in real-time. In particular, given a set of moving objects in an indoor space, SDM identifies all object pairs that are going to form close contact, i.e., having a distance smaller than a pre-defined threshold (e.g., 1 m) within a near future (e.g., in 5 s or so). One example application is that a museum or gallery can integrate this social distance monitoring into their guide app, which allows visitors to keep track of their distances from others while visiting the exhibition. It can suggest further actions to those contact visitors by alerting them.</p>
<p>In this task, the modeling process is as follows.</p>
<list list-type="bullet">
<list-item><p><bold>Analytical Contexts</bold>. To maintain the static context including the geometric and topological information of the building, a connectivity base graph is used. Door-to-door shortest distances and partition-to-partition shortest distances are pre-computed and stored in two matrices. The internal entities of interest are the indoor moving objects (people), and they are maintained in the buckets linked to the covering partition. When a new location update is received, the bucket information of the corresponding object will be updated.</p></list-item>
<list-item><p><bold>Spatial Data Uncertainties</bold>. Both positioning inaccuracy and low sampling issues are considered in modeling object positions. To deal with uncertainties caused by the positioning inaccuracy, we use the circle-based representation combined with different distance decaying functions. However, in the implementation, we draw instances from the circle-based representation as such instances fit in arbitrary distributions and make computations easier. To deal with uncertainties caused by low sampling issues, the interval uncertainty region model is used, corresponding to Case 4 (circle-based representation) in Equation (1).</p></list-item>
</list>
<p>To process SDM efficiently, a set of acceleration techniques are proposed. First, to reduce the number of pairwise distance computations, we prune unpromising object pairs based on the floors they are located on, the indoor topology, and their probability distributions. Second, we propose a batch processing strategy to group nearby objects into one large group, and update the distances between the objects within and outside the group in one pass.</p>
</sec>
</sec>
<sec id="s5">
<title>5. Future directions</title>
<p>Above, we have introduced recent advances in addressing complicated analytical contexts and spatial data uncertainties in SDAIB. Next, we discuss new opportunities that emerging technologies bring to SDAIB.</p>
<list list-type="bullet">
<list-item><p><bold>Decentralized and edge-resident SDAIB</bold>. In the past decade, SDAIB tasks have been mainly run in a centralized manner, which potentially incurs high costs in the data center for data integration, computation, and storage. Currently, edge computing (Mao et al., <xref ref-type="bibr" rid="B58">2017</xref>) emerges as a novel paradigm, pushing the data processing as close as possible to the place where the data is generated. In this way, reduced computation workload of the data center and improved service responsiveness are achieved. There may be hundreds or thousands of IoT devices in an intelligent building environment, using different working mechanisms to collect, transmit, and process data. Spatial data analyses feature inherent locality, i.e., data generated locally can be analyzed locally. This means that the main process of a spatial data analysis task is very suitable for running in decentralized devices resident in the IoT edge. However, it is nontrivial to realize such a decentralized, edge-resident SDAIB. Specifically, IoT devices in intelligent buildings are heterogeneous and dynamic, with distinct computing capabilities and life cycles. To make appropriate task decomposition and task assignment among IoT devices, new techniques for profiling IoT devices, modeling tasks&#x00027; objectives (e.g., processing latency, accuracy, and transmitted data volume), and efficient coordination are required. Recent efforts (Ma et al., <xref ref-type="bibr" rid="B57">2019</xref>; Li et al., <xref ref-type="bibr" rid="B46">2022</xref>) have been made to manage and analyze spatial and spatiotemporal data in the IoT edge computing environments.</p></list-item>
<list-item><p><bold>Privacy-preserving SDAIB</bold>. In the new technology ecosystem, another important issue is data security and privacy protection. Compared with data centers, IoT devices are less protected, and their data computation and storage face security risks. In the foreseeable future, more and more spatial data collected in intelligent buildings will be anonymized, obfuscated, encrypted, or removed, which will bring greater challenges to analyses. On the one hand, the efficiency of analysis will potentially be degraded due to more complex data security schemes; on the other hand, privacy-enhanced data may lead to information loss and thus degradation of the results&#x00027; effectiveness. Some potential techniques can be applied to privacy-preserving SDAIB. First, representation learning techniques (Kumar et al., <xref ref-type="bibr" rid="B36">2021</xref>) can be introduced into a hierarchical analysis process from end devices to IoT devices to the data center, in which generated data encodings are exchanged in the IoT and the network, and end devices and the data center take the normal input and output. Second, data summaries (Siddique and Eldawy, <xref ref-type="bibr" rid="B70">2018</xref>), specific to particular data analysis tasks, can be used in scenarios that are fault-tolerant. Finally, for some model-driven analyses, federated learning (Kumar et al., <xref ref-type="bibr" rid="B36">2021</xref>) and machine learning in Blockchain (Chen et al., <xref ref-type="bibr" rid="B15">2021</xref>) can help avoid sharing sensitive data while maintaining decent performance.</p></list-item>
<list-item><p><bold>Lightweight and energy-saving SDAIB</bold>. Last but not least, the realization of lightweight and energy-efficient data analysis is of great benefit to intelligent buildings. IoT-enhanced buildings require a lot of energy to acquire data and interact with people and the environment. Hence, it is necessary to cut ineffective operations from data analysis, for sustainable intelligent building applications. We illustrate potentially applicable technologies from several perspectives. (1) From a data perspective, how to sample, retain, and discard data is worth investigating. Many sensors collect redundant data at high frequency, therefore, data sampling strategies based on reinforcement learning (Yoon et al., <xref ref-type="bibr" rid="B83">2020</xref>) will be highly usable, e.g., to adaptively change the sampling rate or the buffer size according to the objective of the analysis task. (2) From a computational perspective, identifying and removing invalid or inefficient operators in the analysis can be explored. For some machine learning tasks, lightweight techniques (Menghani, <xref ref-type="bibr" rid="B59">2021</xref>) such as model compression, knowledge distillation, and parameter pruning have been applied in computer vision and natural language processing fields, but how to design light models for spatial data and spatiotemporal data in IoT scenarios still remains open.</p></list-item>
</list>
<p>Based on the above, we envision an end-edge-cloud architecture for SDAIB, which encompasses various data privacy protection schemes and decentralized algorithms built on them. This architecture also integrates a coordinator module, which provides a quality-aware and energy-efficient mechanism to decompose tasks and assign them to heterogeneous and dynamic IoT devices.</p>
<p>To support the aforementioned future directions, the unified modeling framework proposed in Section 3 should be further improved. In terms of <italic>context modeling</italic>, the framework should be able to represent more complex and diverse IoT devices and building environments. On the one hand, compared to current manually designed models, automated means such as semantic parsing and extraction (Teng et al., <xref ref-type="bibr" rid="B73">2018</xref>; Guo et al., <xref ref-type="bibr" rid="B24">2021</xref>) can be developed to extract contexts from the physical world; on the other hand, data structures with more powerful expression capabilities such as heterogeneous graphs and hypergraphs (Zhou et al., <xref ref-type="bibr" rid="B86">2006</xref>) can be introduced to the modeling framework. In terms of <italic>data uncertainty modeling</italic>, the framework should support the decentralized data setting (Li et al., <xref ref-type="bibr" rid="B46">2022</xref>), i.e., heterogeneous computing nodes generate and consume data with different mechanisms. In this case, the framework should be able to model and analyze the spatial data uncertainty of heterogeneous data nodes, and the merging of such spatial data in different degrees or types of uncertainty.</p>
</sec>
<sec sec-type="conclusions" id="s6">
<title>6. Conclusion</title>
<p>In this article, we focus on advanced techniques of spatial data analysis for intelligent buildings (SDAIB), and identify two general technical challenges in SDAIB, namely the complicated analytical context and inherent spatial data uncertainty. We then revisit recent advances in modeling analytical context and spatial data uncertainty. By summarizing the technical highlights of these modeling approaches, we propose a unified modeling framework that creates a roadmap for handling various analytical contexts and spatial data uncertainties. We demonstrate the support of this unified modeling framework for several SDAIB tasks. Finally, we discuss the future directions of SDAIB in the presence of emerging technologies, which may inspire researchers and practitioners in constructing innovative intelligent building applications.</p>
</sec>
<sec sec-type="data-availability" id="s7">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>HLi and HLu contributed to the conception and design of this work. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>The authors&#x00027; work was supported by the Independent Research Fund Denmark (Grant No. 8022-00366B) and the EU MSCA program (Grant No. 882232).</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>The authors would like to thank the reviewers and the editors for their insightful, careful, and constructive comments. The authors would also like to thank their collaborators in the projects related to this research.</p>
</ack>

<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afyouni</surname> <given-names>I.</given-names></name> <name><surname>Ray</surname> <given-names>C.</given-names></name> <name><surname>Christophe</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Spatial models for context-aware indoor navigation systems: a survey</article-title>. <source>J. Spat. Inform. Sci</source>. <volume>1</volume>, <fpage>85</fpage>&#x02013;<lpage>123</lpage>. <pub-id pub-id-type="doi">10.5311/JOSIS.2012.4.73</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ahmed</surname> <given-names>T.</given-names></name> <name><surname>Pedersen</surname> <given-names>T. B.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>Finding dense locations in symbolic indoor tracking data: modeling, indexing, and processing</article-title>. <source>GeoInformatica</source> <volume>21</volume>, <fpage>119</fpage>&#x02013;<lpage>150</lpage>. <pub-id pub-id-type="doi">10.1007/s10707-016-0276-8</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alamri</surname> <given-names>S.</given-names></name> <name><surname>Taniar</surname> <given-names>D.</given-names></name> <name><surname>Safar</surname> <given-names>M.</given-names></name> <name><surname>Al-Khalidi</surname> <given-names>H.</given-names></name></person-group> (<year>2013</year>). <article-title>Spatiotemporal indexing for moving objects in an indoor cellular space</article-title>. <source>Neurocomputing</source> <volume>122</volume>, <fpage>70</fpage>&#x02013;<lpage>78</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2013.03.035</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Asahara</surname> <given-names>A.</given-names></name> <name><surname>Maruyama</surname> <given-names>K.</given-names></name> <name><surname>Sato</surname> <given-names>A.</given-names></name> <name><surname>Seto</surname> <given-names>K.</given-names></name></person-group> (<year>2011</year>). <article-title>Pedestrian-movement prediction based on mixed markov-chain model,</article-title> in <source>SIGSPATIAL</source> (<publisher-loc>Chicago, IL</publisher-loc>), <fpage>25</fpage>&#x02013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1145/2093973.2093979</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baba</surname> <given-names>A. I.</given-names></name> <name><surname>Jaeger</surname> <given-names>M.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Pedersen</surname> <given-names>T. B.</given-names></name> <name><surname>Ku</surname> <given-names>W.-S.</given-names></name> <name><surname>Xie</surname> <given-names>X.</given-names></name></person-group> (<year>2016</year>). <article-title>Learning-based cleansing for indoor RFID data,</article-title> in <source>SIGMOD</source> (<publisher-loc>San Francisco, CA</publisher-loc>), <fpage>925</fpage>&#x02013;<lpage>936</lpage>. <pub-id pub-id-type="doi">10.1145/2882903.2882907</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baba</surname> <given-names>A. I.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Pedersen</surname> <given-names>T. B.</given-names></name> <name><surname>Xie</surname> <given-names>X.</given-names></name></person-group> (<year>2014</year>). <article-title>Handling false negatives in indoor RFID data,</article-title> in <source>MDM</source> (<publisher-loc>Brisbane</publisher-loc>), Vol. <volume>1</volume>, <fpage>117</fpage>&#x02013;<lpage>126</lpage>. <pub-id pub-id-type="doi">10.1109/MDM.2014.21</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Becker</surname> <given-names>T.</given-names></name> <name><surname>Nagel</surname> <given-names>C.</given-names></name> <name><surname>Kolbe</surname> <given-names>T. H.</given-names></name></person-group> (<year>2009</year>). <article-title>A multilayered space-event model for navigation in indoor spaces,</article-title> in <source>3D Geo-Information Sciences. Lecture Notes in Geoinformation and Cartography</source>, eds <person-group person-group-type="editor"><name><surname>Lee</surname> <given-names>J.</given-names></name> <name><surname>Zlatanova</surname> <given-names>S.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>). <pub-id pub-id-type="doi">10.1007/978-3-540-87395-2_5</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bekkali</surname> <given-names>A.</given-names></name> <name><surname>Sanson</surname> <given-names>H.</given-names></name> <name><surname>Matsumoto</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <article-title>RFID indoor positioning based on probabilistic RFID map and Kalman filtering,</article-title> in <source>WiMob</source> (<publisher-loc>White Plains, NY</publisher-loc>), <fpage>21</fpage>. <pub-id pub-id-type="doi">10.1109/WIMOB.2007.4390815</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belmonte-Hernandez</surname> <given-names>A.</given-names></name> <name><surname>Hernandez-Penaloza</surname> <given-names>G.</given-names></name> <name><surname>Guti&#x000E9;rrez</surname> <given-names>D. M.</given-names></name> <name><surname>Alvarez</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>Recurrent model for wireless indoor tracking and positioning recovering using generative networks</article-title>. <source>IEEE Sensors J</source>. <volume>20</volume>, <fpage>3356</fpage>&#x02013;<lpage>3365</lpage>. <pub-id pub-id-type="doi">10.1109/JSEN.2019.2958201</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhatt</surname> <given-names>M.</given-names></name> <name><surname>Dylla</surname> <given-names>F.</given-names></name> <name><surname>Hois</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Spatio-terminological inference for the design of ambient environments,</article-title> in <source>COSIT</source> (<publisher-loc>Berlin; Heidelberg</publisher-loc>), <fpage>371</fpage>&#x02013;<lpage>391</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-03832-7_23</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Capozzoli</surname> <given-names>A.</given-names></name> <name><surname>Piscitelli</surname> <given-names>M. S.</given-names></name> <name><surname>Gorrino</surname> <given-names>A.</given-names></name> <name><surname>Ballarini</surname> <given-names>I.</given-names></name> <name><surname>Corrado</surname> <given-names>V.</given-names></name></person-group> (<year>2017</year>). <article-title>Data analytics for occupancy pattern learning to reduce the energy consumption of HVAC systems in office buildings</article-title>. <source>Sustain. Cities Soc</source>. <volume>35</volume>, <fpage>191</fpage>&#x02013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.1016/j.scs.2017.07.016</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname> <given-names>H. K.-H.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name></person-group> (<year>2022</year>). <article-title>Continuous social distance monitoring in indoor space</article-title>. <source>Proc. VLDB Endow</source>. <volume>15</volume>, <fpage>1390</fpage>&#x02013;<lpage>1402</lpage>. <pub-id pub-id-type="doi">10.14778/3523210.3523217</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname> <given-names>H. K.-H.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name></person-group> (<year>2021</year>). <article-title>Time-constrained indoor keyword-aware routing,</article-title> in <source>SSTD</source>, 74&#x02013;84. <pub-id pub-id-type="doi">10.1145/3469830.3470895</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheema</surname> <given-names>M. A.</given-names></name></person-group> (<year>2018</year>). <article-title>Indoor location-based services: challenges and opportunities</article-title>. <source>SIGSPATIAL Spec</source>. <volume>10</volume>, <fpage>10</fpage>&#x02013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1145/3292390.3292394</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>F.</given-names></name> <name><surname>Wan</surname> <given-names>H.</given-names></name> <name><surname>Cai</surname> <given-names>H.</given-names></name> <name><surname>Cheng</surname> <given-names>G.</given-names></name></person-group> (<year>2021</year>). <article-title>Machine learning in/for blockchain: future and challenges</article-title>. <source>Can. J. Stat</source>. <volume>49</volume>, <fpage>1364</fpage>&#x02013;<lpage>1382</lpage>. <pub-id pub-id-type="doi">10.1002/cjs.11623</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>P.-H.</given-names></name> <name><surname>Feng</surname> <given-names>F.</given-names></name></person-group> (<year>2009</year>). <article-title>A fast flow control algorithm for real-time emergency evacuation in large indoor areas</article-title>. <source>Fire Saf. J</source>. <volume>44</volume>, <fpage>732</fpage>&#x02013;<lpage>740</lpage>. <pub-id pub-id-type="doi">10.1016/j.firesaf.2009.02.005</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Choi</surname> <given-names>W.</given-names></name> <name><surname>Moon</surname> <given-names>B.</given-names></name> <name><surname>Lee</surname> <given-names>S.</given-names></name></person-group> (<year>2004</year>). <article-title>Adaptive cell-based index for moving objects</article-title>. <source>Data Knowl. Eng</source>. <volume>48</volume>, <fpage>75</fpage>&#x02013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.1016/S0169-023X(03)00120-4</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Choset</surname> <given-names>H.</given-names></name> <name><surname>Burdick</surname> <given-names>J.</given-names></name></person-group> (<year>2000</year>). <article-title>Sensor-based exploration: the hierarchical generalized Voronoi graph</article-title>. <source>Int. J. Robot. Res</source>. <volume>19</volume>, <fpage>96</fpage>&#x02013;<lpage>125</lpage>. <pub-id pub-id-type="doi">10.1177/02783640022066770</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crowley</surname> <given-names>J. L.</given-names></name></person-group> (<year>1989</year>). <article-title>World modeling and position estimation for a mobile robot using ultrasonic ranging,</article-title> in <source>ICRA, Vol. 89</source> (<publisher-loc>Scottsdale, AZ</publisher-loc>), <fpage>674</fpage>&#x02013;<lpage>680</lpage>. <pub-id pub-id-type="doi">10.1109/ROBOT.1989.100062</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daissaoui</surname> <given-names>A.</given-names></name> <name><surname>Boulmakoul</surname> <given-names>A.</given-names></name> <name><surname>Karim</surname> <given-names>L.</given-names></name> <name><surname>Lbath</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>IoT and big data analytics for smart buildings: a survey</article-title>. <source>Proc. Comput. Sci</source>. <volume>170</volume>, <fpage>161</fpage>&#x02013;<lpage>168</lpage>. <pub-id pub-id-type="doi">10.1016/j.procs.2020.03.021</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Elfes</surname> <given-names>A.</given-names></name></person-group> (<year>1989</year>). <article-title>Using occupancy grids for mobile robot perception and navigation</article-title>. <source>Computer</source> <volume>22</volume>, <fpage>46</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1109/2.30720</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feng</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name> <name><surname>Xu</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>Indoor top-k keyword-aware routing query,</article-title> in <source>ICDE</source>, 1213&#x02013;1224. <pub-id pub-id-type="doi">10.1109/ICDE48307.2020.00109</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Franz</surname> <given-names>G.</given-names></name> <name><surname>Mallot</surname> <given-names>H. A.</given-names></name> <name><surname>Wiener</surname> <given-names>J. M.</given-names></name></person-group> (<year>2005</year>). <article-title>Graph-based models of space in architecture and cognitive science: a comparative analysis,</article-title> in <source>INTERSYMP</source> (<publisher-loc>Baden-Baden</publisher-loc>), <fpage>30</fpage>&#x02013;<lpage>38</lpage>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>D.</given-names></name> <name><surname>Teng</surname> <given-names>X.</given-names></name> <name><surname>Guo</surname> <given-names>Y.</given-names></name> <name><surname>Zhou</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name></person-group> (<year>2021</year>). <article-title>SiFi: self-updating of indoor semantic floorplans for annotated objects</article-title>. <source>ACM Trans. Intern. Things</source> <volume>2</volume>, <fpage>1</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1145/3450567</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hightower</surname> <given-names>J.</given-names></name> <name><surname>Borriello</surname> <given-names>G.</given-names></name></person-group> (<year>2004</year>). <article-title>Particle filters for location estimation in ubiquitous computing: a case study,</article-title> in <source>UbiComp</source> (<publisher-loc>Nottingham</publisher-loc>), <fpage>88</fpage>&#x02013;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-30119-6_6</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunter</surname> <given-names>G. J.</given-names></name></person-group> (<year>1999</year>). <article-title>Managing uncertainty in GIS</article-title>. <source>Geograph. Inform. Syst</source>. <volume>2</volume>, <fpage>633</fpage>&#x02013;<lpage>641</lpage>.</citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jensen</surname> <given-names>C. S.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Yang</surname> <given-names>B.</given-names></name></person-group> (<year>2009a</year>). <article-title>Graph model based indoor tracking,</article-title> in <source>MDM</source> (<publisher-loc>Taipei</publisher-loc>), <fpage>122</fpage>&#x02013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1109/MDM.2009.23</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jensen</surname> <given-names>C. S.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Yang</surname> <given-names>B.</given-names></name></person-group> (<year>2009b</year>). <article-title>Indexing the trajectories of moving objects in symbolic indoor space,</article-title> in <source>SSTD</source> (<publisher-loc>Aalborg</publisher-loc>), <fpage>208</fpage>&#x02013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-02982-0_15</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jeung</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Sathe</surname> <given-names>S.</given-names></name> <name><surname>Yiu</surname> <given-names>M. L.</given-names></name></person-group> (<year>2013</year>). <article-title>Managing evolving uncertainty in trajectory databases</article-title>. <source>IEEE Trans. Knowl. Data Eng</source>. <volume>26</volume>, <fpage>1692</fpage>&#x02013;<lpage>1705</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2013.141</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>P.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Zhao</surname> <given-names>J.</given-names></name> <name><surname>Zhao</surname> <given-names>L.</given-names></name> <name><surname>Yue</surname> <given-names>L.</given-names></name></person-group> (<year>2012</year>). <article-title>Semantics and modeling of indoor moving objects</article-title>. <source>Int. J. Multimed. Ubiquit. Eng</source>. <volume>7</volume>, <fpage>153</fpage>&#x02013;<lpage>158</lpage>. Retrieved from: <ext-link ext-link-type="uri" xlink:href="https://gvpress.com/journals/IJMUE/vol7_no2/11.pdf">https://gvpress.com/journals/IJMUE/vol7_no2/11.pdf</ext-link></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kamkarian</surname> <given-names>P.</given-names></name> <name><surname>Hexmoor</surname> <given-names>H.</given-names></name></person-group> (<year>2012</year>). <article-title>Modeling crowd evacuation from indoor spaces,</article-title> in <source>ICAI</source>, 1. <pub-id pub-id-type="doi">10.1155/2012/340615</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kang</surname> <given-names>H.-K.</given-names></name> <name><surname>Li</surname> <given-names>K.-J.</given-names></name></person-group> (<year>2017</year>). <article-title>A standard indoor spatial data model-OGC IndoorGML and implementation approaches</article-title>. <source>ISPRS Int. J. Geo-Inform</source>. <volume>6</volume>:<fpage>116</fpage>. <pub-id pub-id-type="doi">10.3390/ijgi6040116</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>J.-S.</given-names></name> <name><surname>Kang</surname> <given-names>H.-Y.</given-names></name> <name><surname>Lee</surname> <given-names>T.-H.</given-names></name> <name><surname>Li</surname> <given-names>K.-J.</given-names></name></person-group> (<year>2009</year>). <article-title>Topology of the prism model for 3D indoor spatial objects,</article-title> in <source>MDM: Systems, Services and Middleware</source> (<publisher-loc>Taipei</publisher-loc>), <fpage>698</fpage>&#x02013;<lpage>703</lpage>. <pub-id pub-id-type="doi">10.1109/MDM.2009.119</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>Y.</given-names></name> <name><surname>Kang</surname> <given-names>H.</given-names></name> <name><surname>Lee</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Developing CityGML indoor ADE to manage indoor facilities,</article-title> in <source>Innovations in 3D Geo-Information Sciences. Lecture Notes in Geoinformation and Cartography</source>, ed <person-group person-group-type="editor"><name><surname>Isikdag</surname> <given-names>U</given-names></name></person-group> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>). <pub-id pub-id-type="doi">10.1007/978-3-319-00515-7_15</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kolbe</surname> <given-names>T. H.</given-names></name> <name><surname>Gr&#x000F6;ger</surname> <given-names>G.</given-names></name> <name><surname>Pl&#x000FC;mer</surname> <given-names>L.</given-names></name></person-group> (<year>2005</year>). <article-title>CityGML: Interoperable access to 3D city models,</article-title> in <source>Geo-information for Disaster Management</source>, eds <person-group person-group-type="editor"><name><surname>van Oosterom</surname> <given-names>P.</given-names></name> <name><surname>Zlatanova</surname> <given-names>S.</given-names></name> <name><surname>Fendel</surname> <given-names>E. M.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>). <pub-id pub-id-type="doi">10.1007/3-540-27468-5_63</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>P.</given-names></name> <name><surname>Gupta</surname> <given-names>G. P.</given-names></name> <name><surname>Tripathi</surname> <given-names>R.</given-names></name></person-group> (<year>2021</year>). <article-title>PEFL: Deep privacy-encoding-based federated learning framework for smart agriculture</article-title>. <source>IEEE Micro</source> <volume>42</volume>, <fpage>33</fpage>&#x02013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1109/MM.2021.3112476</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lam</surname> <given-names>W. H.</given-names></name> <name><surname>Lee</surname> <given-names>J. Y.</given-names></name> <name><surname>Chan</surname> <given-names>K.</given-names></name> <name><surname>Goh</surname> <given-names>P.</given-names></name></person-group> (<year>2003</year>). <article-title>A generalised function for modeling bi-directional flow effects on indoor walkways in Hong Kong</article-title>. <source>Transport. Res. Part A</source> <volume>37</volume>, <fpage>789</fpage>&#x02013;<lpage>810</lpage>. <pub-id pub-id-type="doi">10.1016/S0965-8564(03)00058-2</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laursen</surname> <given-names>T.</given-names></name> <name><surname>Pedersen</surname> <given-names>N. B.</given-names></name> <name><surname>Nielsen</surname> <given-names>J. J.</given-names></name> <name><surname>Madsen</surname> <given-names>T. K.</given-names></name></person-group> (<year>2012</year>). <article-title>Hidden Markov model based mobility learning for improving indoor tracking of mobile users,</article-title> in <source>WPNC</source> (<publisher-loc>Dresden</publisher-loc>), <fpage>100</fpage>&#x02013;<lpage>104</lpage>. <pub-id pub-id-type="doi">10.1109/WPNC.2012.6268746</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>A spatial access-oriented implementation of a 3-D GIS topological data model for urban entities</article-title>. <source>GeoInformatica</source> <volume>8</volume>, <fpage>237</fpage>&#x02013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1023/B:GEIN.0000034820.93914.d0</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Lee</surname> <given-names>D. L.</given-names></name></person-group> (<year>2008</year>). <article-title>A lattice-based semantic location model for indoor navigation,</article-title> in <source>MDM</source> (<publisher-loc>Beijing</publisher-loc>), <fpage>17</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1109/MDM.2008.11</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Cheema</surname> <given-names>M. A.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>G.</given-names></name></person-group> (<year>2020a</year>). <article-title>Indoor mobility semantics annotation using coupled conditional Markov networks,</article-title> in <source>ICDE</source> (<publisher-loc>Dallas, TX</publisher-loc>), <fpage>1441</fpage>&#x02013;<lpage>1452</lpage>. <pub-id pub-id-type="doi">10.1109/ICDE48307.2020.00128</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Chen</surname> <given-names>G.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Chen</surname> <given-names>Q.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name></person-group> (<year>2020b</year>). <article-title>Toward translating raw indoor positioning data into mobility semantics</article-title>. <source>ACM Trans. Data Sci</source>. <volume>1</volume>, <fpage>1</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1145/3385190</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Shi</surname> <given-names>F.</given-names></name> <name><surname>Chen</surname> <given-names>G.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name></person-group> (<year>2018a</year>). <article-title>TRIPS: A system for translating raw indoor positioning data into visual mobility semantics</article-title>. <source>Proc. VLDB Endow</source>. <volume>11</volume>, <fpage>1918</fpage>&#x02013;<lpage>1921</lpage>. <pub-id pub-id-type="doi">10.14778/3229863.3236224</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>G.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name></person-group> (<year>2018b</year>). <article-title>Finding most popular indoor semantic locations using uncertain mobility data</article-title>. <source>IEEE Trans. Knowl. Data Eng</source>. <volume>31</volume>, <fpage>2108</fpage>&#x02013;<lpage>2123</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2018.2875096</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>G.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name></person-group> (<year>2018c</year>). <article-title>In search of indoor dense regions: an approach using indoor positioning data</article-title>. <source>IEEE Trans. Knowl. Data Eng</source>. <volume>30</volume>, <fpage>1481</fpage>&#x02013;<lpage>1495</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2018.2799215</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Yi</surname> <given-names>L.</given-names></name> <name><surname>Tang</surname> <given-names>B.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Jensen</surname> <given-names>C. S.</given-names></name></person-group> (<year>2022</year>). <article-title>Efficient and error-bounded spatiotemporal quantile monitoring in edge computing environments</article-title>. <source>Proc. VLDB Endow</source>. <volume>15</volume>, <fpage>1753</fpage>&#x02013;<lpage>1765</lpage>. <pub-id pub-id-type="doi">10.14778/3538598.3538600</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>H.</given-names></name> <name><surname>Peng</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Chi</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <article-title>Indexing for moving objects in multi-floor indoor spaces that supports complex semantic queries</article-title>. <source>ISPRS Int. J. Geo-Inform</source>. <volume>5</volume>:<fpage>176</fpage>. <pub-id pub-id-type="doi">10.3390/ijgi5100176</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>Y.-H.</given-names></name> <name><surname>Liu</surname> <given-names>Y.-S.</given-names></name> <name><surname>Gao</surname> <given-names>G.</given-names></name> <name><surname>Han</surname> <given-names>X.-G.</given-names></name> <name><surname>Lai</surname> <given-names>C.-Y.</given-names></name> <name><surname>Gu</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>The IFC-based path planning for 3D indoor spaces</article-title>. <source>Adv. Eng. Inform</source>. <volume>27</volume>, <fpage>189</fpage>&#x02013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.1016/j.aei.2012.10.001</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>Z.</given-names></name> <name><surname>Xu</surname> <given-names>Z.</given-names></name> <name><surname>Hu</surname> <given-names>D.</given-names></name> <name><surname>Hu</surname> <given-names>Q.</given-names></name> <name><surname>Li</surname> <given-names>W.</given-names></name></person-group> (<year>2017</year>). <article-title>Hybrid spatial data model for indoor space: combined topology and grid</article-title>. <source>ISPRS Int. J. Geo-Inform</source>. <volume>6</volume>:<fpage>343</fpage>. <pub-id pub-id-type="doi">10.3390/ijgi6110343</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Darabi</surname> <given-names>H.</given-names></name> <name><surname>Banerjee</surname> <given-names>P.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Survey of wireless indoor positioning techniques and systems</article-title>. <source>IEEE Trans. Syst. Man Cybern. Part C</source> <volume>37</volume>, <fpage>1067</fpage>&#x02013;<lpage>1080</lpage>. <pub-id pub-id-type="doi">10.1109/TSMCC.2007.905750</pub-id><pub-id pub-id-type="pmid">33348701</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Feng</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Cheema</surname> <given-names>M. A.</given-names></name> <name><surname>Cheng</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2021a</year>). <article-title>Towards indoor temporal-variation aware shortest path query</article-title>. <source>IEEE Trans. Knowl. Data Eng</source>. <volume>15</volume>:<fpage>1</fpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2021.3076144</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Cheema</surname> <given-names>M. A.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name></person-group> (<year>2021b</year>). <article-title>Indoor spatial queries: modeling, indexing, and processing,</article-title> in <source>EDBT</source> (<publisher-loc>Nicosia</publisher-loc>), <fpage>181</fpage>&#x02013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.5441/002/edbt.2021.17</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Cheema</surname> <given-names>M. A.</given-names></name> <name><surname>Shou</surname> <given-names>L.</given-names></name></person-group> (<year>2021c</year>). <article-title>Towards crowd-aware indoor path planning</article-title>. <source>Proc. VLDB Endow</source>. <volume>14</volume>, <fpage>1365</fpage>&#x02013;<lpage>1377</lpage>. <pub-id pub-id-type="doi">10.14778/3457390.3457401</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Cao</surname> <given-names>X.</given-names></name> <name><surname>Jensen</surname> <given-names>C. S.</given-names></name></person-group> (<year>2012</year>). <article-title>A foundation for efficient indoor distance-aware query processing,</article-title> in <source>ICDE</source> (<publisher-loc>Arlington, VA</publisher-loc>), <fpage>438</fpage>&#x02013;<lpage>449</lpage>. <pub-id pub-id-type="doi">10.1109/ICDE.2012.44</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Guo</surname> <given-names>C.</given-names></name> <name><surname>Yang</surname> <given-names>B.</given-names></name> <name><surname>Jensen</surname> <given-names>C. S.</given-names></name></person-group> (<year>2016</year>). <article-title>Finding frequently visited indoor POIs using symbolic indoor tracking data,</article-title> in <source>EDBT</source> (<publisher-loc>Bordeaux</publisher-loc>), <fpage>449</fpage>&#x02013;<lpage>460</lpage>. <pub-id pub-id-type="doi">10.5441/002/edbt.2016.41</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Yang</surname> <given-names>B.</given-names></name> <name><surname>Jensen</surname> <given-names>C. S.</given-names></name></person-group> (<year>2011</year>). <article-title>Spatio-temporal joins on symbolic indoor tracking data,</article-title> in <source>ICDE</source> (<publisher-loc>Hannover</publisher-loc>), <fpage>816</fpage>&#x02013;<lpage>827</lpage>. <pub-id pub-id-type="doi">10.1109/ICDE.2011.5767902</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Liang</surname> <given-names>J.</given-names></name> <name><surname>Yang</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Ma</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>SLS-STQ: a novel scheme for securing spatial-temporal top-k queries in TWSNs-based edge computing systems</article-title>. <source>IEEE Intern. Things J</source>. <volume>6</volume>, <fpage>10093</fpage>&#x02013;<lpage>10104</lpage>. <pub-id pub-id-type="doi">10.1109/JIOT.2019.2935738</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname> <given-names>Y.</given-names></name> <name><surname>You</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>K.</given-names></name> <name><surname>Letaief</surname> <given-names>K. B.</given-names></name></person-group> (<year>2017</year>). <article-title>A survey on mobile edge computing: the communication perspective</article-title>. <source>IEEE Commun. Surveys Tutor</source>. <volume>19</volume>, <fpage>2322</fpage>&#x02013;<lpage>2358</lpage>. <pub-id pub-id-type="doi">10.1109/COMST.2017.2745201</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menghani</surname> <given-names>G.</given-names></name></person-group> (<year>2021</year>). <article-title>Efficient deep learning: a survey on making deep learning models smaller, faster, and better</article-title>. <source>arXiv[Preprint].arXiv:2106.08962</source>. <pub-id pub-id-type="doi">10.48550/arXiv.2106.08962</pub-id></citation>
</ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mondal</surname> <given-names>J.</given-names></name> <name><surname>Deshpande</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Managing large dynamic graphs efficiently,</article-title> in <source>SIGMOD</source> (<publisher-loc>Scottsdale, AZ</publisher-loc>), <fpage>145</fpage>&#x02013;<lpage>156</lpage>. <pub-id pub-id-type="doi">10.1145/2213836.2213854</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ni</surname> <given-names>F.</given-names></name> <name><surname>Nguyen</surname> <given-names>P. H.</given-names></name> <name><surname>Cobben</surname> <given-names>J. F.</given-names></name> <name><surname>van den Brom</surname> <given-names>H. E.</given-names></name> <name><surname>Zhao</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>Uncertainty analysis of aggregated smart meter data for state estimation,</article-title> in <source>AMPS</source> (<publisher-loc>Aachen</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1109/AMPS.2016.7602805</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oztekin</surname> <given-names>A.</given-names></name> <name><surname>Pajouh</surname> <given-names>F. M.</given-names></name> <name><surname>Delen</surname> <given-names>D.</given-names></name> <name><surname>Swim</surname> <given-names>L. K.</given-names></name></person-group> (<year>2010</year>). <article-title>An RFID network design methodology for asset tracking in healthcare</article-title>. <source>Decis. Support Syst</source>. <volume>49</volume>, <fpage>100</fpage>&#x02013;<lpage>109</lpage>. <pub-id pub-id-type="doi">10.1016/j.dss.2010.01.007</pub-id></citation>
</ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parent</surname> <given-names>C.</given-names></name> <name><surname>Spaccapietra</surname> <given-names>S.</given-names></name> <name><surname>Renso</surname> <given-names>C.</given-names></name> <name><surname>Andrienko</surname> <given-names>G.</given-names></name> <name><surname>Andrienko</surname> <given-names>N.</given-names></name> <name><surname>Bogorny</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Semantic trajectories modeling and analysis</article-title>. <source>ACM Comput. Surveys</source> <volume>45</volume>, <fpage>1</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1145/2501654.2501656</pub-id></citation>
</ref>
<ref id="B64">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Patel</surname> <given-names>H. A.</given-names></name> <name><surname>Thakore</surname> <given-names>D. G.</given-names></name></person-group> (<year>2013</year>). <article-title>Moving object tracking using Kalman filter</article-title>. <source>Int. J. Comput. Sci. Mobile Comput</source>. <volume>2</volume>, <fpage>326</fpage>&#x02013;<lpage>332</lpage>. Retrieved from: <ext-link ext-link-type="uri" xlink:href="https://www.ijcsmc.com/docs/papers/April2013/V2I4201376.pdf">https://www.ijcsmc.com/docs/papers/April2013/V2I4201376.pdf</ext-link></citation>
</ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Penninga</surname> <given-names>F.</given-names></name> <name><surname>Oosterom</surname> <given-names>P. v.</given-names></name> <name><surname>Kazar</surname> <given-names>B. M.</given-names></name></person-group> (<year>2006</year>). <article-title>A tetrahedronized irregular network based DBMS approach for 3D topographic data modeling,</article-title> in <source>Progress in Spatial Data Handling</source>, eds <person-group person-group-type="editor"><name><surname>Riedl</surname> <given-names>A.</given-names></name> <name><surname>Kainz</surname> <given-names>W.</given-names></name> <name><surname>Elmes</surname> <given-names>G. A.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>). <pub-id pub-id-type="doi">10.1007/3-540-35589-8_37</pub-id></citation>
</ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petzold</surname> <given-names>J.</given-names></name> <name><surname>Pietzowski</surname> <given-names>A.</given-names></name> <name><surname>Bagci</surname> <given-names>F.</given-names></name> <name><surname>Trumler</surname> <given-names>W.</given-names></name> <name><surname>Ungerer</surname> <given-names>T.</given-names></name></person-group> (<year>2005</year>). <article-title>Prediction of indoor movements using Bayesian networks,</article-title> in <source>LoCA</source> (<publisher-loc>Oberpfaffenhofen</publisher-loc>), <fpage>211</fpage>&#x02013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1007/11426646_20</pub-id><pub-id pub-id-type="pmid">33203034</pub-id></citation></ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pfoser</surname> <given-names>D.</given-names></name> <name><surname>Jensen</surname> <given-names>C. S.</given-names></name></person-group> (<year>1999</year>). <article-title>Capturing the uncertainty of moving-object representations,</article-title> in <source>SSD</source>, 111&#x02013;131. <pub-id pub-id-type="doi">10.1007/3-540-48482-5_9</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Qolomany</surname> <given-names>B.</given-names></name> <name><surname>Al-Fuqaha</surname> <given-names>A.</given-names></name> <name><surname>Gupta</surname> <given-names>A.</given-names></name> <name><surname>Benhaddou</surname> <given-names>D.</given-names></name> <name><surname>Alwajidi</surname> <given-names>S.</given-names></name> <name><surname>Qadir</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Leveraging machine learning and big data for smart buildings: a comprehensive survey</article-title>. <source>IEEE Access</source> <volume>7</volume>, <fpage>90316</fpage>&#x02013;<lpage>90356</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2019.2926642</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shao</surname> <given-names>Z.</given-names></name> <name><surname>Cheema</surname> <given-names>M. A.</given-names></name> <name><surname>Taniar</surname> <given-names>D.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name></person-group> (<year>2016</year>). <article-title>VIP-Tree: an effective index for indoor spatial queries</article-title>. <source>Proc. VLDB Endow</source>. <volume>10</volume>, <fpage>325</fpage>&#x02013;<lpage>336</lpage>. <pub-id pub-id-type="doi">10.14778/3025111.3025115</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><collab>Siddique A. and Eldawy, A..</collab></person-group> (<year>2018</year>). <article-title>Experimental evaluation of sketching techniques for big spatial data,</article-title> in <source>SoCC</source> (<publisher-loc>Carlsbad, CA</publisher-loc>), <fpage>522</fpage>. <pub-id pub-id-type="doi">10.1145/3267809.3275464</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tariq</surname> <given-names>O. B.</given-names></name> <name><surname>Lazarescu</surname> <given-names>M. T.</given-names></name> <name><surname>Lavagno</surname> <given-names>L.</given-names></name></person-group> (<year>2021</year>). <article-title>Neural networks for indoor person tracking with infrared sensors</article-title>. <source>IEEE Sensors Lett</source>. <volume>5</volume>, <fpage>1</fpage>&#x02013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.1109/LSENS.2021.3049706</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Teng</surname> <given-names>S.-Y.</given-names></name> <name><surname>Ku</surname> <given-names>W.-S.</given-names></name> <name><surname>Chuang</surname> <given-names>K.-T.</given-names></name></person-group> (<year>2017</year>). <article-title>Toward mining stop-by behaviors in indoor space</article-title>. <source>ACM Trans. Spat. Algorth. Syst</source>. 3, 1-38. <pub-id pub-id-type="doi">10.1145/3106736</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Teng</surname> <given-names>X.</given-names></name> <name><surname>Guo</surname> <given-names>D.</given-names></name> <name><surname>Guo</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name></person-group> (<year>2018</year>). <article-title>SISE: Self-updating of indoor semantic floorplans for general entities</article-title>. <source>IEEE Trans. Mobile Comput</source>. <volume>17</volume>, <fpage>2646</fpage>&#x02013;<lpage>2659</lpage>. <pub-id pub-id-type="doi">10.1109/TMC.2018.2812752</pub-id></citation>
</ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tobler</surname> <given-names>W.</given-names></name></person-group> (<year>2004</year>). <article-title>On the first law of geography: a reply</article-title>. <source>Ann. Assoc. Am. Geograph</source>. <volume>94</volume>, <fpage>304</fpage>&#x02013;<lpage>310</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8306.2004.09402009.x</pub-id></citation>
</ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walton</surname> <given-names>L. A.</given-names></name> <name><surname>Worboys</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>A qualitative bigraph model for indoor space,</article-title> in <source>GIScience</source> (<publisher-loc>Columbus, OH</publisher-loc>), <fpage>226</fpage>&#x02013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-33024-7_17</pub-id></citation>
</ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Luo</surname> <given-names>Y.</given-names></name></person-group> (<year>2017</year>). <article-title>Research on wifi indoor location algorithm based on RSSI ranging,</article-title> in <source>ICISCE</source> (<publisher-loc>Changsha</publisher-loc>), <fpage>1694</fpage>&#x02013;<lpage>1698</lpage>. <pub-id pub-id-type="doi">10.1109/ICISCE.2017.354</pub-id></citation>
</ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Worboys</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Modeling indoor space,</article-title> in <source>SIGSPATIAL ISA</source> (<publisher-loc>Chicago, IL</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1145/2077357.2077358</pub-id></citation>
</ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>H.</given-names></name> <name><surname>Chou</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>G.</given-names></name> <name><surname>Shi</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Indoor space model based on knowledge graph,</article-title> in <source>IHMSC, Vol. 2</source> (<publisher-loc>Hangzhou</publisher-loc>), <fpage>72</fpage>&#x02013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1109/IHMSC.2018.10123</pub-id></citation>
</ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xie</surname> <given-names>X.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Pedersen</surname> <given-names>T. B.</given-names></name></person-group> (<year>2013</year>). <article-title>Efficient distance-aware query evaluation on indoor moving objects,</article-title> in <source>ICDE</source> (<publisher-loc>Brisbane</publisher-loc>), <fpage>434</fpage>&#x02013;<lpage>445</lpage>. <pub-id pub-id-type="doi">10.1109/ICDE.2013.6544845</pub-id></citation>
</ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xie</surname> <given-names>X.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Pedersen</surname> <given-names>T. B.</given-names></name></person-group> (<year>2014</year>). <article-title>Distance-aware join for indoor moving objects</article-title>. <source>IEEE Trans. Knowl. Data Eng</source>. <volume>27</volume>, <fpage>428</fpage>&#x02013;<lpage>442</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2014.2330834</pub-id></citation>
</ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yan</surname> <given-names>Z.</given-names></name> <name><surname>Chakraborty</surname> <given-names>D.</given-names></name> <name><surname>Parent</surname> <given-names>C.</given-names></name> <name><surname>Spaccapietra</surname> <given-names>S.</given-names></name> <name><surname>Aberer</surname> <given-names>K.</given-names></name></person-group> (<year>2013</year>). <article-title>Semantic trajectories: mobility data computation and annotation</article-title>. <source>ACM Trans. Intell. Syst. Technol</source>. <volume>4</volume>, <fpage>1</fpage>&#x02013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1145/2483669.2483682</pub-id></citation>
</ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>B.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name> <name><surname>Jensen</surname> <given-names>C. S.</given-names></name></person-group> (<year>2010</year>). <article-title>Probabilistic threshold k nearest neighbor queries over moving objects in symbolic indoor space,</article-title> in <source>EDBT</source> (<publisher-loc>Lausanne</publisher-loc>), <fpage>335</fpage>&#x02013;<lpage>346</lpage>. <pub-id pub-id-type="doi">10.1145/1739041.1739083</pub-id></citation>
</ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yoon</surname> <given-names>J.</given-names></name> <name><surname>Arik</surname> <given-names>S.</given-names></name> <name><surname>Pfister</surname> <given-names>T.</given-names></name></person-group> (<year>2020</year>). <article-title>Data valuation using reinforcement learning,</article-title> in <source>ICML</source>, <fpage>10842</fpage>&#x02013;<lpage>10851</lpage>.</citation>
</ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>J.</given-names></name> <name><surname>Ku</surname> <given-names>W.-S.</given-names></name> <name><surname>Sun</surname> <given-names>M.-T.</given-names></name> <name><surname>Lu</surname> <given-names>H.</given-names></name></person-group> (<year>2013</year>). <article-title>An RFID and particle filter-based indoor spatial query evaluation system,</article-title> in <source>EDBT</source> (<publisher-loc>Genoa</publisher-loc>), <fpage>263</fpage>&#x02013;<lpage>274</lpage>. <pub-id pub-id-type="doi">10.1145/2452376.2452408</pub-id></citation>
</ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zaki</surname> <given-names>A.</given-names></name> <name><surname>Attia</surname> <given-names>M.</given-names></name> <name><surname>Hegazy</surname> <given-names>D.</given-names></name> <name><surname>Amin</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Comprehensive survey on dynamic graph models</article-title>. <source>Int. J. Adv. Comput. Sci. Appl</source>. 7. <pub-id pub-id-type="doi">10.14569/IJACSA.2016.070273</pub-id></citation>
</ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>D.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Sch&#x000F6;lkopf</surname> <given-names>B.</given-names></name></person-group> (<year>2006</year>). <article-title>Learning with hypergraphs: clustering, classification, and embedding,</article-title> in <source>Advances in Neural Information Processing System</source> (<publisher-loc>Vancouver</publisher-loc>), <fpage>19</fpage>.</citation>
</ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Z&#x000FC;fle</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>Uncertain spatial data management: an overview,</article-title> in <source>Handbook of Big Geospatial Data</source>, eds <person-group person-group-type="editor"><name><surname>Werner</surname> <given-names>M.</given-names></name> <name><surname>Chiang</surname> <given-names>Y. Y.</given-names></name></person-group> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>). <pub-id pub-id-type="doi">10.1007/978-3-030-55462-0_14</pub-id></citation>
</ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Z&#x000FC;fle</surname> <given-names>A.</given-names></name> <name><surname>Trajcevski</surname> <given-names>G.</given-names></name> <name><surname>Pfoser</surname> <given-names>D.</given-names></name> <name><surname>Renz</surname> <given-names>M.</given-names></name> <name><surname>Rice</surname> <given-names>M. T.</given-names></name> <name><surname>Leslie</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Handling uncertainty in geo-spatial data,</article-title> in <source>ICDE</source> (<publisher-loc>San Diego, CA</publisher-loc>), <fpage>1467</fpage>&#x02013;<lpage>1470</lpage>. <pub-id pub-id-type="doi">10.1109/ICDE.2017.212</pub-id></citation>
</ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup><ext-link ext-link-type="uri" xlink:href="https://www.researchandmarkets.com/reports/5235835/global-smart-buildings-market-by-component">https://www.researchandmarkets.com/reports/5235835/global-smart-buildings-market-by-component</ext-link></p></fn>
<fn id="fn0002"><p><sup>2</sup>Positioning reference points are the predefined locations in the building, providing calibration information and ground truth for positioning algorithms (Liu et al., <xref ref-type="bibr" rid="B50">2007</xref>).</p></fn>
<fn id="fn0003"><p><sup>3</sup>The maximum speed can be maintained object-wise, as long as prior knowledge is available.</p></fn>
<fn id="fn0004"><p><sup>4</sup>Different notations are used to refer to the base indoor graph structure with partitions as nodes and doors as edges (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>; Worboys, <xref ref-type="bibr" rid="B77">2011</xref>; Afyouni et al., <xref ref-type="bibr" rid="B1">2012</xref>; Lu et al., <xref ref-type="bibr" rid="B54">2012</xref>). In this study, we use the connectivity base graph following an early work (Jensen et al., <xref ref-type="bibr" rid="B27">2009a</xref>).</p></fn>
<fn id="fn0005"><p><sup>5</sup><ext-link ext-link-type="uri" xlink:href="https://en.wikipedia.org/wiki/New_Town_Plaza">https://en.wikipedia.org/wiki/New_Town_Plaza</ext-link></p></fn>
<fn id="fn0006"><p><sup>6</sup><ext-link ext-link-type="uri" xlink:href="https://en.wikipedia.org/wiki/The_Dubai_Mall">https://en.wikipedia.org/wiki/The_Dubai_Mall</ext-link></p></fn>
<fn id="fn0007"><p><sup>7</sup><ext-link ext-link-type="uri" xlink:href="https://en.wikipedia.org/wiki/King_Fahd_International_Airport">https://en.wikipedia.org/wiki/King_Fahd_International_Airport</ext-link></p></fn>
<fn id="fn0008"><p><sup>8</sup><ext-link ext-link-type="uri" xlink:href="https://en.wikipedia.org/wiki/Louvre">https://en.wikipedia.org/wiki/Louvre</ext-link></p></fn>
<fn id="fn0009"><p><sup>9</sup>A partial path, in contrast to a complete path, represents those paths that start from the source but have not reached the target.</p></fn>
<fn id="fn0010"><p><sup>10</sup><ext-link ext-link-type="uri" xlink:href="https://covid19.who.int">https://covid19.who.int</ext-link></p></fn>
</fn-group>

</back>
</article>