<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Big Data</journal-id>
<journal-title>Frontiers in Big Data</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Big Data</abbrev-journal-title>
<issn pub-type="epub">2624-909X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fdata.2019.00010</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Big Data</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Discovering Topic-Oriented Highly Interactive Online Communities</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Das</surname> <given-names>Swarna</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/713452/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Anwar</surname> <given-names>Md Musfique</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/714469/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Computer Science and Engineering</institution> <addr-line>Jahangirnagar University</addr-line>, <addr-line>Dhaka</addr-line>, <country>Bangladesh</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Roberto Interdonato, UMR9000 Territoires, Environnement, T&#x000E9;l&#x000E9;d&#x000E9;tection et Information Spatiale (TETIS), France</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Marinette Savonnet, Universit&#x000E9; de Bourgogne, France; Dino Ienco, National Research Institute of Science and Technology for Environment and Agriculture (IRSTEA), France</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Md Musfique Anwar <email>manwar&#x00040;juniv.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Data Mining and Management, a section of the journal Frontiers in Big Data</p></fn></author-notes>
<pub-date pub-type="epub">
<day>06</day>
<month>06</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="collection">
<year>2019</year>
</pub-date>
<volume>2</volume>
<elocation-id>10</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>04</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>05</month>
<year>2019</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2019 Das and Anwar.</copyright-statement>
<copyright-year>2019</copyright-year>
<copyright-holder>Das and Anwar</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Community detection is an interesting field of online social networks. Most existing approaches either consider common attributes of social network users or rely on only social connections among the users. However, not enough attention is paid to the degree of interactions among the community members in the retrieved communities, resulting in less interactive community members. This inactivity will create problems for many businesses as they require highly interactive users to efficiently advertise their marketing information. In this paper, we propose a model to detect topic-oriented densely-connected communities in which community members have active interactions among each other. We conduct experiments on a real dataset to demonstrate the effectiveness of our proposed approach.</p></abstract> <kwd-group>
<kwd>online social network</kwd>
<kwd>interaction strength</kwd>
<kwd>active community</kwd>
<kwd>query cohesiveness</kwd>
<kwd>structure cohesiveness</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="2"/>
<equation-count count="5"/>
<ref-count count="17"/>
<page-count count="6"/>
<word-count count="3686"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Nowadays, Online Social Networks (OSN) are widely used by a large part of the general population. Similar interests, choices, and hobbies tend to form a group of users in a social network known as online community. There have been many attempts to detect these online communities for the purpose of business, marketing, recommendations, biological research, etc. Often the mere use of connection links does not provide an effective group of users. As a result, these groups do not bring efficient results.</p>
<p>There are two types of network topology. One is global, where information of a whole network is captured and another is local, i.e., a network that works with the similar nodes (Tang et al., <xref ref-type="bibr" rid="B15">2017</xref>). There have been many approaches to detect communities and serve various other fields with it (Fortunato and Hric, <xref ref-type="bibr" rid="B7">2016</xref>). An approach to detecting communities is Affinity Propagation, where the network is divided and a multiobjective evolutionary algorithm is introduced (Shang et al., <xref ref-type="bibr" rid="B14">2016</xref>). For the purpose of local community formation, dynamic membership function can be used (Luo et al., <xref ref-type="bibr" rid="B12">2018</xref>). Fuzzy relations can be used for non-overlapping community detection. The nearest node with each node&#x00027;s greater centrality and fuzzy relations are combined for the desired result (Luo et al., <xref ref-type="bibr" rid="B11">2017</xref>).</p>
<p>Recent research works consider social users&#x00027; topical interests in OSNs, e.g., (Yang et al., <xref ref-type="bibr" rid="B16">2013</xref>), in order to find meaningful communities. However, these methods did not focus on the topical interactions among the community members. Therefore, such communities contain many members who have very inactive topical interactions among them which perform poorly in viral marketing. In order to avoid the inactivity problem authors (Lim and Datta, <xref ref-type="bibr" rid="B9">2016</xref>) have proposed an approach where interaction pattern and frequency are considered rather than only counting the following/follower links.</p>
<p>Our observation is that social users have different degrees of topical intimacy among them. In this work, we propose an approach to discover topic-oriented <italic>highly interactive</italic> communities in OSNs, where the members in the community should have a certain degree of topical interactions with each other related to a given query. We also emphasize that the members in the retrieved communities should actively interact with at least <italic>k</italic> other members within the community. Below, we summarize our contributions:
<list list-type="bullet">
<list-item><p>We propose a methodology to discover highly interactive online communities where community members have a high degree of interactions with each other on similar topics;</p></list-item>
<list-item><p>We quantify the topical interaction strengths among the users;</p></list-item>
<list-item><p>We perform experiments on a real dataset to demonstrate the effectiveness of the proposed method.</p></list-item>
</list></p></sec>
<sec id="s2">
<title>2. Related Work</title>
<p>Earlier methods for community detection are based on structural information of the social graph such as modularity (Clauset et al., <xref ref-type="bibr" rid="B3">2004</xref>), edge betweenness (Newman and Park, <xref ref-type="bibr" rid="B13">2003</xref>), and neighborhood concepts (Cohen, <xref ref-type="bibr" rid="B4">2008</xref>). Some approaches also considered the textual content published by the users along with social connections to detect like-minded users. For example, SA-Cluster applied random-walk to measure the closeness of a node in an augmented attributed graph (Zhou et al., <xref ref-type="bibr" rid="B17">2009</xref>). A Topic-Link LDA model (Liu et al., <xref ref-type="bibr" rid="B10">2009</xref>) is proposed which considers both the linkage structure and similarities of the contents of edges to detect communities. A probabilistic generative model named as CESNA is proposed by Yang et al. (<xref ref-type="bibr" rid="B16">2013</xref>) and combines community memberships, node attributes, and the network topology to find the communities.</p>
<p>More recently, some approaches have focused on the interaction strength between the users in order to find active communities. Dev et al. (<xref ref-type="bibr" rid="B6">2014</xref>) considered the impact of interaction between users as well as the impact of the group behavior without considering topical attributes of the nodes. Lim and Datta (<xref ref-type="bibr" rid="B9">2016</xref>) proposed the Highly Interactive Community Detection (HICD) method, which constructs a weighted network using the frequency of direct interactions between users. Correa et al. (<xref ref-type="bibr" rid="B5">2012</xref>) proposed the <italic>iTop</italic> algorithm, which constructs a weighted graph based on user interactions and maximizes the local modularity to detect topic-oriented communities based on a set of seed users. However, all these methods ignored topic-wise users&#x00027; inter-activeness. Our goal is to discover communities where users have high interactions with others with regard to the given query consisting a set of topics.</p></sec>
<sec sec-type="methods" id="s3">
<title>3. Methodology</title>
<p>First we formally formulate the problem of discovering highly interactive topical communities in OSNs. Then we give an overview of our proposed approach.</p>
<p><bold>Attributed Social Graph:</bold> An attributed social graph is denoted as <italic>G</italic> &#x0003D; <italic>(U, E</italic>, <inline-formula><mml:math id="M1"><mml:mrow><mml:mi mathvariant="-tex-caligraphic">A</mml:mi></mml:mrow></mml:math></inline-formula><italic>)</italic>, where <italic>U</italic> represents the set of social users (nodes), <italic>E</italic> indicates the set of links (edges) between the users, and <inline-formula><mml:math id="M2"><mml:mrow><mml:mi mathvariant="-tex-caligraphic">A</mml:mi></mml:mrow></mml:math></inline-formula>={<italic>T</italic><sub>1</sub>, <italic>T</italic><sub>2</sub>, &#x02026;, <italic>T</italic><sub><italic>m</italic></sub>} is the set of topics discussed by the social users in <italic>G</italic>.</p>
<p>In Twitter, users mention each other using &#x0201C;&#x00040;.&#x0201D; In order to construct a link (<italic>a, b</italic>) between users, &#x00040;mention is used, i.e., <italic>M</italic><sub><italic>a, b</italic></sub> denotes that user <italic>a</italic> has posted a tweet which contains &#x00040;<italic>b</italic>.</p>
<p><bold><italic>k</italic>-Core:</bold> Given an integer <italic>k</italic> (<italic>k</italic>&#x02265; 0), the <italic>k</italic>-core of a graph <italic>G</italic>, denoted by <italic>C</italic><sup><italic>k</italic></sup>, is the maximal connected sub graph of <italic>G</italic>, such that &#x02200;<italic>u</italic> &#x02208; <italic>C</italic><sup><italic>k</italic></sup>, <inline-formula><mml:math id="M3"><mml:mi>d</mml:mi><mml:mi>e</mml:mi><mml:msub><mml:mrow><mml:mi>g</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>u</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> &#x02265; <italic>k</italic>, where <inline-formula><mml:math id="M4"><mml:mi>d</mml:mi><mml:mi>e</mml:mi><mml:msub><mml:mrow><mml:mi>g</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>u</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> refers to the degree of a node <italic>u</italic> in <italic>C</italic><sup><italic>k</italic></sup>. A <italic>k</italic>-core component <inline-formula><mml:math id="M5"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is considered as a community from a structural point of view.</p>
<p><bold>Node Core Number:</bold> The core number of a social user <italic>u</italic> in a <italic>k</italic>-core induced sub graph from <italic>G</italic> indicates the maximum <italic>k</italic> for which <italic>u</italic> belongs to that <italic>k</italic>-core sub graph.</p>
<p><bold>Topic:</bold> A topic contains a set of related words that represents the topic. For example, the politics topic has words like election, vote, democracy, political party, etc.</p>
<p><bold>Activity:</bold> Any action performed by a social user is referred to as an activity. For example, posting a new tweet or retweeting an existing tweet is considered as an activity. In our work, we consider only those actions that are performed between any two social users. For example, a user <italic>u</italic> in Twitter replies to a tweet posted by user <italic>v</italic>. This activity is recorded as an activity tuple &#x02329;<italic>u, v</italic>, &#x003C8;<sub><italic>uv</italic></sub>&#x0232A;, where &#x003C8;<sub><italic>uv</italic></sub> indicates the set of attributes (topics) exchanged between <italic>u</italic> and <italic>v</italic> (Anwar et al., <xref ref-type="bibr" rid="B1">2018</xref>).</p>
<p><bold>Query:</bold> An input query <italic>Q</italic>={<italic>T</italic><sub>1</sub>, <italic>T</italic><sub>2</sub>&#x02026;, <italic>T</italic><sub><italic>n</italic></sub>} contains a set of query topics.</p>
<p><bold>Active Interaction Edge:</bold> If any two social users <italic>u</italic> and <italic>v</italic> in <italic>G</italic> have a certain number of direct interactions (&#x003B3;(&#x02265; 1)) between them related to <italic>Q</italic>, then we consider the interaction link between those two users as an active interaction edge (<italic>e</italic><sub><italic>uv</italic></sub>). Factor <italic>w</italic><sub><italic>uv</italic></sub> indicates their involvement in direct interactions compared with the most active pair of users in the network.</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M6"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>u</mml:mi><mml:mi>v</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mi>A</mml:mi><mml:mi>C</mml:mi><mml:mi>T</mml:mi><mml:mi>S</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>u</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003C8;</mml:mi></mml:mrow><mml:mrow><mml:mi>u</mml:mi><mml:mi>v</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>U</mml:mi></mml:mrow><mml:mrow><mml:mi>Q</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mi>A</mml:mi><mml:mi>C</mml:mi><mml:mi>T</mml:mi><mml:mi>S</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003C8;</mml:mi></mml:mrow><mml:mrow><mml:mi>x</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>ACTS</italic>(<italic>u, v</italic>, &#x003C8;<sub><italic>uv</italic></sub>) indicates the number of direct interactions between <italic>u</italic> and <italic>v</italic> containing &#x003C8;<sub><italic>uv</italic></sub> &#x02286; <italic>Q</italic>.</p>
<p><bold>Active User:</bold> The users of an active interaction edge <italic>e</italic><sub><italic>uv</italic></sub> are considered as active users. The set of all the active users for a given query <italic>Q</italic> is denoted as <italic>U</italic><sup><italic>Q</italic></sup>.</p>
<sec>
<title>3.1. Problem Definition</title>
<p>Given a graph <italic>G</italic> &#x0003D; <italic>(U, E</italic>, <inline-formula><mml:math id="M7"><mml:mrow><mml:mi mathvariant="-tex-caligraphic">A</mml:mi></mml:mrow></mml:math></inline-formula><italic>)</italic>, an input query <italic>Q</italic> and an integer <italic>k</italic>, we first find the set of active edges between the social users by measuring interaction strength <italic>w</italic><sub><italic>uv</italic></sub>(<italic>w</italic><sub><italic>uv</italic></sub> &#x02208; [0, 1]). Then an induced sub graph <inline-formula><mml:math id="M8"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is considered as an active interactive community if it satisfies the following criteria.</p>
<list list-type="order">
<list-item><p><bold><italic>Connectivity</italic>. </bold><inline-formula><mml:math id="M9"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x02282;</mml:mo><mml:mi>G</mml:mi></mml:math></inline-formula> is connected;</p></list-item>
<list-item><p><bold><italic>Structure cohesiveness</italic>. </bold><inline-formula><mml:math id="M10"><mml:mo>&#x02200;</mml:mo><mml:mi>u</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> has interaction degree of at least <italic>k</italic>;</p></list-item>
<list-item><p><bold><italic>Active interaction</italic>. </bold><inline-formula><mml:math id="M11"><mml:mo>&#x02200;</mml:mo><mml:msub><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>u</mml:mi><mml:mi>v</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>, the interaction strength of <italic>e</italic><sub><italic>uv</italic></sub> is <italic>w</italic><sub><italic>uv</italic></sub>&#x02265;&#x003B8; and &#x003B8; &#x02208; [0, 1] is a threshold.</p></list-item>
</list>
<p><xref ref-type="fig" rid="F1">Figure 1</xref> shows a social graph <italic>G</italic> with the core number for each node, e.g., the three-core nodes are {A,B,C,I}. <xref ref-type="table" rid="T1">Table 1</xref> represents the interaction frequencies among the users for topic <italic>T</italic><sub>1</sub> and <italic>T</italic><sub>2</sub>. In <xref ref-type="table" rid="T2">Table 2</xref>, we show the interactive communities for a query <italic>Q</italic> &#x0003D; {<italic>T</italic><sub>1</sub>, <italic>T</italic><sub>2</sub>}. We get different community members for different values of <italic>Q</italic>, <italic>k</italic>, and &#x003B8;. For example, when <italic>Q</italic>={<italic>T</italic><sub>1</sub>}, <italic>k</italic> &#x0003D; {2}, and &#x003B8; &#x0003D; {0.4}, we get <inline-formula><mml:math id="M12"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,I}, <inline-formula><mml:math id="M13"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,Q,R,S,T} while for the same values of <italic>Q</italic> and &#x003B8; with an increase value of <italic>k</italic> &#x0003D; {3}, we get <inline-formula><mml:math id="M14"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,I}, <inline-formula><mml:math id="M15"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,S,T}. Again, for <italic>Q</italic>={<italic>T</italic><sub>1</sub>, <italic>T</italic><sub>2</sub>}, <italic>k</italic> &#x0003D; {2} and &#x003B8; &#x0003D; {0.5}, we get <inline-formula><mml:math id="M16"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,D,H,I}, <inline-formula><mml:math id="M17"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,Q,R,S,T} and <inline-formula><mml:math id="M18"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {M,N,U}</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Social Graph (the number denotes the node core number).</p></caption>
<graphic xlink:href="fdata-02-00010-g0001.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Interaction log.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>(e<sub><italic>ab</italic></sub>)</bold></th>
<th valign="top" align="center"><bold><italic>T</italic><sub>1</sub></bold></th>
<th valign="top" align="center"><bold><italic>T</italic><sub>2</sub></bold></th>
<th valign="top" align="center"><bold>(e<sub><italic>ab</italic></sub>)</bold></th>
<th valign="top" align="center"><bold><italic>T</italic><sub>1</sub></bold></th>
<th valign="top" align="center"><bold><italic>T</italic><sub>2</sub></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">(M,N)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">(A,B)</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">(M,U)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">(A,C)</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">(N,U)</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">(A,I)</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">(N,O)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">(B,C)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">13</td>
</tr>
<tr>
<td valign="top" align="left">(O,T)</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">(B,I)</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td valign="top" align="left">(O,P)</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">(C,I)</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">9</td>
</tr>
<tr>
<td valign="top" align="left">(O,S)</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">(C,D)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">12</td>
</tr>
<tr>
<td valign="top" align="left">(P,T)</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">(C,H)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">(P,S)</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">(D,H)</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">9</td>
</tr>
<tr>
<td valign="top" align="left">(S,T)</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">(D,E)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">18</td>
</tr>
<tr>
<td valign="top" align="left">(S,R)</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">(H,G)</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td valign="top" align="left">(R,Q)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">(E,F)</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">9</td>
</tr>
<tr>
<td valign="top" align="left">(P,Q)</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">(F,G)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">20</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Community members for different values of <italic>Q</italic>, <italic>k</italic>, and &#x003B8;.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Query</bold></th>
<th valign="top" align="left"><bold>Community</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>1</sub>}, <italic>k</italic> &#x0003D; {2}, &#x003B8;={0.4}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M19"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,I}, <inline-formula><mml:math id="M20"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,Q,R,S,T}</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>1</sub>}, <italic>k</italic> &#x0003D; {3}, &#x003B8;={0.4}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M21"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,I},<inline-formula><mml:math id="M22"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,S,T}</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>2</sub>}, <italic>k</italic> &#x0003D; {2}, &#x003B8;={0.4}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M23"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {B,C,D,E,F,G,H,I},<inline-formula><mml:math id="M24"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {M,N,O,P,S,T,U}</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>2</sub>}, <italic>k</italic> &#x0003D; {2}, &#x003B8;={0.5}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M25"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {C,D,H},<inline-formula><mml:math id="M26"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {M,N,U}</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>1</sub>,<italic>T</italic><sub>2</sub>}, <italic>k</italic> &#x0003D; {2}, &#x003B8;={0.4}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M27"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,D,E,F,G,H,I},<inline-formula><mml:math id="M28"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {M,N,O,P,Q,R,S,T,U}</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>1</sub>,<italic>T</italic><sub>2</sub>}, <italic>k</italic> &#x0003D; {2}, &#x003B8;={0.5}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M29"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,D,H,I},<inline-formula><mml:math id="M30"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,Q,R,S,T},<inline-formula><mml:math id="M31"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {M,N,U}</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Q</italic> &#x0003D; {<italic>T</italic><sub>1</sub>,<italic>T</italic><sub>2</sub>}, <italic>k</italic> &#x0003D; {3}, &#x003B8;={0.4}</td>
<td valign="top" align="left"><inline-formula><mml:math id="M32"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {A,B,C,I},<inline-formula><mml:math id="M33"><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> &#x0003D; {O,P,S,T}</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>3.2. Highly Interactive Community Detection Approach</title>
<p>In this work, we propose a method to detect highly interactive communities for a given a query <italic>Q</italic> in an online social attributed graph <italic>G</italic>. The desired communities from the graph <italic>G</italic> can be identified in the following three steps:
<list list-type="order">
<list-item><p>Identify the set of active users based on their direct interaction with each other for a given query <italic>Q</italic>.</p></list-item>
<list-item><p>Refine the original social graph <italic>G</italic> by filtering the inactive social users.</p></list-item>
<list-item><p>Apply <italic>k</italic>-core technique on the refined social graph in order to detect the desired online communities.</p></list-item>
</list></p>
<p>The first step of our approach is measuring the interaction frequencies among the users for a given query <italic>Q</italic> in social graph <italic>G</italic> to filter the weakly connected topology links. For this purpose, we consider users who have direct communication with others via retweets or mentions and consider an interaction link between two users irrespective of whether they have a topology link or not.</p>
<p>After establishing the newly active interaction edges and filtering the inactive topology links from the social graph <italic>G</italic>, we apply <italic>k</italic>-core on the refined social graph to find the connected components in which every node has degree of at-least <italic>k</italic>.</p>
<p>We develop an algorithmic framework to detect highly interactive communities for a given <italic>Q</italic>.</p>
<p><bold>Algorithm overview</bold>. The algorithm, called <monospace>Query Algorithm</monospace>, has three steps. First, it computes the interaction strength <italic>w</italic><sub><italic>uv</italic></sub> of each edge <italic>e</italic><sub><italic>uv</italic></sub> for a given query <italic>Q</italic> in order to find the set of active users (line 1-5). Next, we compute the induced sub graph <italic>G</italic><sup><italic>Q</italic></sup> from <italic>U</italic><sup><italic>Q</italic></sup> (line 6). Finally, we identify the maximal <italic>k</italic>-core of <italic>C</italic><sup><italic>k</italic></sup>(<italic>G</italic><sup><italic>Q</italic></sup>) from the induced graph <italic>G</italic><sup><italic>Q</italic></sup> to find the set of active connected components (i.e., desired connected communities) (line 6-7) &#x003A6;<sup><italic>Q</italic></sup> from <italic>C</italic><sup><italic>k</italic></sup>(<italic>G</italic><sup><italic>Q</italic></sup>) (line 7-8).</p>
</sec></sec>
<sec id="s4">
<title>4. Experiment and Result</title>
<p>We conduct our experiment on an academic coauthor (DBLP) dataset (Jie et al., <xref ref-type="bibr" rid="B8">2008</xref>) and choose research papers that were published within 2005 to 2011. This revised dataset is a network of 15,516 authors with 48,862 co-author relationships between these authors and contains 193,512 research papers. The co-author information in DBLP is considered as interaction between the authors. We extract the authors&#x00027; details, publication year, and abstract from each research paper. We apply latent dirichlet allocation (LDA) topic modeling (Blei et al., <xref ref-type="bibr" rid="B2">2003</xref>) on the abstracts of the research papers in order to find the research topics.</p>
<p><bold>Comparison Methods</bold>. We compare our <xref ref-type="table" rid="T3">Algorithm 1</xref> (<monospace>Query Algorithm</monospace>), denoted here as <monospace>TO-HIOC</monospace>, with two other existing methods: <monospace>HICD </monospace>method (Lim and Datta, <xref ref-type="bibr" rid="B9">2016</xref>) and <monospace>iTop </monospace>algorithm (Correa et al., <xref ref-type="bibr" rid="B5">2012</xref>).</p>
<table-wrap position="float" id="T3">
<label>Algorithm 1</label>
<caption><p><monospace>Query Algorithm</monospace>&#x000A0;</p></caption>
<table frame="hsides" rules="groups">
<tbody>
<tr><td align="left" valign="top"><bold>Input:</bold> <italic>G</italic>=(<italic>U, E</italic>), <italic>Q, k</italic>, &#x003B8;</td></tr>
<tr><td align="left" valign="top"><bold>Output:</bold> set of active interactive communities &#x003A6;<sub><italic>Q</italic></sub>=<inline-formula><mml:math id="M34"><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula></td></tr>
<tr><td align="left" valign="top"> 1: &#x000A0;&#x000A0;&#x000A0;for each (<italic>u, v</italic>)&#x02208;<italic>E</italic> <bold>do</bold> </td></tr>
<tr><td align="left" valign="top"> 2: &#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0; compute <italic>w</italic><sub><italic>uv</italic></sub> </td></tr>
<tr><td align="left" valign="top"> 3: &#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0; if <italic>w</italic><sub><italic>uv</italic></sub>&#x0003E;&#x003B8; <bold>then</bold> </td></tr>
<tr><td align="left" valign="top"> 4: &#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0; <italic>U</italic><sup><italic>Q</italic></sup>.<italic>add</italic>(<italic>u</italic>) </td></tr>
<tr><td align="left" valign="top"> 5: &#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0; <italic>U</italic><sup><italic>Q</italic></sup>.<italic>add</italic>(<italic>v</italic>) </td></tr>
<tr><td align="left" valign="top"> 6:&#x000A0;&#x000A0;&#x000A0; compute the induced graph <italic>G</italic><sup><italic>Q</italic></sup> on <italic>U</italic><sup><italic>Q</italic></sup> </td></tr>
<tr><td align="left" valign="top"> 7:&#x000A0;&#x000A0;&#x000A0; compute the maximal <italic>k</italic>-core <italic>C</italic><sup><italic>k</italic></sup>(<italic>G</italic><sup><italic>Q</italic></sup>) of <italic>G</italic><sup><italic>Q</italic></sup> </td></tr>
<tr><td align="left" valign="top"> 8:&#x000A0;&#x000A0;&#x000A0; Output the set of active connected components &#x003A6;<sub><italic>Q</italic></sub> from <italic>C</italic><sup><italic>k</italic></sup>(<italic>G</italic><sup><italic>Q</italic></sup>)</td></tr> 
</tbody>
</table>
</table-wrap>
<p><bold>Evaluation Measures</bold>. We vary the length of the <italic>Q</italic> to |<italic>Q</italic>| &#x0003D; 2, 3, 4 and use three measures of density, entropy and modularity to evaluate the quality of the detected online communities discovered by different methods. The definition of density, entropy, and modularity are as follows.</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M35"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>d</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mfrac><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>u</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mi>u</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>u</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:mi>E</mml:mi></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mi>E</mml:mi><mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>n</italic> denotes the total number of detected communities. Density measures the compactness of the communities in structure.</p>
<disp-formula id="E3"><mml:math id="M36"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>o</mml:mi><mml:mi>p</mml:mi><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mo>=</mml:mo></mml:mtd><mml:mtd><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mfrac><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mi>U</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mi>U</mml:mi><mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mfrac><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>o</mml:mi><mml:mi>p</mml:mi><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>where&#x02003;</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E4"><label>(3)</label><mml:math id="M37"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>o</mml:mi><mml:mi>p</mml:mi><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mo>=</mml:mo></mml:mtd><mml:mtd><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:msub><mml:mrow><mml:mi>g</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>and <italic>p</italic><sub><italic>ij</italic></sub> is the percentage of members in a community <inline-formula><mml:math id="M38"><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> who are active on the query topic <italic>T</italic><sub><italic>i</italic></sub>. <inline-formula><mml:math id="M39"><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>o</mml:mi><mml:mi>p</mml:mi><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mi mathvariant="-tex-caligraphic">G</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> measures the weighted entropy considering all the query topics over all the communities. Entropy indicates the randomness of the topics which are covered in the communities.</p>
<disp-formula id="E5"><label>(4)</label><mml:math id="M40"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>m</mml:mi><mml:mi>o</mml:mi><mml:mi>d</mml:mi><mml:mi>u</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>m</mml:mi></mml:mrow></mml:mfrac><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:munder></mml:mstyle><mml:mrow><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>m</mml:mi></mml:mrow></mml:mfrac></mml:mrow><mml:mo stretchy="false">]</mml:mo></mml:mrow><mml:mi>&#x003B4;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Here, <italic>m</italic> denotes the number of edges corresponding to an adjacency matrix <italic>A</italic><xref ref-type="fn" rid="fn0001"><sup>1</sup></xref>, <italic>d</italic><sub><italic>i</italic></sub> denotes the degree corresponding to node <italic>n</italic><sub><italic>i</italic></sub>, <italic>s</italic><sub><italic>i</italic></sub> denotes the community membership of node <italic>n</italic><sub><italic>i</italic></sub> and &#x003B4;(<italic>s</italic><sub><italic>i</italic></sub>, <italic>s</italic><sub><italic>j</italic></sub>) &#x0003D; 1 if <italic>s</italic><sub><italic>i</italic></sub>=<italic>s</italic><sub><italic>j</italic></sub>.</p>
<p>Generally, a good interactive community should have high density, high modularity, and low entropy.</p>
<p><xref ref-type="fig" rid="F2">Figure 2A</xref> shows the density comparison between all the methods on the DBLP dataset. We set <italic>k</italic> &#x0003D; 4 as there are usually many small-sized research groups existing in DBLP. We see that <monospace>TO-HIOC</monospace> achieves better performance compared to the other two methods because it considers query-oriented active interactions among the community members. The <monospace>HICD </monospace>method fails to achieve better density values as it requires interaction between users (authors) to the celebrities (i.e., very high profile researchers in DBLP), which is not very common. The <monospace>iTop </monospace>method ignores the interactions between the non-seed users, resulting in poor performance. We also observe that all the methods achieve better density values for higher values of |<italic>Q</italic>|. The reason is that the number of interactive connections of the users increases as |<italic>Q</italic>|increases, which results in large and more densely connected communities.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Performance comparison on DBLP dataset <bold>(A)</bold> Density, <bold>(B)</bold> Entropy, <bold>(C)</bold> Modularity (in all cases, <italic>Q</italic> &#x0003D; {Semantic web, Data mining, Social network analysis}, <italic>k</italic> &#x0003D; {4}, &#x003B8; &#x0003D; {0.5}, &#x003B3; &#x0003D; {4}, the publications are chosen from the time period of 2005 to 2009).</p></caption>
<graphic xlink:href="fdata-02-00010-g0002.tif"/>
</fig>
<p><xref ref-type="fig" rid="F2">Figure 2B</xref> shows the entropy comparison between the three methods. <monospace>TO-HIOC </monospace>achieves better performance in the aspect of the entropy as it considers the topical relevance (with regard to the query topics) during the interactions between the authors while forming a community. On the other hand, <monospace>HICD </monospace>achieves higher entropy value because not all the connected authors in a community have interests or active interactions in the common research topics. <monospace>iTop </monospace>also achieves a higher entropy value due to the lack of active topical interactions between the seed users and their followers. We see in <xref ref-type="fig" rid="F2">Figure 2C</xref> that our proposed method <monospace>TO-HIOC </monospace>outperforms <monospace>HICD </monospace>and <monospace>iTop </monospace>in modularity comparison.</p>
<p>We examined a community in a co-author dataset which includes Jie Tang, who is one of the leading researchers in the data mining area, to see the differences in the community members for different values of <italic>k</italic> and <italic>Q</italic> &#x0003D; {semantic web, topic mining, social network analysis}.</p>
<p>We observe the effect of value <italic>k</italic> in <xref ref-type="fig" rid="F3">Figures 3A&#x02013;C</xref>. By varying the values of <italic>k</italic>, we get communities of different sizes. We see that the community size decreases for higher values of <italic>k</italic> as the cohesiveness constraint becomes more strict, resulting in the exclusion of some active community members, for example &#x0201C;Yi Li,&#x0201D; &#x0201C;Jing Zhang,&#x0201D; &#x0201C;Limin Yao&#x0201D;leave the group. We also see that more researchers joined the community when the length of <italic>Q</italic> is increased as higher values of |<italic>Q</italic>|covered more interactive researchers (<xref ref-type="fig" rid="F4">Figures 4A&#x02013;C</xref>).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>(A)</bold> <italic>k</italic> &#x0003D; {3}; <bold>(B)</bold> <italic>k</italic> &#x0003D; {4}; <bold>(C)</bold> <italic>k</italic> &#x0003D; {5}; (in all cases, <italic>Q</italic> &#x0003D; {Semantic web}, &#x003B8; &#x0003D; {0.5}, &#x003B3; &#x0003D; {4}, the publications are chosen from the time period of 2005 to 2009).</p></caption>
<graphic xlink:href="fdata-02-00010-g0003.tif"/>
</fig>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>(A)</bold> <italic>Q</italic> &#x0003D; {Semantic web}; <bold>(B)</bold> <italic>Q</italic> &#x0003D; {Semantic web, Data mining}; <bold>(C)</bold> <italic>Q</italic> &#x0003D; {Semantic web, Data mining, Social network analysis}; (in all cases, <italic>k</italic> &#x0003D; {4}, &#x003B8; &#x0003D; {0.5}, &#x003B3; &#x0003D; {4}, the publications are chosen from the time period of 2007 to 2011).</p></caption>
<graphic xlink:href="fdata-02-00010-g0004.tif"/>
</fig></sec>
<sec sec-type="conclusions" id="s5">
<title>5. Conclusion</title>
<p>In this paper, a topic-oriented highly interactive community detection approach is proposed. This method detects global communities where users have active interaction with each other on common topics. We observed that users have different degrees of interactions for different topics. As future work, we will consider the temporal factor to measure the recency behavior of users&#x00027; interactions.</p></sec>
<sec id="s6">
<title>Data Availability</title>
<p>The raw data supporting the conclusions of this manuscript will be made available by the authors, without undue reservation, to any qualified researcher.</p></sec>
<sec id="s7">
<title>Author Contributions</title>
<p>For this research paper, SD conducted the experiments (100%), designed the algorithm (75%), and wrote the paper (70%). MA designed the algorithm and experiments (25%), revised the paper (30%) as well as provided helpful insights and contribution in problem formulation, idea refinement, reviewing, and polishing the paper.</p>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec></sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Anwar</surname> <given-names>M. M.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Uncovering attribute-driven active intimate communities,&#x0201D;</article-title> in <source>Australasian Database Conference</source> (<publisher-loc>Gold Coast, QLD</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>109</fpage>&#x02013;<lpage>122</lpage>.</citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blei</surname> <given-names>D. M.</given-names></name> <name><surname>Ng</surname> <given-names>A. Y.</given-names></name> <name><surname>Jordan</surname> <given-names>M. I.</given-names></name></person-group> (<year>2003</year>). <article-title>Latent dirichlet allocation</article-title>. <source>J. Mach. Learn. Res.</source> <volume>3</volume>, <fpage>993</fpage>&#x02013;<lpage>1022</lpage>.</citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clauset</surname> <given-names>A.</given-names></name> <name><surname>Newman</surname> <given-names>M. E.</given-names></name> <name><surname>Moore</surname> <given-names>C.</given-names></name></person-group> (<year>2004</year>). <article-title>Finding community structure in very large networks</article-title>. <source>Phys. Rev. E</source> <volume>70</volume>:<fpage>066111</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevE.70.066111</pub-id><pub-id pub-id-type="pmid">15697438</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>Trusses: Cohesive subgraphs for social network analysis</article-title>. <source>Natl. Secur. Agency Tech. Rep.</source> <volume>16</volume>, <fpage>3</fpage>&#x02013;<lpage>1</lpage>.</citation></ref>
<ref id="B5">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Correa</surname> <given-names>D.</given-names></name> <name><surname>Sureka</surname> <given-names>A.</given-names></name> <name><surname>Pundir</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>&#x0201C;itop: interaction based topic centric community discovery on twitter,&#x0201D;</article-title> in <source>Proceedings of the 5th Ph. D. Workshop on Information and Knowledge</source> (<publisher-loc>Maui, HI</publisher-loc>: <publisher-name>ACM</publisher-name>), <fpage>51</fpage>&#x02013;<lpage>58</lpage>.</citation></ref>
<ref id="B6">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dev</surname> <given-names>H.</given-names></name> <name><surname>Ali</surname> <given-names>M. E.</given-names></name> <name><surname>Hashem</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;User interaction based community detection in online social networks,&#x0201D;</article-title> in <source>International Conference on Database Systems for Advanced Applications</source> (<publisher-loc>Bali</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>296</fpage>&#x02013;<lpage>310</lpage>.</citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fortunato</surname> <given-names>S.</given-names></name> <name><surname>Hric</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>Community detection in networks: a user guide</article-title>. <source>Phys. Rep.</source> <volume>659</volume>, <fpage>1</fpage>&#x02013;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.1016/j.physrep.2016.09.002</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jie</surname> <given-names>T.</given-names></name> <name><surname>Jing</surname> <given-names>Z.</given-names></name> <name><surname>Limin</surname> <given-names>Y.</given-names></name> <name><surname>Juanzi</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Zhong</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>&#x0201C;Arnetminer: extraction and mining of academic social networks,&#x0201D;</article-title> in <source>Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source> (<publisher-loc>Beijing</publisher-loc>: <publisher-name>ACM</publisher-name>), <fpage>990</fpage>&#x02013;<lpage>998</lpage>.</citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lim</surname> <given-names>K. H.</given-names></name> <name><surname>Datta</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>An interaction-based approach to detecting highly interactive twitter communities using tweeting links</article-title>. <source>Web Intell</source>. <volume>14</volume>, <fpage>1</fpage>&#x02013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.3233/WEB-160328</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Niculescu-Mizil</surname> <given-names>A.</given-names></name> <name><surname>Gryc</surname> <given-names>W.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;Topic-link lda: joint models of topic and author community,&#x0201D;</article-title> in <source>Proceedings of the 26th Annual International Conference on Machine Learning</source> (<publisher-loc>Montreal, QC</publisher-loc>: <publisher-name>ACM</publisher-name>), <fpage>665</fpage>&#x02013;<lpage>672</lpage>.</citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>W.</given-names></name> <name><surname>Yan</surname> <given-names>Z.</given-names></name> <name><surname>Bu</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>D.</given-names></name></person-group> (<year>2017</year>). <article-title>Community detection by fuzzy relations</article-title>. <source>IEEE Trans. Emerg. Top. Comput.</source> <fpage>1</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1109/TETC.2017.2751101</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>W.</given-names></name> <name><surname>Zhang</surname> <given-names>D.</given-names></name> <name><surname>Jiang</surname> <given-names>H.</given-names></name> <name><surname>Ni</surname> <given-names>L.</given-names></name> <name><surname>Hu</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Local community detection with the dynamic membership function</article-title>. <source>IEEE Trans. Fuzzy Syst.</source> <volume>26</volume>, <fpage>3136</fpage>&#x02013;<lpage>3150</lpage>. <pub-id pub-id-type="doi">10.1109/TFUZZ.2018.2812148</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Newman</surname> <given-names>M. E.</given-names></name> <name><surname>Park</surname> <given-names>J.</given-names></name></person-group> (<year>2003</year>). <article-title>Why social networks are different from other types of networks</article-title>. <source>Phys. Rev. E</source> <volume>68</volume>:<fpage>036122</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevE.68.036122</pub-id><pub-id pub-id-type="pmid">14524847</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shang</surname> <given-names>R.</given-names></name> <name><surname>Luo</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Stolkin</surname> <given-names>R.</given-names></name> <name><surname>Jiao</surname> <given-names>L.</given-names></name></person-group> (<year>2016</year>). <article-title>A multiobjective evolutionary algorithm to find community structures based on affinity propagation</article-title>. <source>Phys. A Stat. Mech. Appl.</source> <volume>453</volume>, <fpage>203</fpage>&#x02013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1016/j.physa.2016.02.020</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>X.</given-names></name> <name><surname>Xu</surname> <given-names>T.</given-names></name> <name><surname>Feng</surname> <given-names>X.</given-names></name> <name><surname>Yang</surname> <given-names>G.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Q.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Learning community structures: global and local perspectives</article-title>. <source>Neurocomputing</source> <volume>239</volume>, <fpage>249</fpage>&#x02013;<lpage>256</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2017.02.026</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>McAuley</surname> <given-names>J.</given-names></name> <name><surname>Leskovec</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Community detection in networks with node attributes,&#x0201D;</article-title> in <source>2013 IEEE 13th International Conference on Data Mining</source> (<publisher-loc>Dallas, TX</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1151</fpage>&#x02013;<lpage>1156</lpage>.</citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Cheng</surname> <given-names>H.</given-names></name> <name><surname>Yu</surname> <given-names>J. X.</given-names></name></person-group> (<year>2009</year>). <article-title>Graph clustering based on structural/attribute similarities</article-title>. <source>Proc. VLDB Endow.</source> <volume>2</volume>, <fpage>718</fpage>&#x02013;<lpage>729</lpage>. <pub-id pub-id-type="doi">10.14778/1687627.1687709</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>An adjacency matrix of a network is represented by <italic>A</italic>, where <italic>A</italic><sub><italic>uv</italic></sub> &#x0003D; 0 means there is no edge (no interaction) between nodes <italic>u</italic> and <italic>v</italic> and <italic>A</italic><sub><italic>uv</italic></sub> &#x0003D; 1 means there is an edge between the two.</p></fn>
</fn-group>
</back>
</article>