ORIGINAL RESEARCH article

Front. Physiol., 28 November 2022

Sec. Developmental Physiology

Volume 13 - 2022 | https://doi.org/10.3389/fphys.2022.1038048

Investigating pregnant women’s health information needs during pregnancy on internet platforms

  • 1. School of Health Sciences, Guangzhou Xinhua University, Guangzhou, China

  • 2. School of Management, Zhengzhou University, Zhengzhou, China

Article metrics

View details

4

Citations

5,7k

Views

1,7k

Downloads

Abstract

Artificial intelligence gives pregnant women another avenue for receiving healthcare information. With the advancement of information and communication technology, searching online for pregnancy information has become commonplace during COVID-19. This study aimed to explore pregnant women’s information-seeking behavior based on data mining and text analysis in China. Posts on maternal and infant-related websites were collected during 1 June 2020, and 31 January 2021. A total of 5,53,117 valid posts were obtained. Based on the data, we performed correlation analysis, topic analysis, and sentiment analysis. The correlation analysis showed the positive effects of population, population with a college education or above, and GDP on post counts. The topic analysis extracted six, nineteen, eighteen, thirteen, eleven, sixteen, thirteen, sixteen, nineteen, and fourteen topics in different months of pregnancy, reflecting different information needs in various pregnancy periods. The results of sentiment analysis show that a peak of the posts emerged in the second month of pregnancy and the proportion of emotionally positive posts reached its peak in the sixth month of pregnancy. The study provides important insights for understanding pregnant women’s information-seeking behavior.

1 Introduction

Artificial intelligence (AI) creates opportunities for enabling pregnant women to receive healthcare information. Pregnancy is a crucial period in a woman’s life accompanied by physical change, psychological change, and role transformation. Information-seeking can play an important role in addressing the issue of a healthy delivery. Access to advantageous and concerned information contributes to health-related decisions and the life of both pregnant women and unborn children (Kamali et al., 2018). Childbirth-related information is considerable for performing beneficial interventions and suggestions for pregnant women (Kamali et al., 2018). For example, health-related information will enable women to prepare for pregnancy, concentrate on balanced nutrition and medication use during pregnancy, and make decisions on exercise intensity and mode.

Extant research on health information has addressed the crucial role of research contexts, such as the user group and the domain of information subject in determining information needs (Pian et al., 2020; Reifegerste et al., 2020). The development of information technology and the spread of the mobile Internet enable pregnant women to seek information in a more conveniently and fairly way. Centered on the information needs of maternal health, recent studies have shown that pregnant women’s information-seeking behavior is crucial to enriching the knowledge of childbirth and maternal health and improving maternal health outcomes (Kamali et al., 2018; Ahmadian et al., 2020; Jin et al., 2020; Kassim, 2021). For example, Kamali et al. (2018) found that pregnant women need information such as psychological and physical complications after delivery and pregnancy nutrition in the descriptive study. The qualitative study conducted by Kassim (2021) found that the unavailability of health facilities and limited chances of accessing professional health care could lead to the results that pregnant women seek information from non-professional and informal sources. Ahmadian et al. (2020) identified commonly searched topics during pregnancy using the questionnaire. However, researchers have not treated the topics of information-seeking and pregnant women’s emotions in much detail by employing a relatively large amount of data.

The objective of this research is to explore pregnant women’s information-seeking behavior during the whole pregnancy, including the factors that contribute to the information-seeking behavior, the topics that cause pregnant women’s attention at different months of pregnancy, and the change in pregnant women’s emotions at different stages of pregnancy. By collecting and analyzing the posts in the “pregnant section” under “Mama.cn” from 1 June 2020, to 31 January 2021, and 5,53,117 valid posts, the current work provides a comprehensive study.

2 Materials and methods

2.1 Data collection

With the advancement of Internet technology, pregnant women’s behavior of seeking online health information has become a universal trend worldwide because of insufficient information received from healthcare providers and the natural advantage of the Internet to ask questions anonymously (Al-Dahshan et al., 2021). As one of the largest maternal and child health websites in China, “Mama.cn” has integrated websites, APPS, new media, micro-network celebrities, and other media resources, covering hundreds of millions of pan-maternal and infant groups. Dedicated to serving all kinds of needs of pregnant women, the company has built several service sections including information, social networking, tools, and e-commerce, aiming to build a diversified Internet maternal, and infant service platform with pregnant women as the core. “Mama.cn” is widely popular among people who are preparing for pregnancy, during pregnancy, and childrearing. In August 2019, “Mama.cn” had 16.479 million active users. The number of active users of “Mama.cn” reached 19.31 million in June 2020, ranking first in the parenting subdivision list in China. Therefore, “Mama.cn” was selected as the research data source for this study. This study collected the posts in the “pregnant section” under “Mama.cn” from 1 June 2020, to 31 January 2021, involving data from “the first month of pregnancy” to “the tenth month of pregnancy.” The current study extracted the following information from the “pregnant section” under “Mama.cn” posts: username, post time, duration of pregnancy, city, and text. A total of 5,75,970 posts were obtained. Examples of our dataset are presented in Table 1.

TABLE 1

NoCityPost timeDuration of pregnancyText
1Dazhou city31/01/2021Gestation: 3 weeks + 2 daysI just found out I am pregnant, I feel intermittent pain in belly. What's going on?
2Linyi city28/01/2021Gestation: 6 weeks + 1 dayI just went to the toilet and saw a little brown secretion. Not much rubbing, a little worried
3Wuhan city25/01/2021Gestation: 11 weeks + 5 daysMy nuchal translucency test passed at one time. The doctor said that the baby was well behaved and in good upgrowth
4Yinchuan city31/12/2020Gestation: 15 weeks + 3 daysMy New Year’s resolution is to have a healthy baby! No matter if you are a boy or a girl, stay healthy!
5Zhongshan city23/01/2021Gestation: 29 weeks + 5 daysSometimes the fetus moves so much. It feels like she is about to jump out from my belly
6Fuzhou city27/01/2021Gestation: 34 weeks + 4 days34 weeks, I feel pain in public bone, back, and coccyx

The examples of dataset.

We pre-processed the original data before formal analysis by the following procedures. First, the raw information may include missing city tags, irrelevant advertising messages, or posts that did not match the actual time of pregnancy. We filtrated and deleted the above data and finally obtained 5,53,117 texts. Second, the original message may contain distracting information, such as interpunction, emoticons, blank, and hashtags. For excluding data noise and improving data analysis efficiency, we employed regular expressions operations in Python for text filtering.

Measures were performed to ensure data privacy, anonymity, and security. The data collection and analysis did not disclose any privacy issues regarding pregnant women’s identifiable and sensitive information (Favaretto et al., 2020). During data collection, only username, post time, duration of pregnancy, city, and text were extracted. In data processing and analysis, only the duration of pregnancy, city, and text data was used, while the personal information of users was not disclosed. By involving as many samples as possible, more anonymity was preserved as a combination of the variables will be repeated among the samples (Leon-Sanz, 2019).

2.2 Methods

2.2.1 Text topic analysis based on latent Dirichlet allocation model

LDA (Latent Dirichlet Allocation) topic model is a topic probability distribution model based on PLSI (Probabilistic Latent Semantic Indexing) model (Blei et al., 2003). The LDA topic model simulates the process of document generation by using an implied random variable that follows a Dirichlet distribution to represent the document’s topic mixing ratio. Its model structure is more complete and clearer, and the probability inference algorithm is adopted to process the text, which can greatly reduce the dimension of the text representation, to avoid dimension disaster (Blei, 2012). Therefore, LDA is widely used in text mining, text clustering, language processing, and other aspects. The topic number K contained in the document set is a hyperparameter. Given other hyperparameters, the selection process of topic number K is the process of the model searching for the optimal topic number. When the number of topics is too large, there will be many topics without obvious classification semantic information. When the number of topics is too small, broad topics will be generated with a mixture of two or more distributions (Panichella, 2021). Therefore, the determination of the optimal number of topics is an important issue. A coherence score was used to determine the optimal number of topics, with a higher coherence score indicating better quality of topics (Korenčić et al., 2018; Panichella, 2021; Shah et al., 2021). This study used the open-source LDA tool in the Gensim library. The LDA model was evaluated by topic coherence to determine the optimal number of topics. According to the trained LDA model, the topic words under each topic were obtained and the probability of each text belonging to each topic could be directly predicted. Finally, the corresponding topic name was summarized in accordance with the topic words. Figure 1 presents the process of topic extraction in this study.

FIGURE 1

FIGURE 1

The process of topic extraction.

2.2.2 Text sentiment analysis based on SnowNLP

In recent years, there has been an increasing interest in sentiment analysis (Wang et al., 2019). Sentiment analysis, also known as opinion mining, is an application of text mining and computational linguistics to mine subjective texts with emotional colors and identify the emotional tendencies contained in them. It is a process of identifying information from texts and analyzing, processing, induction, and reasoning subjective texts with emotional color. Through sentiment analysis, researchers can determine users’ emotional orientation in the text. Text-based sentiment analysis methods are mainly divided into three types: sentiment dictionary-based, machine learning-based, and deep learning-based (Xu et al., 2019; Li et al., 2020). The machine learning-based analysis method trains the emotion classifier with emotion-labeled data to achieve emotion classification. Classification accuracy relies on high-quality human-annotated training sets, and large-scale high-quality training data requires a lot of labor costs, and the results of human subjective data annotation will also affect the classification effect. The deep learning-based analysis is based on feature self-learning and deep neural network. It has a good classification effect when dealing with high-dimensional, unlabeled big data, but it is difficult to accurately classify the semantically ambiguous and short text content in social networks. The sentiment dictionary-based method is an unsupervised method, which uses a sentiment dictionary to discriminate the sentiment polarity of text containing keywords, to achieve sentiment classification for each text. There is no need for complex data labeling in the research process and the accuracy of emotion recognition can be improved by adjusting and expanding the vocabulary of the sentiment dictionary according to the specific research background.

SnowNLP, a Python library for Chinese natural language processing, is used to analyze the sentiment of texts. The tool is based on a sentiment dictionary to analyze the sentiment orientation of texts. SnowNLP employs a sentiment dictionary to realize the sentiment tendency analysis of the text. The main functions include part-of-speech tagging, sentiment analysis, keyword extraction, and text summarization (He et al., 2020; Zhang et al., 2021).

3 Correlation analysis

The correlation analysis is performed using SPSS24.0. The results of descriptive statistical analysis at the provincial level are presented in Table 2. The data of posts, population, population with a college education or above, illiteracy rate, and GDP of the province are from mainland China. More precisely, population, population with a college education and above, and illiteracy rate are all data from the 2020 census.

TABLE 2

VariablesMinimum valueMaximum valueMean valueStandard deviationNumber (N)
Post counts4109344917819.09717596.22131
Population364810012601251045476733.0330506939.64
Population with college education or above (ten thousand)401978700.6452443.05752
Illiteracy rate (%)0.7821.113.423.72
GDP (100 million yuan)1902.7110760.932658.554826661.80805

The results of descriptive statistical analysis at the provincial level.

Table 3 presents the correlation analysis results at the provincial level. Post counts was found to positively related to population (ß = 0.889, p < 0.001), population with college education or above (ß = 0.835, p < 0.001), and GDP (ß = 0.819, p < 0.001). However, a significant relationship between post counts and illiteracy rate (p > 0.05) was not found in this study. This result is consistent with previous research which indicates that the illiteracy rate had a small and insignificant correlation with computer and Internet penetration rates statistically (Chinn and Fairlie, 2010).

TABLE 3

VariableCorrelation coefficientp-value
Post countsPopulation0.889***<0.001
Population with college education or above0.835***<0.001
Illiteracy rate−0.227>0.05
GDP0.819***<0.001

The results of correlation analysis at the provincial level (N = 31).

4 Topic analysis of information needs

4.1 Emerged topics in different months of pregnancy

4.1.1 Information needs in the first month

As mentioned above, topic analysis was divided based on the stages of pregnancy, corresponding to the period from “the first month of pregnancy” to “the tenth month of pregnancy”. Table 4 presents the topics identified in the first month of pregnancy, relative weight, and LDA keywords. Six topics emerged in the first month of pregnancy in which the first frequent topic. “Test strip,” accounts for 20.53% of all topics. “Pregnancy tests consultation,” “early pregnancy inspection,” and “early pregnancy reaction,” accounting for 16.59%, 14.73%, and 14.02%, respectively, were the second, third, and fourth most frequent topics. Among them, early pregnancy reaction refers to pregnant women’s body response during the early pregnancy period. The next two frequent topics are “appeals and desire” and “question for help,” at 12.54% and 10.96%, respectively.

TABLE 4

Topic nameRate (%)LDA keywords
1Test strip20.53Last menstrual period, ovulation, pregnancy test paper, deepen, detect, intercourse, color, one deep and one shallow, obvious, ovulatory period
2Pregnancy tests consultation16.59Yes or no, pregnancy, take a look, give a hand, pray, really, two lines, duration
3Early pregnancy inspection14.73Hospital, detect, normal, low progesterone, HCG doubled, draw blood, worry, B ultrasound, brown secretion, blood test
4Early pregnancy reaction14.02Eat, feeling, early pregnancy, collywobbles, everyday, night, emesis, symptom, not good, uncomfortable
5Appeals and desire12.54Baby, hope, mother, good pregnancy, healthy, earnestly hope, love, must, finally
6Question for help10.96Pregnancy, have you ever, discern, circumstance, affect, find, why, need, question

Topics in the first month of pregnancy.

4.1.2 Information needs in the second month

Nineteen topics are identified in the second month of pregnancy. The most frequent ten topics in the second month of pregnancy, relative weight, and LDA keywords are presented in Table 5. The results show that “precautions for early pregnancy,” “early pregnancy inspection,” and “symptoms of early pregnancy” emerged to be the top three frequent topics, accounting for 8.59%, 8.35%, and 7.39%, respectively. The next five frequent topics are “the gender of baby,” “fetal heart and embryo bud,” “vomiting during pregnancy,” “early pregnancy indicators,” and “calculation of pregnancy period,” at 7.04%, 6.54%, 6.40%, 6.31%, and 6.30%, respectively. The following two frequent topics are “appeals and desire” and “prenatal diet,” at 5.70% and 5.34%, respectively.

TABLE 5

Topic nameRate (%)LDA keywords
1Precautions for early pregnancy8.59Early stages of pregnancy, purchase, affect, fetus, catch a cold, recommend, create profile, skin care product, attention, nuchal translucency, clothes, prepare
2Early pregnancy inspection8.35Check, B ultrasound, gestational sac, report, show, ectopic pregnancy, in utero, yolk, transvaginal ultrasound, germ, recheck
3Symptoms of early pregnancy7.39Feeling, collywobbles, normal, symptom, 6 weeks, 7 weeks, why, once in a while, lower abdominal pain
4The gender of baby7.04Take a look, give a hand, boy, girl, discern, everyone, make out, whether or not
5Fetal heart and embryo bud6.54Fetal heart, embryo bud, hope, healthy, good pregnancy, bless, happy, antenatal care, all the best, in utero, rest assured
6Vomiting during pregnancy6.40Vomiting during pregnancy, uncomfortable, reaction, nausea, serious, stomach, anesis, loss of appetite, dizziness, retch
7Early pregnancy indicators6.31Low progesterone, HCG doubled, normal, doctor, draw blood, recheck, decline, blood test, relatively low
8Calculation of pregnancy period6.30Month, day, last menstrual period, count pregnancy period, the last time, intercourse, detect, menstrual cycle, the first day, ovulatory period, pattern
9Appeals and desire5.70Baby, mother, hope, cheer, healthy, love, expectation, grow up, safety, happy, birth
10Prenatal diet5.34Eat, hungry, food, drink, folic acid, loss of appetite, like, not allowed, eat nothing, meat

Topics in the second month of pregnancy.

4.1.3 Information needs in the third month

Eighteen topics are extracted in the third month of pregnancy. Table 6 presents the top ten topics in the third month of pregnancy. The results indicate that “nuchal translucency and filling,” “vomiting during pregnancy,” “the gender of baby,” and “symptom of early pregnancy” emerged to be the four most frequent topics, accounting for 13.05%, 12.51%, 8.65%, and 7.94%, respectively. The next four frequent topics are “prenatal diet,” “fetal heart and embryo bud,” “threatened miscarriage,” and “fetus protection,” at 5.80%, 5.72%, 5.12%, and 4.45%, respectively. The following two most frequent topics are “share and exchange” and “household affairs,” at 4.24% and 4.22%, respectively.

TABLE 6

Topic nameRate (%)LDA keywords
1Nuchal translucency and filling13.05Hospital, nuchal translucency, create a profile, need, appointment, prepare, expense, antenatal care, several weeks, empty stomach, draw blood
2Vomiting during pregnancy12.51Vomiting during pregnancy, reaction, serious, food, everyday, stomach, nausea, hungry, loss of appetite, retch, dizziness
3The gender of baby8.65Take a look, girl, boy, everyone, give a hand, curious, make out, discern, checklist
4Symptom of early pregnancy7.94Feeling, collywobbles, normal, bloat, symptom, why, back pain, buttock, lower abdomen, once in a while
5Prenatal diet5.80Eat, drink, prefer, meat, unthink, unable, fruit, spicy, sour, nutrition, appetite, breakfast
6Fetal heart and embryo bud5.72Fetal heart, embryo bud, check, B ultrasound, Last menstrual period, doctor, show, gestational sac, recheck, upgrowth, yolk
7Threatened miscarriage5.12Brown secretion, bleeding, hospital, restroom, find, suddenly, fetus protection, abortion, in hospital
8Fetus protection4.45Doctor, inspection, low progesterone, HCG, progesterone, recheck, take medicine, fetus protection, draw blood, take an injection, suggestion
9Share and exchange4.24Whether or not, expectant mother, expected date of confinement, the same kind, experience, early pregnancy, the same month, exchange, inform, Wechat group
10Household affairs4.22Husband, cry, mother-in-law, work, at home, think, really, not good, marriage, afterwards, mood

Topics in the third month of pregnancy.

4.1.4 Information needs in the fourth month

Thirteen topics are identified in the fourth month of pregnancy. Table 7 presents the top ten topics in the fourth month of pregnancy. “The gender of baby” accounts for 18.98% of all topics. “Down’s syndrome,” “household affairs,” “nuchal translucency,” and “appeals and desire,” accounting for 9.18%, 7.50%, 7.42%, and 7.25%, respectively, were the second, the third, the fourth, and the fifth most frequent topics. The next five frequent topics are “abnormality in antenatal care,” “prenatal diet,” “fetal movement,” “question for help,” and “belly size and weight,” at 6.91%, 6.85%, 6.83%, 6.26%, and 6.25%, respectively.

TABLE 7

Topic nameRate (%)LDA keywords
1The gender of baby18.98Take a look, boy, girl, everyone, give a hand, curious, nuchal translucency, discern, the first pregnancy, the second pregnancy, want, son, daughter
2Down’s syndrome9.18Inspection, hospital, non-invasive prenatal testing, Down’s syndrome, screening, risks, amniocentesis, suggestions, draw blood, four-dimensional
3Household affairs7.50Husband, child, mother-in-law, mood, work, at home, the first child, home, not good
4Nuchal translucency7.42Nuchal translucency, once, doctor, baby, the first time, successfully, cooperate, finally, twice, make out
5Appeals and desire7.25Baby, hope, mother, healthy, smoothly, cheer, happy, antenatal care, successfully, love, anticipate, bless
6Abnormality in antenatal care6.91Doctor, inspect, worry, B ultrasound, placenta, problem, fetus, bleeding, secreta, upgrowth
7Prenatal diet6.85Eat, food, dislike, drink, hungry, not allowed, unthink, meat, have a meal, specially, loss of appetite
8Fetal movement6.83Feel, belly, move, night, fetal movement, sleep, lie, recently, seem, somewhile, always
9Question for help6.26Pregnant, normal, pain, have you ever, discern, fetal heart, circumstance, why, suddenly, cause, question
10Belly size and weight6.25Pregnant, 3 months, big stomach, almost 4 months, gain, weight, many kilograms, obviously pregnant

Topics in the fourth month of pregnancy.

4.1.5 Information needs in the fifth month

Eleven topics are extracted in the fifth month of pregnancy. Table 8 indicates the top ten topics in the fifth month of pregnancy. The top two frequent topics are “the gender of baby” and “Down’s syndrome,” at 12.73% and 11.97%. “Fetal movement,” “prenatal diet,” “pregnant women’s physical discomfort,” and “inspection of a large row of deformities” emerged to be the third, fourth, fifth, and sixth frequent topics, accounting for 9.93%, 9.26%, 9.11%, and 8.68%, respectively. The next four most frequent topics are “ponderal growth,” “experience sharing,” “household affairs,” and “appeals and desire,” accounting for 8.09%, 7.34%, 6.95%, and 6.26%, respectively.

TABLE 8

Topic nameRate (%)LDA keywords
1The gender of baby12.73Take a look, girl, boy, the second pregnancy, give a hand, curious, discern, the first pregnancy, son, daughter
2Down’s syndrome11.97Non-invasive prenatal testing, low risk, Down’s syndrome, smoothly, high risk, DNA, amniocentesis, threshold, suggestion, hope
3Fetal movement9.93Feel, fetal movement, obvious, fetal heart, the first time, seem, normal, once in a while
4Prenatal diet9.26Eat, emesis, pregnancy, food, prefer, everyday, drink, calcium tablet, constipation, meat, hungry, DHA
5Pregnant women’s physical discomfort9.11Night, sleep, legs, buttocks, pain, get up, lie, special, feel ill, uncomfortable, not good, difficulty in sleeping
6Inspection of a large row of deformities8.68Inspection, doctor, hospital, four-dimensional ultrasound, Down’s syndrome, placenta, appointment, antenatal care, Nuchal translucency
7Ponderal growth8.09Pregnant, over 4 months, big belly, weight, gain, kilogram, 5 months, first trimester, without getting fat
8Experience sharing7.34Pregnancy, expected date of confinement, sign in, catch a cold, recommendation, the same kind, exchange, chat, prepare, share
9Household affairs6.95Husband, child, mother-in-law, work, the first child, cry, unthink, in bad mood, look after a baby
10Appeals and desire6.26Baby, mother, hope, cheer, healthy, love, happy, anticipate, bless, birth

Topics in the fifth month of pregnancy.

4.1.6 Information needs in the sixth month

Sixteen topics are identified in the sixth month of pregnancy. Table 9 shows the top ten topics in the sixth month of pregnancy. The top two topics are “the gender of baby” and “four-dimensional ultrasound,” accounting for 18.65% and 17.38% of all topics. The following four topics, “pregnant women’s physical discomfort,” “prenatal diet,” “household affairs,” and “fetal movement,” comprise 7.53%, 6.33%, 6.03%, and 5.73%, respectively. “Appeals and desire,” “ponderal growth,” “glucose tolerance test,” and “sleep during pregnancy” accounted for 5.55%, 5.21%, 4.79%, and 4.04%, respectively.

TABLE 9

Topic nameRate (%)LDA keywords
1The gender of baby18.65Take a look, boy, girl, four-dimensional ultrasound results, everyone, give a hand, curious, guess, son, daughter
2Four-dimensional ultrasound17.38Inspection, fetus, four-dimensional ultrasound, worry, problem, normal, umbilical cord, recheck, relatively small
3Pregnant women’s physical discomfort7.53Fetal heart, restroom, bleeding, secreta, pain, feel ill, catch a cold, constipation, serious, afford no relief
4Prenatal diet6.33Eat, prefer, pregnancy, drink, food, calcium tablet, hungry, meat, DHA, breakfast, nutrition, anemia
5Household affairs6.03Husband, mother-in-law, work, at home, cry, unthink, everyday, marriage, boring, insist, work
6Fetal movement5.73Feel, fetal movement, obvious, sometimes, frequent, severe, kick, immovability, belly
7Appeals and desire5.55Baby, mother, hope, girl, boy, healthy, love, cheer, anticipate, birth, bless, all the best, safety
8Ponderal growth5.21Pregnant, over 5 months, kilogram, weight, big belly, fat, gain, control, 6 months
9Glucose tolerance test4.79Hospital, prepare, appointment, glucose tolerance test, expense, drink sugar water, blood glucose, empty stomach, high, normal
10Sleep during pregnancy4.04Night, difficulty in sleeping, everyday, stay awake, uncomfortable, lie, always, tired, sleeplessness, often

Topics in the sixth month of pregnancy.

4.1.7 Information needs in the seventh month

Thirteen topics are extracted in the seventh month of pregnancy. Table 10 shows the top ten most frequent topics, rates, and LDA keywords. The results present that “the gender of baby” and “pregnant women’s physical discomfort” emerged to be the first and the second most frequent topic, accounting for 21.67% and 13.34% of all topics, respectively. The following four topics are “sleep during pregnancy,” “ponderal growth,” “glucose tolerance test,” and “prenatal diet,” accounting for 7.19%, 6.63%, 6.58%, and 6.49%, respectively. “Items for childbirth,” “household affairs,” “appeals and desire,” and “fetal movement” then comprised 6.36%, 6.09%, 5.82%, and 5.77%, respectively.

TABLE 10

Topic nameRate (%)LDA keywords
1The gender of baby21.67Four-dimensional ultrasound results, give a hand, take a look, guess, boy, girl, the second pregnancy, the first pregnancy, want, curious
2Pregnant women’s physical discomfort13.34Mid-pregnancy, pain in the legs, pain in the buttocks, uncomfortable, tired, anesis, serious, method, constipation, feel ill
3Sleep during pregnancy7.19Night, sleep, everyday, not good, morning, get up, stay awake, restroom, always, sleeplessness, midnight
4Ponderal growth6.63Big belly, pregnant, kilogram, weight, gain, quick, 6 months, small, fat
5Glucose tolerance test6.58Glucose tolerance, drink sugar water, blood glucose, empty stomach, high, normal, accused of sugar, check, doctor, draw blood
6Prenatal diet6.49Eat, food, prefer, pregnancy, hungry, calcium tablet, fruit, emesis, nutrition, breakfast, meat
7Items for childbirth6.36Expected date of confinement, prepare, purchase, need, goods, maternity package, hospital, clothes, recommend, price, share
8Household affairs6.09Husband, mother-in-law, work, at home, everyday, cry, play with mobile phone, look after a baby
9Appeals and desire5.82Baby, mother, hope, love, healthy, cheer, anticipate, happy, birth, successfully
10Fetal movement5.77Feel, belly, fetal movement, special, normal, recently, obvious, more and more frequent

Topics in the seventh month of pregnancy.

4.1.8 Information needs in the eighth month

Sixteen topics are identified in the eighth month of pregnancy. Table 11 presents the top ten most frequent topics. As shown in the results, the top two topics are “the gender of baby” and “emotion sharing”, accounting for 10.36% and 10.31%. The following four topics, “prenatal care,” “sleep during late pregnancy,” “prenatal diet,” and “appeals and desire,” account for 7.94%, 7.15%, 6.93%, and 6.76%, respectively. The next four topics are “ponderal growth,” “pregnant women’s physical discomfort,” “household affairs,” and “preparation for delivery,” at 6.47%, 6.43%, 6.26%, and 5.88%, respectively.

TABLE 11

Topic nameRate (%)LDA keywords
1The gender of baby10.36Take a look, boy, girl, give a hand, the second pregnancy, four-dimensional ultrasound results, curious, daughter, want, the first pregnancy
2Emotion sharing10.31Nervous, anxiety, uncomfortably, smoothly, cheer, emotion, unthink, insist, at home, work, boring
3Prenatal care7.94Inspect, B ultrasound, amniocentesis, too large, too small, position of the fetus, normal, four-dimensional ultrasound, cord around neck, recheck, breech position
4Sleep during late pregnancy7.15Night, sleep, later pregnant trimester, difficulty in sleeping, get up, restroom, stay awake, daytime, wake up in midnight
5Prenatal diet6.93Eat, prefer, drink, hungry, anemia, food, pregnant women, emesis, constipation, meat, nutrition, breakfast, calcium tablet
6Appeals and desire6.76Baby, mother, hope, love, birth, healthy, anticipate, term delivery, father, meet, safety, all the best
7Ponderal growth6.47Expected date of confinement, kilogram, weight, awaiting delivery, control, fat belly, count down, gain
8Pregnant women’s physical discomfort6.43Pain, recently, feel ill, upset stomach, why, lie, later pregnant trimester, tired, sometimes, walk, sit, pain in public bone
9Household affairs6.26Husband, child, mother-in-law, look after the first child, cry, home, marriage, unthink, come back
10Preparation for delivery5.88Buy, hospital, need, clothes, pregnant women, breast pump, goods, recommend, child, prepare, price

Topics in the eighth month of pregnancy.

4.1.9 Information needs in the ninth month

Nineteen topics are extracted from the ninth month of pregnancy. Table 12 presents the top ten topics, rates, and LDA keywords. The results show that “prenatal care,” “the gender of baby,” and “emotion sharing” emerged to be the top three topics, accounting for 10.78%, 7.96%, and 7.13%, respectively. The next four most frequent topics are “items for childbirth,” “sleep during late pregnancy,” “fetal movement,” and “pregnant women’s physical discomfort” which comprised 6.27%, 6.16%, 6.10%, and 6.05%, respectively. “Prenatal diet,” “household affairs,” and “expected date of confinement” emerged to be the last three topics, at 5.72%, 5.40%, and 4.87%.

TABLE 12

Topic nameRate (%)LDA keywords
1Prenatal care10.78Doctor, fetal heart, monitor, prenatal care, B ultrasound, fetus, relatively small, normal, biparietal diameter, cord around neck, amniocentesis
2The gender of baby7.96Boy, girl, give a hand, take a look, name, curious, guess, four-dimensional ultrasound results, shape of belly
3Emotion sharing7.13Cheer, anticipate, sign in, count down, insist, the last month, emotion, finally
4Items for childbirth6.27Prepare, purchase, package for delivery, hospital, need, preparation for delivery, goods, clothes, price, delivery, recommendation
5Sleep during late pregnancy6.16Night, sleep, not good, sleeplessness, later pregnant trimester, restroom, daytime, tantalization, last night, midnight, awake
6Fetal movement6.10Belly, fetal movement, recently, whether or not, uterine constraction, terrible, frequent, sometimes, belly firmness
7Pregnant women’s physical discomfort6.05Pain, feel ill, lie, walk, pubis, tired, later pregnant trimester, sit, turn over, get up, buttocks, back pain
8Prenatal diet5.72Eat, pregnant women, prefer, drink, hungry, food, emesis, not allowed, morning, nutrition
9Household affairs5.40Husband, mother-in-law, at home, work, look after, unthink, marriage, child, cook, accompany
10Expected date of confinement4.87Expected date of confinement, pregnant, in advance, over 8 months, day, count, puerperal period, chat

Topics in the ninth month of pregnancy.

4.1.10 Information needs in the tenth month

Fourteen topics are identified in the tenth month of pregnancy. Table 13 presents the top ten topics in the tenth month. “Appeals and desire” emerged to be the most frequent topics, accounting for 21.17% of all topics. The following five topics, “delivery,” “expected date of confinement,” “pregnant women’s physical discomfort,” “full term,” and “prenatal care,” comprised 13.16%, 7.62%, 6.86%, 6.46%, and 6.19%, respectively. The next four topics are “sleep during late pregnancy,” “nutrition and weight during pregnancy,” “household affairs,” and “good things to recommend,” accounting for 5.96%, 5.64%, 5.28%, and 5.03%, respectively.

TABLE 13

Topic nameRate (%)LDA keywords
1Appeals and desire21.17Earnestly hope, eutocia, meet, no tear, no side out, safety, throes, super quick, uterine contraction, healthy
2Delivery13.16Uterine contraction, hospital, bleed, stomachache, amniorrhea, the opening of the cervix, boy, girl, have sons and daughters
3Expected date of confinement7.62Expected date of confinement, time, reaction, anxious, no action, steady, delay, 2 days
4Pregnant women’s physical discomfort6.86Belly, pain, feel, fetal movement, pubis, walk, lie, become hard, frequent, waist, sometimes
5Full term6.46Full term, get ready, anticipate, cheer, give birth, count down, finally, nervous, time, insist
6Prenatal care6.19Doctor, inspect, in hospital, amniocentesis, B ultrasound, prenatal care, fetal heart, normal, monitor, worry, fetus, umbilical cord
7Sleep during late pregnancy5.96Night, later pregnant trimester, everyday, difficulty in sleeping, feel ill, tired, tantalization, sleeplessness
8Nutrition and weight during pregnancy5.64Eat, pregnancy, nutrition, weight, grow, pregnant, biparietal diameter, drink, striae gravidarum, control, fat
9Household affairs5.28Husband, child, mother-in-law, look after the first child, at home, expense, work, cry, confinement in childbirth
10Good things to recommend5.03Purchase, compare, recommend, need, choose, paper diaper, clothes, body, prefer, pregnant women, share

Topics in the tenth month of pregnancy.

4.2 Summary of topic analysis about information needs

To more vividly show the main topics that pregnant women pay attention to during the whole pregnancy, we conducted a word cloud analysis on the LDA keywords of the topics during pregnancy. The results are presented in Figure 2. In word cloud statistics, word frequency is distributed by font size. As shown in Figure 2, the fonts of words such as “pregnancy,” “child,” and “fetus” are prominent, indicating that the topic of pregnancy is centered on pregnant women and babies. Secondly, the fonts of words such as “hospital,” “normal,” “doctor,” and “healthy” are also clearly displayed, indicating that obstetric examination is an important topic that pregnant women continue to pay attention to during pregnancy, which can help pregnant women to keep abreast of their physical status and fetal upgrowth. Then, words such as “pain,” “belly,” “good,” “eat,” “hungry,” “drink,” and “food” appeared frequently, reflecting pregnant women’s concerns about their physical condition and diet during pregnancy. Words such as “cheer,” “hope,” “happy,” “love,” “boy,” and “girl” reflect pregnant women’s good wishes for their babies and their curiosity about their babies’ gender.

FIGURE 2

FIGURE 2

The results of word cloud analysis.

5 Sentiment analysis

Sentiment analysis is performed to further understand the changes in pregnant women’s information-seeking behavior during pregnancy. As discussed earlier, we use Python to call the third-party library SnowNLP to calculate the sentiment value of each post text, and the range of sentiment value results is [0, 1]. Among them, a sentiment with a value greater than 0.5 is positive, and a sentiment less than or equal to 0.5 is negative. The closer the value is to 1, the more positive the emotion; the closer the value is to 0, the more negative the emotion. Figure 3 presents the posts with a sentiment value greater than 0.5 in each pregnancy month.

FIGURE 3

FIGURE 3

The results of sentiment analysis.

By combining the outcomes of topic analysis and sentiment analysis, the results show that in the first month of pregnancy, the number of posts is relatively small, mainly focusing on topics such as “test strip” and “pregnancy tests consultation,” and the proportion of emotionally positive posts is also relatively low. On the one hand, many pregnant women have not found out that they are pregnant in the first month of pregnancy; on the other hand, the first month of pregnancy is often unstable and at a loss for pregnant women, so their emotions are relatively negative.

The number of posts in the second month of pregnancy is the most, but the proportion of posts with positive emotions is also relatively low. In the second month of pregnancy, most pregnant women have already guessed or confirmed pregnancy, but new pregnant women have little knowledge about pregnancy. Therefore, posts about “precautions for early pregnancy,” “early pregnancy inspection,” “symptoms of early pregnancy” and other related early pregnancy topics surged. However, due to the uncertainty of the baby’s status and the lack of relevant knowledge of pregnant women, the proportion of emotionally positive posts in the second month of pregnancy is relatively low. After the first 2 months of relevant inspections and understanding of pregnancy knowledge, pregnant women have entered a relatively mature stage. At the same time, the status of the baby gradually stabilized, so the number of posts from the second month of pregnancy to the third month dropped significantly, and it continued to be stable until the ninth of pregnancy.

The proportion of emotionally positive posts from the third month to the ninth month of pregnancy is higher than that in other months, and there is an upward trend from the third month to the sixth month of pregnancy. The proportion is the highest in the sixth month of pregnancy, and then gradually decreases. After the third month of pregnancy, the baby’s state gradually stabilizes, the pregnant women’s belly gradually bulges, and the pregnant women can even feel the baby’s fetal movement, but there is generally no obvious physical discomfort, so the pregnant women’s emotions are relatively more positive. Since the seventh month of pregnancy, the baby’s weight increases, the pregnant women’s belly increases, the body gradually becomes clumsy, and the body also has various discomforts such as soreness and difficulty sleeping, so pregnant women show more negative emotions.

The number of posts in the tenth month of pregnancy surged again, second only to the second month of pregnancy, and the proportion of emotionally positive posts also dropped sharply, only higher than in the first of pregnancy. The tenth month of pregnancy is the month when the baby is about to be born. On the one hand, the pregnant women’s body aches and sleep problems are more prominent. On the other hand, pregnant women are faced with the uncertainty of childbirth, and a state of fear and anxiety appears. It can also be seen from the results of the topic analysis that in the current month, “appeals and desire” ranked first among the topics that pregnant women paid attention to, accounting for 21.17%. In addition, “expected date of confinement” and “pregnant women’s physical discomfort” are also the main contents of concern for pregnant women.

6 Discussion and conclusion

6.1 Summary of findings

The purpose of the current study was to investigate pregnant women’s information-seeking behavior. By a combination of descriptive analysis, topic analysis, and sentiment analysis, the current work expands our knowledge by proving important findings. The correlation analysis showed that more pregnant women contribute to more posts. Moreover, pregnant women with a college education or above are more likely to seek information about pregnancy on internet platforms. The more economically developed cities have higher Internet usage. Therefore, pregnant women will be more probable to use Internet platforms to seek information.

Furthermore, the topics from the first month to the tenth month of pregnancy were extracted in topic analysis. The findings show that the topics in different months of pregnancy relate to the present stages of pregnancy. The current paper identified six, nineteen, eighteen, thirteen, eleven, sixteen, thirteen, sixteen, nineteen, and fourteen topics in different months of pregnancy. The specific topics in different stages show the changes in pregnant women’s attention.

In addition, the sentiment analysis showed the variation of pregnant women’s emotions in information-seeking. The results of sentiment analysis show a peak of the posts in the second month of pregnancy. The proportion of emotionally positive posts reached its peak in the sixth month of pregnancy. Pregnant women’s emotional sentiment deeply interacts with the results of topic analysis.

6.2 Practical and theoretical implications

Our study presents theoretical and practical significance. First, this is one of the first studies to understand pregnant women’s information-seeking using the methods of data mining and text analysis. Previous studies on the information needs of maternal health revealed the topics that pregnant women pay attention to; however, the existing work is limited in the descriptive analysis and self-reported questionnaire data (Kamali et al., 2018; Ahmadian et al., 2020; Jin et al., 2020; Kassim, 2021). This study is unique by employing enormous quantities of data and the research data covers a long period. By visualizing the posts of every province, the geographical distribution of pregnant women’s posts was clearly displayed. The current study enriches our understanding of the relationships among pregnant women’s information-seeking, regional economic development level, and educational level.

Second, this study provides comprehensive research, involving abundant analysis. Compared with previous research (Kamali et al., 2018), the current work divides the data from the first month of pregnancy to the tenth month of pregnancy and analyzes the large amounts of data according to the pregnancy period. This study provides important insights for understanding the change of emotions during different pregnant stages and connecting the changes of emotions with the topics that cause pregnant women’s attention. The current work provides the perspectives for future research by the subdivision of data in different pregnant stages.

Third, the findings of this study have several practical implications. The findings indicate that pregnant women pay attention to different topics during various months of pregnancy. The maternal and infant-related websites should provide customized information recommendations for pregnant women according to their stages of pregnancy. For example, information such as precautions and inspection for early pregnancy should be recommended for pregnant women in the second month of pregnancy. Moreover, the proportion of emotionally positive posts reached its peak in the sixth month of pregnancy and is relatively low in the first and the tenth of pregnancy. The relevant government management departments and hospitals should concern about anxiety during early pregnancy and before delivery. The popularization of knowledge about pregnancy and childbirth would be useful for improving pregnant women’s emotions.

6.3 Limitations and future research

The study is subject to several inevitable limitations. First, the data source of this study is “Mama.cn” mainly located in China. What is now needed in the future is a cross-national study involving data for countries at different levels of development. The present study lays the groundwork for future research into pregnant women’s information-seeking behavior around the world. Future studies are encouraged to improve the generalizability of the current work by involving data from different countries and understanding the role of cultural identity in determining pregnant women’s information-seeking. Second, the data such as personal attributes and specific family environments are not included in the paper since such data cannot be obtained from the website. It would be interesting to investigate the effect of family-related variables on pregnant women’s emotional sentiment in future work.

Statements

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

KH provided the conceptualization, data collection, initial analysis, review and editing. TH worked on the results, methodology, and writing. All authors contributed to this study, read and agreed to the submitted version of the manuscript.

Funding

This study was funded by the College Youth Innovation Talent Project of Guangdong Province, China (No. 2022WQNCX099), the Higher Education Research Project sponsored by Guangdong Higher Education Academy (No. 22GQN14), and the Teaching and Research Project of Guangzhou Xinhua University (No. 2022J036).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1

    AhmadianL.KhajoueiR.KamaliS.MirzaeeM. (2020). Use of the Internet by pregnant women to seek information about pregnancy and childbirth. Inf. Health Soc. Care45 (4), 385395. 10.1080/17538157.2020.1769106

  • 2

    Al-DahshanA.ChehabM.MohamedA.Al-KubaisiN.SelimN. (2021). Pattern of internet use for pregnancy-related information and its predictors among women visiting primary healthcare in Qatar: A cross-sectional study. BMC Pregnancy Childbirth21, 747. 10.1186/s12884-021-04227-0

  • 3

    BleiD. M.NgA. Y.JordanM. I. (2003). Latent Dirichlet allocation. J. Mach. Learn. Res.3, 9931022.

  • 4

    BleiD. M. (2012). Probabilistic topic models. Commun. ACM55 (4), 7784. 10.1145/2133806.2133826

  • 5

    ChinnM. D.FairlieR. W. (2010). ICT use in the developing world: An analysis of differences in computer and internet penetration. Rev. Int. Econ.18 (1), 153167. 10.1111/j.1467-9396.2009.00861.x

  • 6

    FavarettoM.ShawD.De ClercqE.JodaT.ElgerB. S. (2020). Big data and digitalization in dentistry: A systematic review of the ethical issues. Int. J. Environ. Res. Public Health17 (7), 2495. 10.3390/ijerph17072495

  • 7

    HeD.YaoZ.ZhaoF.FengJ. (2020). How do weather factors drive online reviews? The mediating role of online reviewers’ affect. Industrial Manag. Data Syst.120 (11), 21332149. 10.1108/imds-02-2020-0121

  • 8

    JinH.WangH.GongC.LiuL. (2020). A study on the influencing factors of consumer information-seeking behavior in the context of ambient intelligence. J. Ambient. Intell. Humaniz. Comput.11 (4), 13971404. 10.1007/s12652-018-1005-y

  • 9

    KamaliS.AhmadianL.KhajoueiR.BahaadinbeigyK. (2018). Health information needs of pregnant women: Information sources, motives and barriers. Health info. Libr. J.35 (1), 2437. 10.1111/hir.12200

  • 10

    KassimM. (2021). A qualitative study of the maternal health information‐seeking behaviour of women of reproductive age in Mpwapwa district, Tanzania. Health info. Libr. J.38 (3), 182193. 10.1111/hir.12329

  • 11

    KorenčićD.RistovS.ŠnajderJ. (2018). Document-based topic coherence measures for news media text. Expert Syst. Appl.114, 357373. 10.1016/j.eswa.2018.07.063

  • 12

    Leon-SanzP. (2019). Key points for an ethical evaluation of healthcare big data. Processes7 (8), 493. 10.3390/pr7080493

  • 13

    LiD.RzepkaR.PtaszynskiM.ArakiK. (2020). Hemos: A novel deep learning-based fine-grained humor detecting method for sentiment analysis of social media. Inf. Process. Manag.57 (6), 102290. 10.1016/j.ipm.2020.102290

  • 14

    PanichellaA. (2021). A Systematic Comparison of search-Based approaches for LDA hyperparameter tuning. Inf. Softw. Technol.130, 106411. 10.1016/j.infsof.2020.106411

  • 15

    PianW.SongS.ZhangY. (2020). Consumer health information needs: A systematic review of measures. Inf. Process. Manag.57 (2), 102077. 10.1016/j.ipm.2019.102077

  • 16

    ReifegersteD.BlechS.DechantP. (2020). Understanding information-seeking about the health of others: Applying the comprehensive model of information-seeking to proxy online health information-seeking. J. Health Commun.25 (2), 126135. 10.1080/10810730.2020.1716280

  • 17

    ShahA. M.YanX.QayyumA.NaqviR. A.ShahS. J. (2021). Mining topic and sentiment dynamics in physician rating websites during the early wave of the COVID-19 pandemic: Machine learning approach. Int. J. Med. Inf.149, 104434. 10.1016/j.ijmedinf.2021.104434

  • 18

    WangL.NiuJ.YuS. (2019). SentiDiff: Combining textual information and sentiment diffusion patterns for Twitter sentiment analysis. IEEE Trans. Knowl. Data Eng.32 (10), 20262039. 10.1109/tkde.2019.2913641

  • 19

    XuG.MengY.QiuX.YuZ.WuX. (2019). Sentiment analysis of comment texts based on BiLSTM. IEEE Access7, 5152251532. 10.1109/access.2019.2909919

  • 20

    ZhangC.JiangJ.JinH.ChenT. (2021). The impact of COVID-19 on consumers’ psychological behavior based on data mining for online user comments in the catering industry in China. Int. J. Environ. Res. Public Health18 (8), 4178. 10.3390/ijerph18084178

Summary

Keywords

pregnancy, health information, text analysis, topic analysis, sentiment analysis

Citation

Hou K and Hou T (2022) Investigating pregnant women’s health information needs during pregnancy on internet platforms. Front. Physiol. 13:1038048. doi: 10.3389/fphys.2022.1038048

Received

06 September 2022

Accepted

17 November 2022

Published

28 November 2022

Volume

13 - 2022

Edited by

Frank Spradley, University of Mississippi Medical Center, United States

Reviewed by

Paula Tavares, University of Coimbra, Portugal

Alyssa Cheadle, Hope College, United States

Updates

Copyright

*Correspondence: Tingting Hou,

This article was submitted to Developmental Physiology, a section of the journal Frontiers in Physiology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics