ORIGINAL RESEARCH article

Front. Psychol., 30 June 2020

Sec. Cognitive Science

Volume 11 - 2020 | https://doi.org/10.3389/fpsyg.2020.01125

Are All Remote Associates Tests Equal? An Overview of the Remote Associates Test in Different Languages

  • Cognitive Systems Group, Human-Centered Computing, Freie Universität Berlin, Berlin, Germany

Article metrics

View details

15

Citations

7,3k

Views

1,7k

Downloads

Abstract

The Remote Associates Test (RAT, CRA) is a classic creativity test used to measure creativity as a function of associative ability. The RAT has been administered in various different languages. Nonetheless, because of how embedded in language the test is, only a few items are directly translatable, and most of the time, the RAT is created a new in each language. This process of manual (and in two cases, computational) creation of RAT items is guided by the researchers' understanding of the task. This paper focuses on the question of whether RAT datasets administered in different languages within the literature are comparable. To answer this question, datasets acquired using different RAT stimuli are analyzed qualitatively and quantitatively. Kruskal-Wallis tests are conducted to find out whether there is a significant difference between any of the datasets for a given time frame. Pairwise Mann-Whitney post-hoc tests are then used to find out which pairs are different. Significant differences are observed between 18 dataset pairings regarding Accuracy and between 16 in terms of Response Time. The potential sources of these differences are discussed, together with what this means for creativity psychometrics and computational vs. manual creation of stimuli.

1. Introduction

The Remote Associates Test is a creativity test that is often used in the literature (Mednick and Mednick, 1971; Ansburg and Hill, 2003; Ward et al., 2008; Cai et al., 2009; Cunningham et al., 2009). A RAT problem given to a participant contains three words, for example, Fish, Mine, Rush; the participant has to come up with a fourth word related to all of the three given words. In this case, Gold is an answer, because the compounds Goldfish, Gold Mine, Gold Rush can be built with it. For a human or a machine (Olteţeanu and Falomir, 2015) to solve the RAT, knowledge about the compound words of a language is needed.

Because solving the RAT relies on knowing various expressions and compound words from a language, native speakers have an advantage and are generally the target population when deploying the RAT. This gives rise to a need for different RAT stimulus sets in different languages.

As the RAT relies on knowledge and expressions that are language-specific, the RAT is, in most part, not translatable between languages. Exceptions to this are the rare cases in which all compounds required as knowledge by a RAT item in a specific language also exist in another language—for example, Goldfisch, Goldmine, Goldrausch as the German counterpart of the above-mentioned query.

As only a few items are translatable, RAT sets of items are created anew in each language by researchers. This means that RAT queries are probably impacted by the language itself and quite likely by the preferences and knowledge of compound words of the authors of the stimulus dataset. The Remote Associates Test (RAT) in the native language of the participants is administered in many creativity studies. Results reported in these studies are, therefore, impacted by the quality and difficulty of RAT items in each language. How can this impact be assessed?

No overview exists of human performance in the RAT/CRA in the different languages. Such an overview would help us understand whether significant differences exist between performance on different RAT problem sets in the various languages in which it is employed. If no significant differences exist, this may indicate that results reported for creativity studies that use the RAT in different languages are, indeed, cross-comparable. If a significant difference does exist, however, the comparability of the RAT as a tool across languages may require more nuance and the development of an understanding of the sources of this difference.

This paper sets out to construct an overview of the RAT across eight languages and two types of RAT (compound and functional) and to provide an initial comparative analysis between RAT sets across all of these languages. Section 2 introduces the different language datasets that will be used. The third section compares the RAT datasets quantitatively and qualitatively. In section 4, results are presented regarding the differences between language and gender. The fifth and last section discusses the results and gives a view of possible future work.

2. The Remote Associates Test and languages

Sets of RAT/CRA problems in the following languages were analyzed—please note that some languages have multiple datasets (D):

  • – German (Landmann et al., 2014)

  • – Chinese D1 (Shen et al., 2016)

  • – Chinese D2 (Wu and Chen, 2017)

  • – Italian (Salvi et al., 2016)

  • – Romanian (Olteţeanu et al., 2019b)

  • – Polish (Sobków et al., 2016)

  • – English D1 (Bowden and Jung-Beeman, 2003)

  • – English D2 (Olteţeanu et al., 2017)

  • – English D3 (Olteţeanu et al., 2019a)

  • – Finnish (Toivainen et al., 2019)

  • – Russian (Toivainen et al., 2019)

The Dutch (Chermahini et al., 2012) and both of the Japanese versions (Baba, 1982) and (Orita et al., 2018) had to be excluded because either the author was unreachable or the requested data were not sent to us in time.

3. Remote Associates Test comparison

A qualitative and quantitative comparison of the above-mentioned RAT datasets is provided in the next sections.

3.1. Qualitative Comparison

English datasets D2 and D3 contain different types of items: compound vs. functional. For compound items, the relationship between the three given words and the answer word is a relationship manifested in language—for example, Gold Fish, Gold Mine, and Gold Rush are compounds that all appear in language. By contrast, the relationship between functional query words and the answer reflects a functional relationship between these words, and there may or may not be a compound linguistic relationship. For example, the relationship between Clockwise and Right or Wrong and Right is a functional relationship. Of the above datasets, English D3 is functional.

Independent of the compound/functional classification, RAT problems have also been divided into two types based on the order of the words: homogeneous and heterogeneous items. RAT items are homogeneous if the solution word is either a prefix or a suffix to all three of the words in the problem (like in the query Fish, Mine, Rush, where Gold acts as a prefix to each of the query items). Problems are heterogeneous if the solution word is the prefix for some of the words and the suffix to other words in the problem (e.g., in the query River, Note, Account, the answer Bank is a suffix for the first word and a prefix for the other two).

Of the above datasets, the German, Italian, and English D1 distinguish between heterogeneous and homogeneous queries. ANOVAs with task type as a factor were run by the dataset authors on these sets. The task-type factor showed no significant effect on accuracy (the number of queries solved by the participants). Only in the German version was a significant effect of the task-type factor on reaction times observed.

Because of the linguistic differences between Chinese and English, the Chinese authors came up with a character pairing method rather than compound words. In the authors' example, 生(to generate), 天(the sky), and 溫(warm) paired with the solution creates three actual two-character words. The answer, in this case, would be氣(air), and the resulting two-character words are生氣(anger),天氣(weather), and氣溫(temperature).

The Chinese D2 distinguished not between heterogenous and homogeneous but between heteronym and non-heteronym words. A heteronym is a word that has the same spelling but different pronunciation and meaning, for example, desert (arid region)/desert (leave). They found that the pass rate on heteronymous items was lower for the 20 and 30-s time limit condition but that the response time was not, indicating that heteronymous items were more difficult.

3.1.1. Test Item Creation

In the Italian study, 150 CRA items inspired by Mednick (1962) were initially tested and then reduced to 122 items by filtering out items that were always or never solved. At the beginning, the German study also contained 150 items. Its creation was based on the original of Bowden and Jung-Beeman (2003) and was later filtered down to 130 items because 13 items had multiple solutions and 7 contained unclear words. The approach of the Romanian study was to first translate items from Bowden and Jung-Beeman (2003) and Salvi et al. (2016). If the translation was impossible (most items), the item was adapted or a single translated word out of the item was used as a seed for the creation of a new item. Afterward, the 198 created items were rated by the authors and five student volunteers in terms of how suitable they were, and then the dataset was reduced to the 111 most suitable items. The Polish dataset was created based on the original items of Bowden and Jung-Beeman (2003) and first contained 50 triads. These were then further reduced to 25 with diverse difficulty and one dominating solution. A subsequent test resulted in another reduction to 17 triads because of low factor loadings. The 47 Finnish items were all created by the research team, whereas the 48 Russian ones contained 12 created items and 36 items adopted from Druzhinin (1999). The authors of the Chinese D1 selected, according to Sio and Rudowicz (2007), 192 out of 288 items previously constructed by Jen et al. (2004), with the criterion that no solutions were repeated or used as problem words. After another reduction based on relative difficulty, the dataset consisted of 128 items. The Chinese D2 authors designed 120 items based on Mednick and Mednick (1971), Bowden and Jung-Beeman (2003), and Jen et al. (2004), of which they finally used the 90 that had a pass rate above 0%.

Of the dataset items above, most are manually created. Exceptions to this are items from the English D2 and English D3 datasets. For English D2, (Olteţeanu et al., 2017) successfully attempted the computational creation of RAT items and compared the results with an existing (English D1) normative dataset. For English D3, (Olteţeanu et al., 2019a) applied a computational approach using a new type of language knowledge for the creation of functional items, thus resurrecting an older idea of Worthen and Clark (1971) regarding the existence of such items and their differences from compound items. These items are compared to compound items of a subset of English D1 in the paper. This subset—specifically 24 items from English D1—is marked as Bowden, J.-B. in Figures 1, 2.

Figure 1

Figure 2

3.2. Quantitative Comparison

In the following, a descriptive statistics overview of the different datasets is provided.

3.2.1. Descriptive Data

The various RAT datasets contained varying numbers of items, between 17 (Polish) and 144 (English D1). An exception is comRAT-G, which computationally creates 13.4 m items and the frequency-based probabilities of solving them. Furthermore, the various items were deployed either (a) by giving participants different time frames to solve each query, between 2 and 60 s or (b) without setting a time limit. Since 2, 5, 7, 20, and 60-s time frames were only used once across these datasets, only items with a 15 or 30-s time frame or no time frame are analyzed in this paper. Assuming that different solving strategies may be deployed for different time frames, we did not want to average across time frames. The stimuli were deployed on populations of various sizes, with n ranging between 26 participants in the English D3 S1 and 317 in the Italian dataset.

As shown in Table 1, Figures 1, 2, the easiest sets to solve were the Chinese D1, with 0.58 accuracy, and the Italian, with a response time of only 6.52 s. The hardest sets seem to be the Chinese D2, with an average accuracy of 0.26 within a 20-s time frame, and the Finnish dataset in terms of response times, with a mean of 37.34 s. The response times of the Russian RAT were also noticeably higher that for the rest (23.53 s). Please note that means and standard deviations were calculated for this paper from the given data where they were not provided by the initial dataset authors.

Table 1

TimeAccuracyRT [s]Cron-
FrameSum%Per itemBach'sα
Languagein s|x|nsssAcc.RT
German both601308054.9934.97442716.977.12
heterogeneous60568026.1015.79472818.506.70
homogeneous60748030.1919.17412615.807.50
German both30130803927
German both15130803027
Chinese D13012812374.4658259.743.130.92
Chinese D2 both30907125.262815.314.14
Non-heteronymous30607118.072415.49
Heteronymous3030717.193015.21
Chinese D2 both20909323.45269.772.17
Non-heteronymous20609316.762210.01
Heteronymous2030936.69289.65
Italian both1512231747.5828.0639236.521.46
Heterogeneous156631725.4814.723922
Homogeneous155631722.1213.444024
RomanianNone1116359.9447.73544315.3710.530.930.97
Polish30172066.903.90412314.023.060.79
English D1 both3014428972.72512510.453.47
Heterogeneous305928929.7450
Homogeneous308528942.9351
English D1 both1514428931227.261.65
English D2 bothNone10011352.6416.1653160.940.99
comRAT-GNone5011326.207.03521414.529.890.850.99
Bowden, J.-B.None5011326.4111.24532316.5612.840.930.99
English D3 S1 fRATNone752635.277.99471113.918.42
comRAT-GNone502625.027.26501512.386.23
English D3 S2 fRATNone486117.105.77361214.1413.390.790.90
Compound bothNone486115.857.60331611.6810.960.870.96
comRAT-GNone24617.253.72301611.0010.620.750.93
Bowden, J.-B.None24618.615.06362111.640.650.850.92
FinnishNone476721.605.30461137.3417.360.73
RussianNone486726.606.90551423.5310.380.83

Number of elements (|x|), sample size (n), mean (), and standard deviation (s) of accuracy and response time and Cronbach's α for the RAT in different languages.

S1 and S2 reflect different studies of the same article.

The age, level of education, and gender of the participants taking the different RATs also varied, as shown in Tables A1A5, and Figure 3. For example, 70% of the participants of the Russian RAT were between 20 and 29 years old, whereas over 50% of the English D3S2 were between 30 and 39 years old. The Romanian RAT had the most equal gender ratio, at nearly 50/50, while the Finnish had the worst, with 90% females. Table 1 gives an overview of all of the datasets and various descriptive metrics across all languages.

Figure 3

3.2.2. Cronbach's Alpha

Cronbach's alpha is the most commonly used method for estimating the reliability of a test, as reflected by its internal consistency between items. Scores below 0.5 indicate an unacceptable internal consistency, whereas higher scores indicate a better one. Generally, scores above 0.7 are considered to reflect an acceptable amount of reliability, and an α above 0.9 is excellent. The Cronbach's α scores were calculated by authors for some of the initial papers (see Table 1) and vary between 0.73 and 0.99.

4. Results

4.1. Language

In order to find out whether differences between results for different languages exist at all, Kruskal-Wallis Tests were conducted for different timesteps and on two existing performance metrics: Accuracy and Response Time. To further investigate which of the language pairings were different, we used pairwise Mann-Whitney tests with Bonferroni-Holm correction as post-hoc tests. Heterogeneous and homogeneous items were tested both separately and combined (where possible).

4.1.1. Accuracy

We found a significant effect of group on value for the 30-s time frame [p < 0.0001], the 15-s time frame [p < 0.001], and for no time frame [p < 0.01]. Post-hoc tests showed significant differences of means regarding the Accuracy metric for 18 different dataset pairings in different time frames (Table 2). For example, a significant difference exists between Italian vs. German in a 15-s time frame (p = 0.00062, α = 0.01667).

Table 2

TimeDataset pairHolm's method
FramepRankαSig
15 sItalianGerman0.000630.0167Yes
ItalianEnglish D10.001320.025Yes
GermanEnglish D10.469410.05No
30 sChinese D1Chinese D2<.0001210.0024Yes
Chinese D1Chinese D2 n.h.<.0001200.0025Yes
Chinese D1Chinese D2 het.<.0001190.0026Yes
Chinese D2English D1<.0001180.0028Yes
Chinese D1German<.0001170.0029Yes
Chinese D2 het.English D1<.0001160.0031Yes
Chinese D2 n.h.English D1<.0001150.0033Yes
Chinese D2 het.Polish0.0003140.0036Yes
Chinese D1English D10.0006130.0038Yes
Chinese D2Polish0.0006120.0042Yes
Chinese D1Polish0.0035110.0045Yes
Chinese D2 n.h.Polish0.0037100.005Yes
English D1German0.003790.0056Yes
Chinese D2German0.014180.0063No
Chinese D2 het.German0.018370.0071No
Chinese D2 n.h.German0.088960.0083No
English D1Polish0.249850.01No
Chinese D2 het.Chinese D2 n.h.0.320640.0125No
GermanPolish0.404830.0167No
Chinese D2Chinese D2 het.0.482220.025No
Chinese D2Chinese D2 n.h.0.655910.05No
NoneEnglish D3 S2 fRATRomanian<0.0001100.005Yes
English D3 S2 fRATRussian0.000690.0056Yes
English D2English D3 S2 fRAT0.005480.0063Yes
English D3 S2 fRATFinnish0.030170.0071No
FinnishRussian0.069860.0083No
FinnishRomanian0.088450.01No
English D2Finnish0.351540.0125No
English D2Russian0.758330.0167No
RomanianRussian0.903320.025No
English D2Romanian0.949210.05No

Results of Mann-Whitney testing with Bonferroni-Holm correction regarding the Accuracy.

4.1.2. Response Time

We found a significant effect of group on value for the 30-s time frame [p < 0.0001] and for no time frame [p < 0.0001]. Post-hoc tests using Mann-Whitney tests with Bonferroni-Holm correction showed significant differences of means regarding the Response Time metric for 16 different dataset pairings in different time frames (Table 3). For example, a significant difference was noted between English D2 vs. English D3 S2 fRAT with no time frame (p = 0.0095, α = 0.0125).

Table 3

TimeDataset pairHolm's method
FramepRankαSig
15 sEnglish D1Italian<0.000110.05Yes
30 sChinese D2Chinese D1<0.0001150.0033Yes
Chinese D2English D1<0.0001140.0036Yes
Chinese D2 n.h.Chinese D1<0.0001130.0038Yes
Chinese D2 n.h.English D1<0.0001120.0042Yes
Chinese D2 het.Chinese D1<0.0001110.0045Yes
Chinese D2 het.English D1<0.0001100.005Yes
Chinese D1Polish<0.000190.0056Yes
English D1Polish<0.000180.0063Yes
Chinese D1English D10.120170.0071No
Chinese D2 n.h.Polish0.238460.0083No
Chinese D2Polish0.283850.01No
Chinese D2 het.Polish0.517640.0125No
Chinese D2 het.Chinese D2 n.h.0.901830.0167No
Chinese D2Chinese D2 het.0.930420.025No
Chinese D2Chinese D2 n.h.0.955810.05No
NoneFinnishRomanian<0.0001100.005Yes
FinnishEnglish D3 S2 fRAT<0.000190.0056Yes
FinnishEnglish D2<0.000180.0063Yes
RussianEnglish D3 S2 fRAT<0.000170.0071Yes
RussianRomanian<0.000160.0083Yes
FinnishRussian<0.000150.01Yes
English D2English D3 S2 fRAT0.009540.0125Yes
English D3 S2 fRATRomanian0.070130.0167No
English D2Romanian0.074920.025No
English D2Russian0.137010.05No

Results of Mann-Whitney testing with Bonferroni-Holm correction regarding the RT.

4.2. Gender

In order to measure differences between genders, Welch's unequal variances t-test was conducted to measure the difference between means on the two existing performance metrics: Accuracy and Response Time. Moreover, Cohen's d was calculated to measure the effect size.

4.2.1. Differences Between Genders

Significant differences of means for Accuracy with medium effect sizes were observed between genders in:

  • Romanian; t(59.47) = 2.29*, male M = 55.25, female M = 64.61

  • English D3; t(24.17) = 2.21*, male M = 29.78, female M = 37.88

as shown in Table A5, but no differences were observed regarding the Response Time. The authors of the Chinese D2 and an older Italian version (Salvi et al., 2015) also stated that gender was not a factor in their experiments.

5. Discussion and Further Work

This paper set out to compare the RAT in different languages and across different datasets. Significant differences were observed between multiple languages and datasets on both the Accuracy and Response Time performance metrics.

The significant difference observed between the English D2 and English D3 sets may have as a source the difference between types of items (compound vs. functional).

In the cases in which a significant difference exists between different language datasets, the main potential causes are:

  • different population samples are more creative (or at least better at the associative factor in creativity),

  • the RAT is more difficult in some languages because of the language itself and the cognitive factors resulting from encoding linguistic knowledge and solving the RAT in that language, and/or

  • sets of RAT queries vary in difficulty because they are created without using standardized methods and thus depend on the inspiration and knowledge base of the researchers creating them, or

  • the lack of a common time frame.

Other causes could be, as pointed out by our reviewers, differences in the instructions/explanation of the task, in participants' motivations, in the study setting (e.g., fMRI scanner/EEG, etc.), in other tasks performed during the same session, and in whether solution feedback was given, and, also, the associations between the items themselves might affect the difficulty (Luft et al., 2018).

This initial investigation shows that differences between results obtained with the RAT in different languages need to be addressed in more detail. Before cross-comparison of creativity results can be performed, the source of these differences needs to be found. Experimental or analytical setups need to be designed in order to establish which one of the above-mentioned causes, or what combination thereof, is the source of the differences.

An initial thought on establishing comparability could be to attempt to find items that are translatable across the various languages. By keeping stimulus items constant, differences in creativity pertaining to the population or use of language could be established.

However, even if translatable, the same RAT items may not be of the same difficulty in different languages. Some light is shed on this by computational models like comRAT-C (Olteţeanu and Falomir, 2015), essentially models of memory search, which can solve the RAT by organizing their knowledge in a semantic net-like structure, propagating activation through word associations and convergence. comRAT-C's probability of solving a query correlates with human performance. Such models indicate that, even if different RAT queries can be translated in different languages, equivalence does not necessarily exist between them: the number of word associates and the strength of association may not be the same in different languages. Different tools may thus need to be used to try to establish query equivalence.

A potential solution may be to establish a stronger item equivalence in computational terms: for example by using computational RAT query generators like comRAT-G (Olteţeanu et al., 2017) to create sets of items where a high degree of control can be maintained over the number of associates and the association strength of the query words. Such approaches have already proven fruitful in the deployment of more precise empirical designs (Olteţeanu and Schultheis, 2017) and in the creation of other types of items (Olteţeanu et al., 2019a). We have not yet attempted to generate comparable RAT stimulus sets in different languages. To apply the computational approach above for RAT generation in multiple languages would require initial sets of word associations or n-grams for each of the respective languages together with data on how often the n-grams occur within a specific dataset or the frequency of eliciting a particular associate (if a certain number of participants is asked to produce associates).

Another direction of future work would be to establish a creative association measure that transcends the constraints of language such as a visual Remote Associates Test—some work in this direction has already been done by Olteţeanu et al. (2015) and Toivainen et al. (2019). As one of our reviewers very interestingly points out, visual information, though not as varied as language, nonetheless varies in different cultures. The visual RATs would thus not be completely immune to differences, for example, when apple trees are more common in some parts of the world and mango trees in others (our reviewer's example) or when certain objects are more likely to exist, be used, or be central in various cultures. However, these difference may be smaller than linguistic differences for specific sets of objects, and the visual RAT may thus provide a measure with stronger comparability across languages.

This paper gives an overview of RAT datasets in multiple languages and shows that cross-linguistic comparability should not be taken for granted in the case of this broadly used creativity test.

Statements

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

Author contributions

A-MO contributed the conception and design of the study and wrote the first draft of the manuscript. JB performed the statistical analysis. JB and A-MO wrote sections of the manuscript. All authors contributed to manuscript revision and read and approved the submitted version.

Funding

The support of the Deutsche Forschungsgemeinschaft (DFG) for the project CreaCogs via grant OL 518/1-1 and the Open Access Funding provided by the Freie Universität Berlin are gratefully acknowledged.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix

Table A1

ERCHID2ITAPOLROM
Mage23.4122.6725.3025.10
SDage2.933.248.307.60
rangeage20–3118–3416–6518–5518–70

Mean, standard deviation, and range of participant age.

Table A2

ROMED2ED3S1ED3S2FINRUS
<204.81.80.02.00.04.5
20–2942.929.211.523.022.470.1
30–3930.131.953.821.023.922.4
40–4914.315.011.520.028.31.5
50–596.317.723.125.020.91.5
59 <1.64.40.010.04.50.0

Percentage of participants in certain age ranges.

Table A3

ROMED2ED3S1ED3S2
Secondary school0.06.211.55.0
High school diploma14.323.926.925.0
Enrolled in undergraduate courses17.517.715.47.0
Completed undergraduate courses50.830.123.152.0
Enrolled in postgraduate courses4.85.311.52.0
Completed postgraduate courses12.716.811.510.0

Percentage of participants with a certain level of education.

Table A4

GERCHID2ITAPOLROMED2ED3S1ED3S2FINRUS
Female68.856.284.968.048.563.784.672.189.674.6
Male31.243.815.132.051.536.315.427.910.425.4

Percentage of participants with certain gender.

Table A5

ROM femaleENG D3 S2 fRAT female
tdfpdtdfpd
ROM male2.2959.470.030.58
ENG D3 S2 fRAT male2.2124.170.040.70

Welch test results for accuracy without a time frame between genders.

References

  • 1

    AnsburgP. I.HillK. (2003). Creative and analytic thinkers differ in their use of attentional resources. Pers. Individ. Diff.34, 11411152. 10.1016/S0191-8869(02)00104-6

  • 2

    BabaY. (1982). JARAT FORM A-remote associates test. Jpn. J. Psychol.52, 330336. 10.4992/jjpsy.52.330

  • 3

    BowdenE. M.Jung-BeemanM. (2003). Normative data for 144 compound remote associate problems. Behav. Res. Methods35, 634639. 10.3758/BF03195543

  • 4

    CaiD. J.MednickS. A.HarrisonE. M.KanadyJ. C.MednickS. C. (2009). Rem, not incubation, improves creativity by priming associative networks. Proc. Natl. Acad. Sci. U.S.A.106, 1013010134. 10.1073/pnas.0900271106

  • 5

    ChermahiniS. A.HickendorffM.HommelB. (2012). Development and validity of a Dutch version of the Remote Associates Task: an item-response theory approach. Think. Skills Creat.7, 177186. 10.1016/j.tsc.2012.02.003

  • 6

    CunninghamJ. B.MacGregorJ.GibbJ.HaarJ. (2009). Categories of insight and their correlates: an exploration of relationships among classic-type insight problems, rebus puzzles, remote associates and esoteric analogies. J. Creat. Behav.43, 262280. 10.1002/j.2162-6057.2009.tb01318.x

  • 7

    DruzhininV. N. (1999). Psychology of General Abilities. St. Petersburg: Publishing House Peter.

  • 8

    JenC.-H.ChenH.-C.LienCho (2004). The development of the Chinese remote association test. Res. Appl. Psychol.21, 195217.

  • 9

    LandmannN.KuhnM.PiosczykH.FeigeB.RiemannD.NissenC. (2014). Entwicklung von 130 deutsch sprachigen Compound Remote Associate (CRA)-Wortraetseln zur Untersuchung kreativer Prozesse im deutschen Sprachraum. Psychol. Rundschau65, 200211. 10.1026/0033-3042/a000223

  • 10

    LuftC. D. B.ZiogaI.ThompsonN. M.BanissyM. J.BhattacharyaJ. (2018). Right temporal alpha oscillations as a neural mechanism for inhibiting obvious associations. Proc. Natl. Acad. Sci. U.S.A.115, E12144E12152. 10.1073/pnas.1811465115

  • 11

    MednickS. (1962). The associative basis of the creative process. Psychol. Rev.69, 220232. 10.1037/h0048850

  • 12

    MednickS. A.MednickM. (1971). Remote associates test: Examiner's manual. Houghton Mifflin.

  • 13

    OlteţeanuA.-M.FalomirZ. (2015). comrat-c : a computational compound remote associate test solver based on language data and its comparison to human performance. Pattern Recogn. Lett.67, 8190.

  • 14

    OlteţeanuA.-M.GautamB.FalomirZ. (2015). Towards a visual remote associates test and its computational solver, in Proceedings of the Third International Workshop on Artificial Intelligence and Cognition 2015, Vol. 1510 (Turin: CEUR-Ws), 1928.

  • 15

    OlteţeanuA.-M.SchoettnerM.SchuberthS. (2019a). Computationally resurrecting the functional Remote Associates Test using cognitive word associates and principles from a computational solver. Knowl. Based Syst. 168, 19. 10.1016/j.knosys.2018.12.023

  • 16

    OlteţeanuA.-M.SchultheisH. (2017). What determines creative association? revealing two factors which separately influence the creative process when solving the remote associates test. J. Creat. Behav. 53, 389395. 10.1002/jocb.177

  • 17

    OlteţeanuA.-M.SchultheisH.DyerJ. B. (2017). Computationally constructing a repository of compound Remote Associates Test items in American English with comRAT-G. Behav. Res. Methods50, 19711980. 10.3758/s13428-017-0965-8

  • 18

    OlteţeanuA.-M.TaranuM.IonescuT. (2019b). Normative data for 111 compound Remote Associates Test problems in Romanian. Front. Psychol. 10:1859. 10.3389/fpsyg.2019.01859

  • 19

    OritaR.HattoriM.NishidaY. (2018). Development of a Japanese remote associates task as insight problems. Jpn. J. Psychol.89, 376386. 10.4992/jjpsy.89.17201

  • 20

    SalviC.BricoloE.FranconeriS. L.KouniosJ.BeemanM. (2015). Sudden insight is associated with shutting out visual inputs. Psychon. Bull. Rev.22, 18141819. 10.3758/s13423-015-0845-0

  • 21

    SalviC.CostantiniG.BricoloE.PeruginiM.BeemanM. (2016). Validation of Italian rebus puzzles and compound remote associate problems. Behav. Res. Methods48, 664685. 10.3758/s13428-015-0597-9

  • 22

    ShenW.YuanY.LiuC.YiB.DouK. (2016). The development and validity of a Chinese version of the compound remote associates test. Am. J. Psychol.129, 245258. 10.5406/amerjpsyc.129.3.0245

  • 23

    SioU. N.RudowiczE. (2007). The role of an incubation period in creative problem solving. Creat. Res. J.19, 307318. 10.1080/10400410701397453

  • 24

    SobkówA.PołećA.NosalC. (2016). Rat-pl- construction and validation of polish version of remote associates test. Stud. Psychol.54, 113. 10.2478/V1067-010-0152-2

  • 25

    ToivainenT.OlteţeanuA.-M.RepykovaV.LihanovM.KovasY. (2019). Visual and linguistic stimuli in the Remote Associates Test: a cross-cultural investigation. Front. Psychol.10:926. 10.3389/fpsyg.2019.00926

  • 26

    WardJ.Thompson-LakeD.ElyR.KaminskiF. (2008). Synaesthesia, creativity and art: what is the link?Br. J. Psychol.99, 127141. 10.1348/000712607X204164

  • 27

    WorthenB. R.ClarkP. M. (1971). Toward an improved measure of remote associational ability. J. Educ. Meas.8, 113123. 10.1111/j.1745-3984.1971.tb00914.x

  • 28

    WuC.-L.ChenH.-C. (2017). Normative data for Chinese compound remote associate problems. Behav. Res. Methods49, 21632172. 10.3758/s13428-016-0849-3

Summary

Keywords

remote associates test, RAT, CRA, creativity, creativity evaluation and metrics, creativity test

Citation

Behrens JP and Olteţeanu A-M (2020) Are All Remote Associates Tests Equal? An Overview of the Remote Associates Test in Different Languages. Front. Psychol. 11:1125. doi: 10.3389/fpsyg.2020.01125

Received

02 June 2019

Accepted

04 May 2020

Published

30 June 2020

Volume

11 - 2020

Edited by

Eddy J. Davelaar, Birkbeck, University of London, United Kingdom

Reviewed by

Edward Bowden, University of Wisconsin-Parkside, United States; Caroline Di Bernardi Luft, Queen Mary University of London, United Kingdom

Updates

Copyright

*Correspondence: Jan Philipp Behrens Ana-Maria Olteţeanu

This article was submitted to Cognitive Science, a section of the journal Frontiers in Psychology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics