ORIGINAL RESEARCH article

Front. Res. Metr. Anal., 11 April 2023

Sec. Patent Analytics

Volume 8 - 2023 | https://doi.org/10.3389/frma.2023.978249

Should an R&D manager refer to distant technical fields? The effectiveness of new combinations with knowledge from different technical fields through the quantitative analysis of patent data related to NetZero

  • Ph.D. Program, Graduate School of Business Administration, Hitotsubashi University, Tokyo, Japan

Abstract

This study showcases a technique to categorize NetZero-related patent applications into three technical fields according to the degree of proximity between claimed inventions and cited inventions by comparing technological classifications between the patent applications and cited applications thereof. In this technique, the author first describes the existing methods used in previous studies. The technique proposed in this article is different from those of previous studies in that it is characterized by comparing the technical fields of not only the primary classification but also the subsequent classifications. This is made possible by using two patent classifications without having a specific classification corresponding to the middle hierarchy in between, rather than using three patent classifications with different hierarchies. This technique reduces the possibility that two applications, even if they are the same in their subsequent classification, will be judged as applications in different technical fields because they are in different classes in the primary classification. Using the proposed technique, the author examined the impact on the subsequent patent application of NetZero-related patent applications filed in Japan. As a result of the analysis, the author found that approximately 33% of subject applications, whose technical field differs from the backward citations when comparing the primary classification only, match one of the subsequent classifications when comparing them in consideration of the subsequent classifications as well. The author then found that these 33% of subject applications had a greater impact on subsequent patent applications than the remaining applications.

Introduction

This study showcases a technique to categorize NetZero-related patent applications into three technical fields according to the degree of proximity between claimed inventions and cited inventions by comparing technological classifications between the patent applications and cited applications thereof.

NetZero refers to a state in which the greenhouse gases including carbon dioxide going into the atmosphere are balanced by removal from the atmosphere. Achievement of NetZero is necessary to prevent further global warming because global warming is considered to be interlinked with climate change. In the 26th session of the Conference of the Parties to the United Nations Framework Convention on Climate Change (COP 26) held in November 2021, the participating countries proposed an action to reduce emissions.

The efforts for NetZero in Japan have already started 30 years ago. Fujii and Managi (2016) reported that more than 10,000 patent applications related to environmental protection technology have been filed every year since the 1990s with the increasing demands of the market and society for environmental protection and mitigation of climate change (Fujii and Managi, 2016, p. 4). Furthermore, the adoption of the Kyoto Protocol in 1997 spurred this trend of patent applications, and this trend seemed to continue. The OECD (2015) indicates that Japan has led the world in high-value invention in environmental technology (Fujii and Shirakawa, 2015, p. 3; Haščiči and Migotto, 2015, p. 27–35).

However, after 2009, when the global financial crisis arising from the bankruptcy of Lehman Brothers began to affect Japan, the number of Japanese patent applications gradually decreased until around 2015. Figure 1 is a graph showing changes in the number of Japanese patent applications in which “carbon neutral” and its related terms1 are described in the specification of patent applications filed over the 25 years from 1995 to 2020. As can be seen from this graph, the number of applications has declined since peaking around 2009 and has been almost flat since around 20152. COP21 was held in 2021 against a background of international interest in environmental issues rising again. The Japanese government took this opportunity to declare in October 2020 that Japan would realize carbon neutrality by 2050, and, in April 2021, the government set a very ambitious target of reducing greenhouse gas emissions by 46% by 2030. Coping with the issue of global warming has become one of the most important tasks for companies as well. In Japan, the public and private sectors are currently working together to develop new technologies for NetZero. Achieving the goals set by the Paris Agreement will require innovation that brings more change.

Figure 1

To address this challenge, the author focuses on two long-standing concepts in innovation. One is that recombination is one of the processes that leads to new innovations (Shumpeter, 1934; Nelson and Winter, 1982; Hargadon and Sutton, 1997). Schumpeter identified five cases of new combinations.3 The other is that important innovation involves the transfer of knowledge from one technical field to another. Fleming (2001) suggested that new combinations of distant knowledge may give rise to a path-breaking innovation. Miller et al. (2007) stated that obtaining a remote source of knowledge is important to revitalize existing knowledge and develop new capabilities. There are many studies suggesting that the use of distant knowledge leads to the development of breakthrough inventions (e.g., Cohen and Levinthal, 1990; Katila and Ahuja, 2002).

By combining technological components from more distant technological domains, the degree of novelty goes up (Keijl et al., 2016). However, with a further increase in novelty through the inclusion of components from more distant technological domains, the familiarity of the invention will commonly decrease as compared with combining within the same technical domain and building on existing knowledge (Shane, 2000). Fu et al. (2013), who examined the effect of the distance of such analogical design stimuli as “near” and “far” on design solution generation, showed that if the stimuli are too distant, they can become harmful to the design process.

The use of “distant knowledge” has also been discussed in the context of corporate search in comparison to local knowledge (e.g., March, 1991; Rosenkopf and Almeida, 2003; Kim et al., 2013). Local search is defined as knowledge search in closely related, familiar, and similar technical areas, whereas distant search is defined as a search for knowledge across diverse technical areas beyond the familiar technical area (March and Simon, 1958; Nelson and Winter, 1982; Stuart and Podolny, 1996; Rosenkopf and Nerkar, 2001; Kim et al., 2013). Distant search, in contrast to local knowledge, makes it possible to acquire new and external knowledge from distant technical fields (Tushman, 1977).

On the other hand, it has also been pointed out that the accumulated capabilities of firms can limit the scope of searches and their ability to understand and apply new knowledge (Cohen and Levinthal, 1990; Miller et al., 2007). It has been also pointed out that barriers to the search and transfer of knowledge across organizational boundaries arise from the accumulated competencies of firms and the cognitive constraints of individuals (Walsh, 1995; Miller et al., 2007) and shared routines developed from the unique histories of organizations (Nelson and Winter, 1982; Leonard-Barton, 1992; Garud and Rappa, 1994, Tripsas and Gavetti, 2000).4

Furthermore, recently, in addition to the well-known concepts of incremental and radical innovations, there is an idea to focus on adjacent innovations between them. Nagji and Tuff (2012, p. 4) who proposed this idea mentioned that “adjacent innovation” involves leveraging something established firms do well into a new area. The “adjacent innovation” is considered to be in line with a series of movements that have been attracting attention in Japan in recent years, in which Companies cross industry boundaries and enter various fields. They try to cultivate new fields by taking advantage of their own strengths in different fields.5

To make such an innovation that addresses the issue of global warming and promotes NetZero, should R&D managers refer to a more distant technical field? (Research Question).

In tackling this research question, the author believed that it would be useful to consider the concept of “adjacent innovation” as well. In other words, the author thought it would be beneficial not only to divide a series of patent applications into two technical fields, namely, the “same” technical field and the “distant” technical field but also to consider and analyze the technical fields located in the “neighboring fields”.

With this in mind, the author looked at previous literature showing previous methods of categorizing patent applications into three technical fields. However, some common issues were found in the existing methods: since they focus on the primary patent classification when comparing the technical fields of backward citations, the problem remains that even if the primary patent classifications are different from each other, it is not possible to distinguish the possibility of matching one of the subsequent classifications. Therefore, to solve the problem raised earlier in this article, the author proposed a method of categorizing a set of patent applications into three technical fields according to the degree of proximity between claimed inventions and cited inventions (backward citations).

This article proceeds as follows. After reviewing the previous studies that categorize patent applications into three technical fields using the classification of cited applications (backward citations), the author addresses issues common to the previous methods in Section Previous literature. Next, after deriving a hypothesis in Section Hypothesis, the author showcases how the proposed technique works in Section Proposed measurement method, and describes the data and methods for analysis in Section Data and method for analysis. After testing the hypothesis in Section Results, the author discusses the analysis results in Section Discussion. Finally, the author presents the conclusion of this study in Section Conclusion with limitations and suggests future research directions.

Previous literature

There is one natural way to examine the technological proximity between a patent and its citations by using the technical classifications listed on them, the value of which was suggested by Trajtenberg et al. (1997) and has been shown to be useful for assessing the nature and impact of inventions for patent applications (Fleming, 2001; Hall et al., 2001; Harrigan et al., 2017).

Trajtenberg et al. (1997) constructed a methodology for measuring the technical distance between a patent and its citations using a hierarchical structure of the U.S. patent classification system that consisted of three-digit patent classes, two-digit categories, and six very broad fields. Applying this methodology, they attempted to compare the “originality” of universities and companies. On the contrary, this methodology can be applied to indicate flows of knowledge from each cited patent to each focal patent. Next, the author focuses on previous studies that categorize the distance between the focal patent and its cited patents into three categories using backward citations and the patent classification. A representative study is conducted by Nemet and Johnson (2012).

Previous literature on analyzing knowledge streams from three technology domains

Nemet and Johnson (2012) referred to each citation from a focal patent to a previous patent as a citation pair and measured knowledge flowing from one technical domain to another by comparing, for each pair, the technical classification assigned to each cited patent with classification for the citing patent. They axiomatically categorize each patent–citation pair into three categories by comparing their classifications at each level in the hierarchy. For example, a pair is coded as “near” if they are in the same sub-class; a pair is set as “internal” if they are in the same class; a pair is set as “external” if they are in different classes; and a pair is set as “far external” if they are in different super-classes. This coding scheme is used to develop three variables using counts of backward citation pairs for each patent: far external, external, and near (Nemet and Johnson, 2012, p. 13). They minimized truncation bias by imposing a 10-year window on both forward and backward citations so that each patent has a similar weight (Nemet and Johnson, 2012, p. 10).

The results of the measurements showed significant effect in the two most highly cited individual technology categories. They found that external citations had a significant negative effect on forward citations for the category of computers and communications (Nemet and Johnson, 2012, p. 26). They also found that external citations were much less effective than other types of citations for the category of medical and drugs. The coefficient for far external was not actually negative, but the ratio comparing it to other citations was very small (Nemet and Johnson, 2012, p. 26). Conversely, for these most highly cited individual technologies, citing technically closer prior art had a stronger effect than far citations (Nemet and Johnson, 2012, p. 28).

Their explanation does not explicitly state that they used not only the primary classification but also the subsequent classifications. However, the phrase “our effort to use only high-level classes to compare results across multiple classification systems” (Nemet and Johnson, 2012, p. 30) is considered to imply that only primary classifications were used. They also count the total number of patents that the patent cites to define the citations made (Nemet and Johnson, 2012, p. 12). By using counts of backward citation pairs for each patent together with the abovementioned coding scheme, they developed three variables (far external i, external i, and near i) (Nemet and Johnson, 2012, p. 13). The counts of backward citation pairs are also used to compute the counts of other citations. Therefore, it is considered that the number of backward citations is taken into account in the development of these three variables. It is also understood, from Figure 2 and Table 1 provided by them, that “super-class” corresponds to “Section” in the case of the IPC. In addition, it is understood, from the coding scheme, that the pair coded as “far external” is used as the primary indicator of external knowledge flow and, thus, the IPC's eight sections correspond to “far external”.

Figure 2

Table 1

CategoriesNos.TitleIPC subclasses
Energy creation technology1Fuel cellsH01M8H01M4/86
2Solar cellsH01L31H01L51H01M14
3Wind motorsF03D
4Nuclear physics; nuclear engineeringG21
5Natural gas, liquid carbonaceous fuelC10LC10MC10GC10J
C10KC10BF17C
Processing, separation, mixing6SeparationB01D
7Catalysis or colloid chemistryB01J
Material processing, laminate8Layered productsB32B
9Shaping or joining of plasticsB29C
Organic macromolecular compounds10Other than by reactions only involving unsaturated carbon bondsC08G
11Compositions of macromolecular compoundsC08L
12Working-up; general processes of compoundingC08J
13By reactions only involving carbon unsaturated bondsC08F
Organic chemistry14Heterocyclic compoundsC07D
17Acyclic or carbocyclic compoundsC07C
Biotechnology15Microorganisms or enzymes; compositions thereofC12NC12P
Inorganic chemistry16Separation of hydrogen from mixtures containing thereofC01B
18Lime, magnesia, slag, cements, compositions thereofC04B
19Chemical composition of glasses, glazes or vitreous enamelsC03BC03C
Dyes20Coating compotitionsC09DC09K3C09K5
Metallurgy21AlloysC22C
22Coating metallic materialC23C
Heating (energy saving)23Fluid heaters, heat pump systems, heat-exchange apparatusF24HF25BF28DF28F
24Air-conditioning, air-humidificationF24F
Energy storing technology25Secondary batteriesH01M10H01M2H01M4 (excluding 4/86)
26System for supplying or distributing electric powerH02J
27Containers for storage or transport of materialsB65D
46Electrolysis: electrolytic or electrophoretic processesC25BC25CC25DC25F
Transporting technology28Controlling combustion enginesF02DF02M
29Control systems for hybrid vehiclesB60WB60LB60K
47Traffic control systemG08GG08BG08C
Conversion of electric power30Dynamo-electric machineH02K
31Control of electric motors/generatorsH02P
32Power converterH02M
Environmental aanalysis technology33Methods for supervisory or forecasting purposesG06Q50G06Q30G06Q10G06Q40
34Investigating or analyzing materialsG01N27G01N21G01N25G01N30
G01N33
Optical technology35Optical elements or systemsG02B1G02B5G02B6G02B26
G02B27
Display device36Circuit for control of indicating devices using static meansG09G
42Display device for variable informationG09F9
Printing technology37Photomechanical production of textured surfacesG03F
Environment technology38Treatment of wastewater or sludgeC02F
39Disposal of solid waste, recovery from waste materialB09BB09CB29B
40Clean and detoxify exhaust gasF01N
Energy saving technology41Electric heating; electric light source, especially organic ELH05B33
Greening technology43Agriculture, herbicide and pest controlA01GA01MA01N
Building technology45Foundations; excavations; embankmentsE02DE02FE21BE21C
E21DE21F
48General building constructions; wallsE04BE04DE04CE04F
Textile and fibers49Natural or man-made threads or fibers; spinningD01FD06FD06MD21C
D21HD03DD02G
Others44Drive device transmission mechanismF16H61F16H45F16H57F16H1
F16H25F16H41F16H63
50Spraying or atomising: applying fluent material to surfacesB05BB05CB05D

Top 50 NetZero-related technical fields.

Nemet (2012) used the same method of measurement as Nemet and Johnson (2012), but in contrast to them, he showed that for energy technologies the integration of technologically distant prior art (i.e., distant knowledge) has a stronger positive effect on knowledge flows than the integration of technologically near prior art (i.e., local knowledge). He found that coefficients on external citations were positive and significantly larger than those for other types of citations in the analysis of energy patents, whereas coefficients on citations to prior art that is technologically near have a negative effect on forward citation frequency (Nemet and Johnson, 2012, p. 1268). However, he found no significant difference between external citations and other citations in the analysis of all other patents including highly cited areas such as computers and medical patents, except for the category of electrical and electronics, which partially overlaps with energy technology (Nemet and Johnson, 2012, p. 1268). The energy technologies he targeted for patent analysis were defined according to the list of energy patent classes in Popp and Newell (2009, p. 1261).

Keijl et al. (2016) also measured the three knowledge flows from local, adjacent, and distant domains. They tracked recombination patterns in biotechnology patent applications by weighting the number of backward citations with their relative distance from a patent's focal domain. Although they showed that an intermediate level of recombination had the highest impact, the concept of “an intermediate level of recombination” they used is different from the concept of the combination with technologies in the neighboring fields referred to in this article. Rather, it should be noted that the concept refers to a combination of local and distant domains or a combination of near-adjacent and distant technologies (Keijl et al., 2016, p. 35). They argued that “an intermediate level of recombination” consisting of these combinations had the highest impact because it combines both novelty and familiarity (Keijl et al., 2016, p. 35, 36).

The technological distance between a sampled patent and its cited patents was determined as follows: when they are assigned to one of the three-digit biotechnology classes, this citation is categorized as a “near” citation. When the backward-cited patent is not assigned to a three-digit biotechnology class, but assigned to another class within “chemicals” or “drugs and medical”, both of which were some of the categories aggregated by Hall et al. (2001), this citation is categorized as an “adjacent” citation. When the backward-cited patent is not assigned to the “chemicals” or “drugs and medical” category, this citation is categorized as a “distant” citation. Next, the level of recombination was calculated by attaching different weights, respectively, to the three kinds of citations with near citations as “1”, adjacent citations as “2”, and distant citations as “3”. The sum of these weighted backward citations is then divided by the number of backward citations (Hall et al., 2001, p. 22).

They were aware that patents could be assigned to multiple patent classes, but only considered the number of patent classes as a controlling variable in their regression models. They controlled cross-reference classes other than the primary patent class used for the sampling strategy. This is because they viewed multiple patent classes as another possible indicator of the scope of a patent, and thus they considered that there was a possibility that patents assigned to multiple patent classes are cited by subsequent patents with a wider variety of patent classes. Therefore, this is different from the aim of this article to consider not only the primary classification but also the subsequent classifications of patents in order to analyze knowledge flows from other technical fields (Hall et al., 2001, p. 26).

Need for modified measurement methods of patent content proximity

All of the previous studies discussed earlier have simultaneously considered multiple levels of technological aggregation. However, there is still room for further improvement in existing measurement methods when analyzing patent data using patent classification and citation information.

The issue is that when comparing the technical fields of backward citations, only the primary patent classification assigned to them was considered and not the subsequent listed patent classes. Even if the primary classifications are different, there are many backward citations that the subsequent classifications have in common. As previous methods focus on only the primary patent classifications assigned to patents, the challenge remains that two patents common to any of the subsequent classifications assigned to the patents cannot be distinguished even if the primary patent classifications are different.

After this problem was pointed out in other contexts by Thompson and Fox-Kean (2005)6 and Benner and Waldfogel (2008), subsequent studies began to consider the use of additional classifications beyond the primary classification in order to reduce this problem in other measurement contexts (e.g., McNamee, 2013; Aharonson and Schilling, 2016; Kuhn, 2017). This is a problem that could commonly occur in the International Patent Classification (“IPC”) as well as the U.S. patent classification (“UPC”). As technology has become more complicated, there are many patent applications in which the two inventions differ in the first classification but match in the subsequent classifications.

Hypothesis

Here, the author would like to return to the research question posed in Section Introduction. To make such innovation that addresses the issue of global warming and promotes NetZero, should R&D managers refer to a more distant technical field as many existing studies emphasize?

The author believes that subsequent patent classifications, like the primary patent classification, have some impact on subsequent patent applications. Then, it is considered that the results of the analysis considering the subsequent patent classifications shift the peak from the distant field to the neighboring field compared to the results of using only the primary patent classifications. Therefore, the author sets the following hypothesis.

Hypothesis: There is a mountain-shaped relationship (an inverted U-shaped relationship) between the patent proximity between claimed invention and cited invention and the impact of the subject application on subsequent applications.

In the next section, the author describes the proposed technique of this article, which makes it possible to consider the subsequent patent classifications.

Proposed measurement method

In this section, the author explains how the methodological challenges discussed in the first section are solved by the technique proposed in this article.

Methodology for judging the incorporation of technologies from other fields using backward citations and patent classification

To identify patent applications for inventions incorporating knowledge in different technical fields, the author focuses on the possibility that “at least one” of the cited applications (backward citations) cited in the examination may be in a different technical field from the subject matter of the invention claimed in the subject patent application in the case that the invention incorporates technologies from other fields. This is based on the premise that the patent classification represents the subject matter of the claimed invention (Tsunoda, 2016, p. 267).

This is not limited to Japan but is commonly seen in patent examinations overseas as well. For example, Shane (2001) stated the following regarding the relationship between claimed inventions of U.S. patents and patent classifications assigned thereto:

The assignment of a patent to a particular patent class represents the U.S. Patent and Trademark Office's (USPTO's) assessment that the patent belongs in a particular technical field. Patent examiners also determine what previous inventions must be cited in a patent by searching prior patents. Because patents belong to technical classes and because they cite previous patents, citations to patents in particular technical fields represent the USPTO's assessment that a particular invention builds upon (cites) knowledge in that technical field. When a patent cites previous patents in classes other than the ones it is in, that pattern suggests that the invention builds upon different technical paradigms from the one in which it is applied.” (Shane, 2001, p. 210)

What should be taken into consideration here is that inventions often consist of multiple technologies and that two or more patent classifications are often assigned to one patent application (Tsunoda, 2016, p. 266). Therefore, according to the proposed technique, all patent classifications assigned to the present application (including the primary classification and the subsequent classifications) are compared with all patent classifications assigned to its backward citations. Thus, it is checked whether there is “at least one” of the applications (backward citations) cited in the examination that is assigned a patent classification different from the patent classification assigned to the present application. If there is such a backward citation, the hierarchical structure of the patent classifications is used to categorize the degree of proximity between the present invention and the cited application into three levels: “same,” “neighboring,” and “distant”.

Furthermore, to characterize the proximities in the technological space defined by the classification scheme, the patent classification must be a ruler that reflects the state of the current technology and shows a hierarchical structure with an appropriate number of classifications between layers. To achieve this, the author proposes to combine two existing technical classifications, which are explained later.

IPC-based aggregated technology classification

In the measurement technique proposed in this article, the Aggregated Technology Classification (“ATC”) based on the IPC is also used. This is described as one of the classification systems used in the WIPO statistical reports and matched to top-level categories in the NBER Patent Database with six technology types (Goto and Motohashi, 2005, p. 46; Goto and Motohashi, 2007, p. 1433). The IPC classification table is a set of classification items and has a structure in which all technical fields are arranged in a hierarchical, tree-like structure in different levels with codes, including sections, classes, and subclasses. In this article, the author proposes to use ATC, which does not have its own hierarchical structure, in combination with the IPC. There are two reasons for this. First, the ATC has an appropriate measurement range divided into 33 fields, as can be seen in Figure 2, which is narrower in its focus than the section level of the IPC with 8 sections and wider in its focus than the class level of the IPC with 118 classes. Second, the classification of the ATC is closer to the current state of technology than the IPC.

These reasons are supplemented with a comparative table (refer to Appendix I) that shows the relationship between the ATC and the IPC. In the table, the ATC technical field numbers (“ATC Nos”). One to Thirty-three are shown on the left, and the corresponding IPC sections and subclasses are shown on the right. In this table, pharmaceutical products (A61K), which are classified into IPC Section A representing Human Necessities together with Agricultural and Marine Products (A01) and pesticides/herbicides (A01N), are indistinguishable from them at the level of IPC section. In addition, enzymes (C12N9/) and genetic engineering (C12N15/) are classified together into the subclass C12N that represents microorganisms, and it is not possible to distinguish between the two at the subclass level. While genetic engineering is a state-of-the-art technology that is often subject to analysis, enzymes are essential technologies for food and medicine that produce Koji, yeast, and lactic-acid bacteria. Koji is the “national fungus of Japan” that is indispensable for fermenting sake, producing miso, soy sauce, and a whole range of traditional Japanese foods. In this way, the hierarchical structure of the IPC does not necessarily reflect the current state of the art in Japan.7 In contrast, the ATC has an independent classification for pharmaceuticals (No. 5) and separate classifications for biotechnology (No. 16) and genetic engineering (No. 17). Therefore, the ATC is more reflect the current state of technology than other classifications. It is neither too broad like IPC sections nor too detailed like IPC classes.

The proposed technique exemplifies the combination of the ATC and the IPC as classifications in different layers, but it is not limited to this combination. Rather, the feature to be emphasized is, instead of using three patent classifications with different hierarchies as in previous studies, the use of only two classifications consisting of the broadest patent class and the narrowest class. In the proposed technique, instead of using a specific classification corresponding to the middle layer, the target applications can be categorized into three technical fields by considering not only the primary classification but also the subsequent classifications. This point is a feature of the proposed technique. Although the author chooses the ATC and the ITC as the optimal combination for analyzing current technology in Japan, the combination can be changed depending on the times and the technology, and thus two classifications with appropriate spacing and hierarchical structure may be selected.

Proposed measurement technique

Using the hierarchical structure of the primary classification and subsequent classifications given to patent applications, the author explains the proposed technique of categorizing a set of patent applications into three technical fields by comparing technological classifications between patent applications for an invention examined by an examiner (“Subject Application”) and prior applications cited by an examiner in the examination of subject application (“Backward Citations”).

Explanation of hypothetical examples

Figure 3 shows hypothetical examples for explaining the measurement technique proposed in this article. In this figure, combinations of subject applications with three backward citations are shown as three patterns A–C. Each subject application is classified as ATC No. 7, which corresponds to the primary IPC classification assigned to each subject application. In contrast, the ATC classification of backward citations differs for each subject application. All backward citations of Pattern A are classified as the same ATC No. 7 as the subject application. For Patterns B and C, two of the three backward citations are classified as the same ATC No. 7 as subject application, but one backward citation is assigned a different ATC classification from its subject application.

Figure 3

The difference between the ATC No. 3 and No. 5 classification numbers does not make much sense here, and it is sufficient if they are different from the ATC classification of the subject application (No. 7 in this pattern). Rather, the point to be emphasized here is that in Pattern B, any one of the IPC subclasses of the backward citation (ATC No. 3 in this pattern) matches any of the IPC subclasses of subject application, whereas in Pattern C, the IPC subclass of backward citation (ATC No. 5 in this pattern) does not match any of the IPC subclasses of the subject application.

In summary, the differences between the three patterns are as follows. Pattern A is different from Patterns B and C in that the ATC Nos of all backward citations are the same as those of the subject application. Patterns B and C, in contrast, are similar in that at least one ATC No. of a backward citation differs from that of the subject application. However, Patterns B and C can be distinguished by comparing not only the primary classification but also the subsequent classifications.

According to the technique proposed in this article, the group data of a subject application can be categorized into the three patterns by comparing a subject application and its citations using two patent classifications with different hierarchies. Figure 4 shows the flowchart of the proposed technique. This figure shows a two-stage filtering process that compares a subject application and its backward citations using different hierarchies of patent classifications. The upper half of the figure shows the first filtering process using the ATC No. corresponding to the primary classification of the IPC subclass assigned in each application, and the lower half shows the second filtering process using subclasses of the IPC classification. In the second filtering, not only the primary classification but also all IPC classifications (including subsequent classifications) assigned to each application are targeted for comparison.

Figure 4

Extraction of subject applications falling into the same field

First, the first filtering process extracts subject applications one by one from a set of patent applications and compares subject applications and backward citations (usually multiple) with the ATC. Specifically, as shown in the upper half of Figure 4, comparisons are made using the ATC No. assigned to each application, and if they match (that is, if the judgment of “non-coincidence” is “No”), subject application is categorized into the Same Field. In contrast, if the comparison reveals at least one unmatched backward citation (that is, if the judgment of “non-coincidence” is “Yes”), analysis of the subject application can proceed to the second filtering process. This first filtering process may be done by analyzing with MySQL using the citation data of the IIP patent database.8

Distinguishing between subject applications falling into neighboring and distant fields

Next, the second filtering process is conducted for the set of applications extracted in the first filtering process. The second filtering process is performed, as shown in the lower half of Figure 4, by comparing “all IPC sub-classes including subsequent classes” assigned to the extracted subject applications and those assigned to backward citations. This is because, even if the subject application is different from backward citations at the level of the ATC, there is a possibility that the subject application has close proximity with any one of the backward citations at the level of the IPC subclass. In Pattern B of Figure 3, the subject application classified in ATC No. 7 differs from at least one other backward citation as far as its ATC No. is concerned. However, when compared with the IPC subclass of its backward citations, its subject application classified in IPC subclass B24A is consistent with one of its backward Citations.

To find such a combination of pattern B, if there is at least one of the backward citations that match the subject application at the level of the IPC subclass (that is, if the judgment of “non-coincidence” is “No”), the subject application is categorized into Neighboring Field. On the contrary, the subject application shown in Pattern C of Figure 3 is not only different from one of the backward citations categorized in ATC No. 5 but also different from its backward citation even when comparing with the IPC subclasses. The IPC subclasses A41B and B24A applied to this subject application do not match either of the IPC subclasses A21B and B01C applied to the backward citations. Thus, if there is no backward citation that matches the subject applications at the level of the IPC subclass (that is, if the judgment of “non-coincidence” is “Yes”), the subject application is categorized into Distant Field. This second filtering process is conducted using a commercial database with IPC patent classification data containing not only the primary classification but also subsequent classes of patent applications.

From the explanation mentioned earlier, it is understood that the proposed technique is different from those of previous studies in that it is characterized by (a) comparing the technical fields of not only the primary classification but also the subsequent classifications and (b) the use of the ATC having 33 classifications that are finer than IPC sections and broader than IPC classes. This is made possible by using two patent classifications without having a specific classification corresponding to the middle hierarchy in between, rather than using three patent classifications with different hierarchies. This proposed technique can be applied to compare not only between a subject application and backward citations but also between a subject application and forward citations (Hirose, 2021).

Data and method for analysis

In this section, after describing datasets, the author explains how the proposed technique to identify patent “proximity” to test the hypothesis.

Data

Preparation of preliminary data

To collect patent data related to inventions that promote NetZero, such as green technology and carbon neutrality, the author extracted the most frequently occurring natural languages and technical terms for NetZero from 5,249 articles published in the Nihon Keizai Shimbun from 2011 to 2021. Then, a search formula was created using the extracted terms to search for the Japanese patent applications filed in 2008.

The search formula is shown in Attachment II. As a result, 11,304 NetZero-related patent applications were extracted as preliminary data. In conducting the search, in order to prevent unintended search omissions due to the narrowing down of the IPC, the author avoided using patent classifications such as the IPC as much as possible. A method of searching a plurality of technical terms using proximity operators was adopted for the entire text of a patent specification. Therefore, it is undeniable that the search results may contain some level of noise, but a high priority was put on extracting inventions related to NetZero from a wide range of technical fields.

The preliminary data were grouped by IPC sections, and the grouped IPC sections were sorted in descending order of the number of patent applications filed in 2008. Therefore, IPC classes sorted in descending order of NetZero-related applications were prepared. Among the sorted IPC classes, those whose contents are related or close to each other are grouped together, and the top 50 NetZero-related technical fields are identified in descending order of the number of applications. Table 1 shows a list of the titles of the top 50 NetZero-related technical fields and the corresponding IPC classes.

Dataset of subject applications

Next, using the IPC classes corresponding to the top 50 NetZero-related technical fields, the patent applications that were filed in 2008 and have already been examined were searched. As a result, 67,302 patent applications with those IPCs in the first classification data were extracted. These patent applications were merged into one data table, including data on patent classifications, citation information, applicant information, and forward citations updated by March 26, 2022.

In this study, the analysis was carried out on Japanese patent applications filed in 2008. There are two reasons why this filing year was chosen. First, the year 2008 is considered to be the latest year for which the Japanese Patent Office has completed most of the examinations requested under the examination-on-demand system. Second, analyzing forward citations requires sufficient time necessary to measure the impact on subsequent applications. The filing year of 2008 refers to the actual filing year of the subject application. Thus, the filing year based on the priority date or the filing date of the patent application is not included. In the case of an international application, the year of the international filing date is adopted.

Dataset of backward citations

The total number of backward citations in the 67,302 subject patent applications was 375,851. Among them, Japanese patent publications including international patent publications accounted for the largest share at 91%, and the remaining 9% were utility model patent publications (4%), general technical publications (3%), and foreign patent publications (2%), as can be seen in Figure 5. As utility model publications and foreign patent publications have a certain proportion of old scan data from the time before patent system was digitized, and general technical publications are not classified, they were excluded from this survey on the premise that it is considered in Section “Discussion”.

Figure 5

To reflect as much as possible the examiner's evaluation of the invention in this analysis, prior patent publications that are mentioned as citations in the official communication but are not cited as the basis for refusal are also included as backward citations in this study.9 On the contrary, the number of backward citations cited in a patent application varies from patent application to patent application, and the number reaches nearly 50 at most. It is not necessarily desirable, however, to easily increase the number of citations when considering the efficiency of analysis. Figure 6 is a graph showing the distribution of the number of citations. The horizontal axis indicates the number of citations per application in order of citation, and the vertical axis indicates the number of patent applications. With reference to this figure, the average number of citations per application in this survey is calculated to be six. Since the database used in this search records the citations in the order of description of the official actions, the citations mentioned by the examiner as the basis for refusal in the examination are recorded at the beginning, and the citations cited as reference tend to be recorded later. For this reason, the analysis in this study limited citations to six per application.

Figure 6

Method for analysis

The author assumed that there is a mountain-shaped relationship (inverted U-shaped relationship) between the patent proximity of the claimed invention to the cited invention and the influence of the claimed invention on subsequent applications in Section Hypothesis. This means that a patent application in which a backward citation in Neighboring Field is cited in the examination is more cited in a subsequent examination than a patent application in which a backward citation in Distant Field or only the backward citation in Same Field is cited in the examination.

To test the hypothesis, the author uses the patent “proximity” between claimed invention and cited invention as an independent valuable and the impact of the subject application on subsequent applications as a dependent valuable. The proximity is represented by three technical fields consisting of “same,” “neighboring,” and “distant” according to the degree of matching of the technical fields of combined inventions using this proposed technique.

To investigate the impact of the subject application on subsequent applications, the author uses negative binomial regression for analysis of the value of the actual number of forward citations (“F-Citations”). This is because linear regression by ordinary least squares assumes that there is a linear relationship between the dependent and independent variables. However, in this analysis subjects, there is a possibility of non-linearity and, thus, it is necessary to deal with it. As a robustness check, the linear regression for the estimation of logarithm plus one of the dependent variable is performed. Furthermore, the number of claims in the patent application (“Claims”) and the number of pages in the patent specification (“Pages”) were added as control variables indicating the degree of disclosure in the specification.

In Section “Discussion”, the author compares the proposed method with Nemet (2012).

Forward citations

To consider whether the invention will be a seed for future technological development, the authors investigate the impact of the invention on subsequent patent applications. A number of studies have focused on the technological impact of inventions as a way to assess their importance for technological development (Ahuja and Morris Lampert, 2001; Rosenkopf and Nerkar, 2001; Phene et al., 2006). The number of forward citations, widely used as an indicator of the value and quality of patents, is a value that indicates how much the patent is cited by subsequent patents, and the higher this value, the more influential the patent and the higher the value of the patent (Carpenter et al., 1981; Albert et al., 1991; Harhoff et al., 1999).

Regarding forward citations, there are “applicant citations” that the applicant describes in his patent specification and Information Disclosure Statement10 and “examiner citations” that the examiner cites as the basis for reasons for refusal when examining an application. Although there have been some skeptical opinions about the usefulness of the examiner's forward citation (Meyer, 2000; Mihara, 2012), it has been reported in the U.S. that “examiner citation” has a stronger influence on patent renewal than “applicant forward citation” (Hegde and Sampat, 2009; Cotropia et al., 2013; Yasukawa, 2016) or that the applicant's citation and the examiner's citation may not have been distinguished.11 In Japan as well, it has been reported that “examiner citation” correlates with the importance of patents, rather than “applicant citation” (Wada, 2010; Yamada, 2010; Yasukawa, 2016, 2017). Therefore, in this article that analyzes Japanese patent applications, “examiner forward citation” is used to investigate the effect of the invention on subsequent patent applications.

Definition of technical terms pertinent to the model

In this article, the technical fields and the categories of subject application are defined as follows in relation to backward Citations:

Regarding patent applications and citations, “subject application” is defined as a patent application filed in 2008 and examined by the JPO and citing at least one patent application as prior art. “Backward citation” is defined as a prior patent application cited by an examiner in the examination of the Subject application.

Regarding the three technical fields, “Same Field” is defined as a technical field that is the same as that of the subject application in the ATC Nos. “Neighboring Field” is defined as a technical field that differs from the ATC No. of the subject application but is the same as at least one of the IPC subclasses given to the subject application. “Distant Field” is defined as a technical field that differs not only from the ATC No. of the subject application but also from any one of all IPC subclasses given to the subject application.

Results

In this section, the hypothesis was tested for the dataset consisting of 67,302 patent applications (hereinafter referred to as “DATASET”) belonging to the top 50 NetZero-related technical fields.

Results for extracting subject applications falling into same field

As a result of the first filtering process shown in the upper half of Figure 4, subject applications categorized in Same Field were separated from those categorized in Neighboring and Distant Fields. The number of subject applications categorized in Same Field reached 29,206, which accounts for 43% of the total applications of 67,302, whereas the number of subject applications in Neighboring and Distant Fields reached 38,093, which accounts for 57% of the total applications, as shown in Figure 7A.

Figure 7

Results for extracting subject applications falling into neighboring and distant fields

Furthermore, as a result of the second filtering process shown in the lower half of Figure 4, subject applications categorized in Neighboring Field were separated from those categorized in Distant Fields. The number of subject applications categorized in Neighboring Field reached 21,995, which accounts for 33% of the total applications, whereas the number of subject applications in Distant Fields reached 16,098, which accounts for 24% of the total applications.

This means that, considering only the primary classification assigned to subject applications and their Cited Applications, the sum of Neighboring and Distant Fields accounts for 57% of the total applications examined shown in Figure 7A. On the contrary, Neighboring Field accounts for 58% of the 57% when considering subsequent classifications as well, as shown in Figure 7B.

Advantages of including subsequent classifications

Table 2 shows the results of the subject applications categorized by the second filtering process, after being extracted by the first filtering process. The columns show the groups of citations in each application, which are shown in sets (Groups 1–6), in order of their citation, with up to six citations covered in this analysis. Results are shown for each of the six groups. The results are shown for Neighboring and Distant Fields according to the conditions shown in Figure 3 for each of the six groups. The point of focus here is the results indicated by the bold frame which take the subsequent classifications into consideration. For the result of non-coincidence when comparing only the first classification, it becomes possible to divide into Neighboring Field (26% of the total) and Distant Field (31% of the total) by adding subsequent classifications to the target of comparison. The number shown in the total column becomes the number shown in the “Remove Duplicates” column by subtracting the counted number of duplicates in multiple references to the same application.

Table 2

CategoriesComparison of the subject patent application with each citationCitations cited by the subject patent applicationTotalPct. (%)Remove duplicates
Between first classificationsBetween subsequent classificationsGroup 1Group 2Group 3Group 4Group 5Group 6
Neighboring fieldCoincidence (n*)7,9617,5486,8585,6224,2193,22235,4304321,995
Non-coincidenceMatch at least one4,6834,693 4,1183,4692,6871,94021,59026
Distant fieldNone match4,6395,5755,1214,3153,4202,50225,5723116,098
Total17,28317,81616,09713,40610,3267,66482,59210038,093

Table showing the results categorized by the second filtering process.

n*: The “coincidence” corresponds to one of citations, with one of the other five citations having a first classification of “non-coincidence”.

Examples of patent applications categorized into three technical fields

Table 3 shows examples of patent applications categorized as “Same,” “Neighboring,” and “Distant” fields according to the proposed technique. The left-side columns show the patent applications (“subject applications”), and the right-side columns show the backward citations cited in the prosecution of subject applications (“backward citations”). The number of backward citations is listed in the order of citation of the examination and is limited to six. In each subject application and backward citation, the first classification in the given IPC sections and corresponding ATC Nos. are indicated by a bold border. To the right of the ATC column are all IPC subclasses assigned to each application and citation.

Table 3

Subject applicationsBackward citations
No.Application No.Short titleIPCATCIPC Sub-classesNo.Application No.IPCATCIPC_Sub-classes
Same field12008-077964Tungsten oxide photocatalystB6B01J, B01DC11999-252707B31B01J, A61L, B01D, B01J
C21999-252707B31B01J, A61L, B01D, B01J
22008-312844Variable valve timing controller for internal combustion engineF23F02DC12000-364588F23F02D, F01L
C22002-281495F23F01L, F02D
C32005-326262F23F01L
32008-230322Lithium-ion storage deviceH31H01MC12005-259447H31H01M
C22003-131274H31H01M
C32003-419037H31H01M
C41998-179009H31H01M
42008-027941Water heater using heat pumpF25F25B, F24F, F24HC12004-303077F25F25B
C22007-049040F25F25B, F24H
C32000-184513F25F25B
C42000-184513F25F25B
C52004-025120F25F25B
C62001-024845F25F25B
Neighboring fields52008-022649CO2 recovery apparatusC12, C12006-111302C12,
C21990-115393B6
C31989-288559B6,
C42000-173353B6,
C51991-257666B6, C02F
C62007-047421B6
62008-020526Power supply system using onboard storage batteryH31H02J, C11999-171641H31H02J,
C22001-088042B10, G07F
C31998-179111B10, B60S, G01R, G01V, G06F
C42000-334844G28G06F, , B60S, H01M
C52005-181781H31H02J,
72008-058183A method for regenerating an amine liquidC13, , , C11992-289047B6, , , C10G
C21992-226969C13
C32001-207991B6, , F01K, G21C
C41996-211190C12
C52001-502951B6,
C61994-084916C13
Distant fields82008-284960An artificial algae bed constructionA1A01G, A01KC12004-119188A1A01K, B65D, , E02B
C21999-239179A1A01K, B09B
C31987-154283E21,
C41997-233809E21E02B,
C52004-302144A1A01K
C62001-093906A1A01K, A01G
92008-184526Charging system for electric vehiclesH31C11996-252494H31H02J,
C21997-64927B10B01D, B60K, B60R,
C31993-202726H31H01R, B60K, B60R, H01M
C41993-12448G28G07F, H02B,
C51998-53923G28, , ,
C61996-146433G28,
102008-109610A power management systemG28G06Q, , G06QC12002-224552G28G08C, G01D, G06F, G08C
C22001-281302G28G05B, G06F
C32000-358974G27G01R,
C41998-147097F25
C52006-271257 H31, G01R, H02J
C62007-019002H31, G01R

Table showing examples of patent applications categorized into three technical fields according to the proposed technique.

Yellow marking indicates that the IPC section differs between the subject application and the backward citation. Red marking indicates that the ATC No. differs between them.

Subject applications listed in rows Nos. 1–4 are examples of patent applications categorized as Same Field. A set of subject applications and backward citations that satisfy the conditions of Pattern A in Figure 3 is shown. This means that the ATC Nos. of all backward citations are the same as those of subject applications.

Subject applications listed in rows Nos. 5–7 are examples of patent applications categorized as Neighboring Fields. According to the conditions of Pattern B in Figures 3, 4, combinations of applications and citations satisfying the following two conditions are shown:

  • a) At least one ATC No. of a Cited Application differs from that of subject application.

  • b) In the combination that satisfies condition (a), there is at least one of the backward citations that matches subject application at the level of the IPC subclass (Corresponding IPC subclasses are indicated in blue). A clear example is given in row No. 6. In this case, there are two citations, C2 and C3 (Application Nos. 1998-53923 and 1998-145433), both of which have the same ATC number (No. 10). They are different from the ATC number (No. 31) of the subject application (Application No. 2008-020526), but they are common in the level of IPC subclass (B60L).

Subject applications listed in rows Nos. 8–10 are examples of patent applications categorized as Distant Fields. According to the conditions of Pattern C in Figures 3, 4, combinations of applications and citations satisfying the following two conditions are shown:

  • a) At least one ATC No. of a Cited Application differs from that of subject application.

  • b) In the combination that satisfies condition (a), there is no backward citation that matches subject application even at the IPC subclass level (Corresponding IPC subclasses are indicated in red). Row No. 9 provides a clear example of this. In this case, there are two citations, C5 and C6 (Application Nos. 1998-53923 and 1998-145433), both of which have the same ATC number (No. 28). They are different not only from the ATC number (No. 31) of the subject application (Application No. 2008-184526) but also from subject application in the level of their IPC subclasses.

In the previous examples, pay attention to the combinations of applications and citations determined to be Neighboring Fields (Corresponding IPC subclasses are indicated in blue). These combinations were determined to be “Neighboring” fields by considering all subsequent classifications into consideration. All of these examples are combinations that would be categorized as “Distant” fields if judged only with the first classification.

Nemet and Johnson (2012) allowed the use of the IPC in place of the U.S. System (Nemet and Johnson, 2012, p. 14). Even if IPC sections are used here, instead of ATC, as indicated by the black border in Table 3, the problems in Neighboring Fields are not resolved. A solution to this is to consider subsequent classifications, as emphasized in this proposal. This is the reason why the proposed technique is better than the ones in the previous method. The quantitative effect of using subsequent classifications is revisited in Section “Discussion”.

Analysis of the entire dataset

Table 4 shows the result of negative binomial regression (“NB Regression”) analysis in assessing the effect of the combination with technology in Neighboring and Distant Fields on the actual number of forward citations (“F-Citations”). Table 5 shows the summary of the variables used therein. The evaluation results show that the NB Regression coefficient increases at the significance level of 1% or less as the technical field shifts from Same Field to Neighboring Field, whereas it decreases as the technical field shifts to Distant Field.

Table 4

(1)
VariablesWhole
Neighbor0.113***
(0.0115)
Distant−0.0135
(0.0127)
Claims0.00722***
(0.000813)
Pages0.00282***
(0.000237)
Constant0.632***
(0.00986)
Observations67,302
Pseudo R20.00198

Result of negative binomial regression.

Note: Standard error in parentheses, Significance level: ***p < 0.01, **p < 0.05, *p < 0.1.

Table 5

(1)(2)(3)(4)(5)
VariablesNMeanSDMinMax
Pages67,30221.1526.4631,012
Claims67,3027.8576.8851128
F_Citations67,3022.1993.2810254
Neighbor67,3020.3270.46901
Distant67,3020.2390.42701

Summarizing variables of main analysis.

In Table 4, NB Regressions of Claims and Pages added as control valuables are also increased at a significant level of 1%. This is probably because the number of claims and the number of pages in the specification affect the degree of disclosure of the inventions. On the contrary, it is one order of magnitude smaller than the coefficients of Neighboring and Distant Fields, and the author believes that the influence of changes in the technological field is small.

This result of regression analysis can also be seen in the line graph of Figure 8, which shows the relation between the NB Regression coefficient and proximity between claimed invention and cited invention. The proximity is shown as three technical fields consisting of Same, Neighboring, and Distant along the horizontal axis. The NB Regression coefficient is represented by the scale shown in the left vertical axis. As understood from the figure, the line graph of the NB Regression coefficient shows a mountain-shaped pattern, increasing as the technical field shifts from “Same Field” to “Neighboring Fields” and decreasing as the technical field shifts from “Neighboring Fields” to “Distant Fields”. Therefore, from the overall perspective of the DATASET, the Hypothesis is affirmed.

Figure 8

As a robustness check, the author also performed linear regressions for the estimation of logarithm plus one of the dependent variables (“F-Citations”). The results of this additional analysis are shown in Table 6. Table 7 shows the summary of the variables used therein. The comparison of the additional analysis with the main analysis shows that the results tend to be robust as the results of the former are substantially similar to those of the latter and significance levels remain stable.

Table 6

(1)
VariablesWhole
Neighbor0.258***
(0.0293)
Distant−0.0194
(0.0321)
Claims0.0152***
(0.00195)
Pages0.00557***
(0.000508)
Constant1.882***
(0.0241)
Observations67,302
R20.006

Result of ordinary least squares regression (OLS).

Note: Standard error in parentheses, Significance level: ***p < 0.01, **p < 0.05, *p < 0.1.

Table 7

(1)(2)(3)(4)(5)
VariablesNMeanSDMinMax
Pages67,30221.1526.4631,012
Claims67,3027.8576.8851128
F_Citations67,3022.1993.2810254
Neighbor67,3020.3270.46901
Distant67,3020.2390.42701

Summarizing variables of additional analysis.

Analysis by technical field

Next, Hypothesis was tested using the NB Regression analysis on the DATASET divided by the top 50 technical fields to analyze the impact of the subject application on subsequent applications. The results of the NB Regression analysis are shown in Table 8. The summary of the variables used therein is shown in Appendix III. As understood from the comparison of the results for each of the fields of the IPC, the relationship between the results of the NB Regression analysis and the three technical fields shows unique tendencies for each subclass of the IPC. As a subject of verification, 25 fields in which the coefficient of either Neighboring or Distant Fields has a significance level of 10% or less were selected from Table 8.

Table 8

No.1No.2No.3No.4No.5No.6No.7No.8No.9No.10
VariablesH01M8 et al.H01L31F03DG21C10L et al.B01DB01JB32BB29CC08G
Neighbor0.02230.03460.891***0.146−0.1120.192**0.1480.1200.203***−0.120
(0.0705)(0.0829)(0.274)(0.151)(0.0962)(0.0778)(0.0988)(0.0966)(0.0676)(0.0748)
Distant−0.0612−0.02380.1510.171−0.04470.0870−0.227**−0.179*0.184**−0.105
(0.0875)(0.0835)(0.243)(0.158)(0.0936)(0.0981)(0.112)(0.0990)(0.0729)(0.0824)
Claims−0.003620.002750.01990.0283**−0.006810.004040.00292−0.0124**0.00172−0.00503
(0.00605)(0.00462)(0.0139)(0.0124)(0.00564)(0.00581)(0.00545)(0.00603)(0.00528)(0.00499)
Pages−0.004040.00222*−0.00213−0.005680.0151***0.00541*0.0153***0.0201***0.0178***0.00759***
(0.00362)(0.00116)(0.0172)(0.00669)(0.00403)(0.00315)(0.00369)(0.00370)(0.00374)(0.00220)
Constant0.748***0.903***0.08830.1390.404***0.368***0.246**0.709***0.09120.759***
(0.0692)(0.0671)(0.272)(0.133)(0.0815)(0.0779)(0.108)(0.107)(0.0703)(0.0712)
Observations1,7031,4001324111,0901,2951,1501,4232,1571,464
Pseudo R20.0005000.0009310.02820.005650.004340.002660.01070.008470.006070.00256
No.11No.12No.13No.14No.15No.16No.17No.18No.19No.20
VariablesC08LC08JC08FC07DC12N et al.C01BC07CC04BC03B et al.C09D et al.
Neighbor0.124**0.175*0.298***0.228***−0.08460.212*0.0862−0.001650.139−0.0602
(0.0578)(0.0984)(0.0893)(0.0695)(0.147)(0.110)(0.0896)(0.0868)(0.122)(0.0886)
Distant−0.0589−0.178*0.1290.240**0.04720.168−0.00624−0.245**0.164−0.222**
(0.0647)(0.100)(0.0937)(0.106)(0.152)(0.115)(0.106)(0.110)(0.141)(0.0897)
Claims−0.00979**0.006880.008930.0009740.0105*0.0114*0.00921**−0.007320.0215**0.00670
(0.00441)(0.00695)(0.00664)(0.00283)(0.00612)(0.00612)(0.00446)(0.00745)(0.00999)(0.00520)
Pages0.00994***0.0139***0.00956***0.00352***0.00388**0.0145***0.00326***0.0163***0.0297***0.0173***
(0.00225)(0.00336)(0.00226)(0.000417)(0.00194)(0.00457)(0.000765)(0.00536)(0.00848)(0.00248)
Constant0.810***0.494***0.363***−0.08060.1940.225*0.160*0.482***0.08730.677***
(0.0593)(0.106)(0.0897)(0.0723)(0.132)(0.117)(0.0822)(0.0974)(0.156)(0.0931)
Observations2,3591,0891,1582,0955871,0101,4909497531,335
Pseudo R20.002840.009390.01020.01520.006570.008130.007580.004470.009830.0132
No.21No.22No.23No.24No.25No.26No.27No.28No.29No.30
VariablesC22CC23CF24H et al.F24FH01M10H02JB65DF02D et al.B60W et al.H02K
Neighbor−0.148*0.137*0.02930.06250.08410.454***0.0708−0.180**0.277***−0.105
(0.0795)(0.0772)(0.0736)(0.0790)(0.0593)(0.0816)(0.0633)(0.0802)(0.0618)(0.0787)
Distant−0.328*−0.0421−0.0402−0.0485−0.02780.221***−0.185**−0.1070.210***−0.135
(0.169)(0.0881)(0.0699)(0.0696)(0.0712)(0.0857)(0.0757)(0.0894)(0.0750)(0.0828)
Claims−0.00934−0.008080.0008550.0270***0.0120***−0.00646−0.005620.005880.005720.0216***
(0.00923)(0.00571)(0.00696)(0.00765)(0.00433)(0.00557)(0.00569)(0.00748)(0.00750)(0.00699)
Pages0.0150**0.0182***0.0121***−0.001900.00760***0.0330***0.0209***0.0210***0.0166***0.00617
(0.00585)(0.00408)(0.00379)(0.00373)(0.00205)(0.00463)(0.00506)(0.00435)(0.00370)(0.00425)
Constant0.904***0.321***0.624***0.656***1.185***0.737***0.187***0.177**0.590***0.526***
(0.100)(0.0877)(0.0670)(0.0722)(0.0476)(0.0875)(0.0649)(0.0753)(0.0766)(0.0676)
Observations1,0181,4601,6951,4042,1481,1612,2741,5821,8451,699
Pseudo R20.003330.004800.002140.002930.003790.01570.003620.007570.006350.00402
No.31No.32No.33No.34No.35No.36No.37No.38No.39No.40
VariablesH02PH02MG06Q50 et al.G01N27 et al.G02B1 et al.G09GG03FC02FB09B et al.F01N
Neighbor0.324***0.03560.182**0.04520.218***0.173**−0.1170.08890.2030.0398
(0.0938)(0.0862)(0.0739)(0.0537)(0.0522)(0.0812)(0.0783)(0.0843)(0.171)(0.108)
Distant−0.215**−0.1070.0618−0.137***0.118**−0.0782−0.0851−0.1750.1250.116
(0.105)(0.0912)(0.0576)(0.0532)(0.0601)(0.103)(0.102)(0.111)(0.185)(0.133)
Claims0.004160.0133*−0.0004060.00505*0.002300.0168***0.0161***−0.002790.00438−0.00768
(0.00870)(0.00743)(0.00334)(0.00282)(0.00439)(0.00646)(0.00623)(0.00687)(0.0134)(0.00865)
Pages0.0188***0.006130.00524***0.00549***0.0140***−0.002460.0115***0.0205***0.0192**0.0164**
(0.00666)(0.00446)(0.00199)(0.00140)(0.00185)(0.00155)(0.00137)(0.00579)(0.00805)(0.00706)
Constant0.351***0.740***0.712***0.471***0.431***0.342***0.542***0.515***−0.06490.620***
(0.117)(0.0755)(0.0575)(0.0433)(0.0550)(0.0905)(0.0887)(0.0982)(0.187)(0.119)
Observations9171,2002,3363,0153,1711,6451,230902447673
Pseudo R20.009720.002780.001490.003880.007850.003100.01710.006340.006840.00281
No.41No.42No.43No.44No.45No.46No.47No.48No.49No.50
VariablesH05B33G09F9A01G et al.F16H61 et al.E02D et al.C258 et al.G08G et al.E048 et al.D01F et al.B058 et al.
Neighbor−0.02081.161***0.217**0.225**0.220***0.05330.0573−0.06240.361***0.436***
(0.113)(0.277)(0.0977)(0.0888)(0.0790)(0.131)(0.0874)(0.0910)(0.0767)(0.127)
Distant−0.02530.672**0.292***−0.01720.122−0.331**−0.00799−0.1280.282***0.198
(0.119)(0.286)(0.103)(0.111)(0.0839)(0.152)(0.0973)(0.0842)(0.0756)(0.121)
Claims0.008640.01880.0129*0.00243−0.01240.0002840.00983−0.006060.00975**0.00124
(0.00904)(0.0133)(0.00690)(0.0107)(0.00972)(0.0105)(0.00687)(0.00986)(0.00449)(0.00929)
Pages0.00693*0.001510.00524***0.0238***0.0178***0.0281***0.0185***0.0222***0.0184***0.0248***
(0.00373)(0.00756)(0.00165)(0.00541)(0.00605)(0.00868)(0.00528)(0.00569)(0.00394)(0.00640)
Constant0.709***−0.1070.132*−0.1200.180**0.1160.646***0.0543−0.00797−0.172
(0.121)(0.294)(0.0788)(0.110)(0.0838)(0.138)(0.108)(0.0759)(0.0784)(0.126)
Observations6326061,0061,0261,4224799791,7031,673904
Pseudo R20.002360.01060.008950.008640.003880.01260.006170.003730.01170.0125

Tables showing the results of negative binominal regression for hypothesis.

Standard errors in parentheses.

***p < 0.01.

**p < 0.05.

*p < 0.1.

Figures 9A–D are line graphs showing the change in the NB Regression coefficient for the 25 selected fields. They are grouped by four patterns I–IV depending on the technical fields of “Same Field” through “Distant Fields”: Pattern I shown in Figure 9A is the upward inclined pattern. Pattern II shown in Figure 9B is the mountain-shaped pattern. Pattern III shown in Figure 9C is the valley-shape pattern, and Pattern—shown in Figure 9D is the downward inclined pattern. Among these patterns, Pattern II corresponds to the technical fields that satisfy Hypothesis. From the comparison results for Patterns I through IV, it can be seen that there are many mountain-shaped patterns (Pattern II) in the analysis of the NB Regression coefficient. For example, the natural resins (C09F) listed as No. 42, as shown in Figure 9F, and the wind motors (F03D) listed as No. 3 indicate that the NB Regression coefficient increases at the significance level of 1% or less as the technical field shifts from Same Field to Neighboring Field but decreases as the technical field shifts from Neighboring Field to Distant field. Twenty ATCs, including these two, are shown in Figure 9B as technology fields that demonstrate the mountain-shaped pattern (Pattern II) that satisfies the hypothesis.

Figure 9

It can be seen that there is one example of an upward-inclined pattern (Pattern I) in Figure 9A. This is the cultivation of vegetable, rice, seaweed, forestry, and watering (A01G) listed as No. 43, which shows that the NB Regression coefficient increases at the significance level of 5% or less as the technical field shifts from Same Field to Distant Field, as shown in Figure 9E. This A01G was the only classification showing the upward-inclined pattern among the top 50 technical fields. As an example of the downward-inclined pattern (Pattern IV), Figure 9H shows the alloys (C22C) listed as No. 21 where the NB Regression coefficient decreases at the significance level of 10% or less as the technical field shifts from Neighboring Field to Distant Field. As an example of the valley-shaped pattern (Pattern III), Figure 9G shows the controlling combustion engines (F02D) listed as No. 28 where the NB Regression coefficient decreases at the significance level of 5% or less as the technical field shifts from Same Field to Neighboring Field.

To confirm the content of patent applications shown in the top 50 NetZero-related technical fields, the author reviewed several patent applications from them. A summary of 11 patent applications, taken as examples from six representative technical fields, is provided in Appendix V.

There are two reasons for analyzing the results by patent classifications that represent the technical fields. First, when conducting empirical studies using patent information, it is generally agreed upon to take into account that different technical fields have different views on the value of patents. It is generally agreed upon to take into account that different technical fields have different views on the value of patents (Mihara, 2012, p. 740). This is because there are technological fields such as chemistry and biotechnology where a small number of patents determine value and fields such as software and semiconductors where thousands of patents are intricately intertwined with a single product (Mihara, 2012, p. 740). Second, it is for comparing the method proposed in this article with those of the three previous studies dealt with in the first section. Nemet and Johnson (2012) mainly dealt with technical fields related to electricity and pharmaceuticals. Keijl et al. (2016) relied on the biopharmaceutical industry, and Nemet (2012) concerned energy technology, all of which target different technical fields.

Analysis by IPC sections

Furthermore, Hypothesis was tested using the NB Regression analysis on the datasets divided into eight sections of IPC to analyze the impact of the subject application on subsequent applications. The results of the NB Regression analysis are shown in Table 9. Table 10 shows the summary of the variables used therein. As understood from the comparison of the results for each section of the IPC, the relationship between the result of the NB Regression and the three technical fields shows unique tendencies for each section. It can be seen from the figure that IPC Sections B (performing operations and transporting), C (chemistry and metallurgy), D (textiles and paper), G (physics), and H (electricity) demonstrate the mountain-shaped pattern (Pattern II) that satisfies the hypothesis. They show that the NB Regression coefficient increases as the technical field shifts from Same Field to Neighboring Field at the significance level of 1% or less but decreases as the technical field shifts from Neighboring Field to Distant Field.

Table 9

(1)(2)(3)(4)(5)(6)(7)(8)
VariablesBest50_ABest50_BBest50_CBest50_DBest50_EBest50_FBest50_GBest50_H
Neighbor0.217**0.264***0.0668***0.361***0.112*0.04580.173***0.108***
(0.0977)(0.0281)(0.0219)(0.0767)(0.0594)(0.0367)(0.0258)(0.0286)
Distant0.292***0.0684**−0.03490.282***0.0007790.002890.0298−0.0373
(0.103)(0.0314)(0.0249)(0.0756)(0.0595)(0.0371)(0.0283)(0.0312)
Claims0.0129*−0.00548**0.001510.00975**−0.01010.00681**0.00394**0.0120***
(0.00690)(0.00231)(0.00131)(0.00449)(0.00694)(0.00334)(0.00172)(0.00219)
Pages0.00524***0.0209***0.00131***0.0184***0.0205***0.0114***0.00788***0.00611***
(0.00165)(0.00159)(0.000253)(0.00394)(0.00416)(0.00198)(0.000714)(0.000966)
Constant0.132*0.253***0.695***−0.007970.112**0.445***0.522***0.830***
(0.0788)(0.0301)(0.0191)(0.0784)(0.0564)(0.0359)(0.0254)(0.0244)
Observations1,00611,49519,1061,6733,1256,64413,39310,860
Pseudo R20.008950.007860.0008540.01170.003110.002350.004380.00296

Result of negative binomial regression for IPC sections.

Standard errors in parentheses.

***p < 0.01.

**p < 0.05.

*p < 0.1.

Table 10

IPC sections(1)(2)(3)(4)(5)
VariablesNMeanSDMinMax
APages1,00620.2129.193498
Claims1,0066.7376.007161
F_Citations1,0061.6112.404027
Neighbor1,0060.2740.44601
Distant1,0060.2290.42001
BPages11,49515.949.7953220
Claims11,4956.8995.9461112
F_Citations11,4952.0062.843053
Neighbor11,4950.3970.48901
Distant11,4950.2660.44201
CPages19,10628.7241.9931,012
Claims19,1069.1548.1261115
F_Citations19,1062.1513.115055
Neighbor19,1060.3870.48701
Distant19,1060.2450.43001
DPages1,67316.539.4464105
Claims1,6737.1557.065179
F_Citations1,6731.7912.533026
Neighbor1,6730.2730.44501
Distant1,6730.2960.45701
EPages3,12512.716.9633132
Claims3,1255.0604.014145
F_Citations3,1251.4201.958027
Neighbor3,1250.1960.39701
Distant3,1250.2040.40301
FPages6,64415.538.9353177
Claims6,6445.9834.877170
F_Citations6,6441.9722.503026
Neighbor6,6440.2230.41601
Distant6,6440.2220.41601
GPages13,39322.5220.543932
Claims13,3938.7597.1521128
F_Citations13,3932.2623.8540254
Neighbor13,3930.3500.47701
Distant13,3930.2580.43801
HPages10,86018.3416.673600
Claims10,8607.6386.111187
F_Citations10,8602.8903.943056
Neighbor10,8600.2330.42301
Distant10,8600.1900.39201

Summarizing variables.

Figures 10A–H are graphs showing the results in Table 9. From these figures, it is understood that all IPC sections, except Section A, show the mountain-shaped pattern (Pattern II) that satisfies the hypothesis. Also, as mentioned earlier, the five IPC Sections B–D, G, and H have relatively small standard errors.

Figure 10

Discussion

In this section, after reviewing the results of testing the hypothesis, the author examines the differences between the previous studies mentioned in Section Previous literature and the proposed technique.

Results of testing the hypothesis presented in this article

The hypothesis was that there was a mountain-shaped relationship (an inverted-U-shaped relationship) between the patent proximity between claimed invention and cited invention and the impact of the subject application on subsequent applications. From the overall perspective of all patent applications filed in 2008 (refer Tables 4, 6; Figure 8), the hypothesis was supported. From the perspective of view of each IPC section, IPC Sections B–D, G, and H demonstrate the mountain-shaped pattern (Pattern II) that satisfies the hypothesis. The NB regression coefficients tend to increase as the technology field shifts from Same Field to Neighboring Field at the significance level of 1% or less for the abovementioned five sections of the IPC but tend to decrease as the technical field shifts from Neighboring to Distant Field (refer Table 9; Figures 10A–H).

The hypothesis was also tested for the dataset divided by the top 50 NetZero-related technical fields to analyze the impact of the subject application on subsequent applications (refer Table 8). As a subject of verification, it was found that there were 20 technical fields demonstrating the mountain-shaped pattern (Pattern II) that satisfies the hypothesis.

The NB Regression coefficient increases at the significance level of 10% or less as the technical field shifts from Same Field to Neighboring Field but decreases as the technical field shifts from (refer Nos. 3, 6, 9, 11–14, 16, 22, 26, 29, 31, 33, 35, 36, 42, 44, 45, 49, and 50 of Table 8; Figure 9B).

Comparison with previous method

In considering the differences between the previous method mentioned in Section Previous literature and the technique proposed in this article, the author focuses on the study by Nemet (2012). This is because Nemet (2012, p. 1261) restricted the analysis to patents defined as energy technologies according to the list of energy classes suggested by Popp and Newell (2009). The energy technology patents analyzed by Nemet (2012) are included in the NetZero-related patent group analyzed in this article at a relatively high rate compared to the other two previous studies. From the comparison results of the search formulas shown in Appendix IV, it can be understood that many of the energy technology-related patents targeted by Nemet (2012) are included in the search conditions of this article.

It is understood that Nemet (2012) followed the same method as Nemet and Johnson (2012) and used only the primary classification and not the subsequent classifications. Nemet and Johnson (2012) coded each citation pair as “Far external,” “External,” and “Near” by comparing their classification at each level in the hierarchy. Since they allow the use of the IPC in place of the U.S. System (Nemet and Johnson, 2012, p. 14), IPC sections are used here as a superclass denoting “far external” and subclasses of the IPC as the converse denoting “near”. Therefore, using the analysis results of the technique proposed in this article, the author examines the difference in the results when analyzing using the IPC section and subsections.

Comparison on “far external”

Tables 11A, B show the results of the subject patent application categorized into Distant Fields by the second filtering process (refer Figure 5). As shown in Table 2, the groups of citations cited in each application are shown in sets (Groups 1–6), in order of their citation, with up to six citations covered in this analysis. Results are shown for each of the six groups.

Table 11

(A)
CategoriesComparison of the subject patent application with each citationCitations cited by the subject patent applicationTotalPct. (%)
Between first classificationsSubsequent classifications of citationsGroup 1Group 2Group 3Group 4Group 5Group 6
Distant fieldNon-coincidence3,5544,3654,0283,4192,7231,97820,06778
CoincidenceNon-considered1,0851,2101,0938966975245,50522
Total4,6395,5755,1214,3153,4202,50225,572100
(B)
CategoriesBetween first classificationsSubsequent classifications of citationsCitations cited by the subject patent applicationTotalPct. (%)
Group 1Group 2Group 3Group 4Group 5Group 6
Distant fieldNon-coincidence3,0083,6583,3362,8792,2971,65016,82866
CoincidenceConsidered1,6311,9171,7851,4361,1238528,74434
Total4,6395,5755,1214,3153,4202,50225,572100

Result of subject application categorized into distant fields.

A patent application categorized into Distant Fields must have a citation whose ATC classification is different from that of the subject application and whose IPC subclasses do not match any of the IPC subclasses of the subject application. For patent applications satisfying this condition, the author searched for citations whose IPC section of the first classification showed a match between the subject application and the citations. As a result, it turned out that about 22% corresponded as shown in Table 11A. This is mainly due to the differences in the hierarchical structure and breadth of classification between the ATC with 33 classifications and the IPC sections with 8 classifications. Next, as a result of conducting a search that included the subsequent classifications of the citations, the matching rate of the IPC sections increased to 34% as shown in Table 11B. This is obviously due to considering subsequent classifications as well as the first classifications. This search result reveals that the superclass denoting the “far external” category of Nemet (2012) partially overlaps Neighboring Fields as well as Distant Fields defined in the proposed technique.

Comparison with “near”

Tables 12A, B show the result of the subject patent application categorized into Neighboring Fields by the second filtering process (refer Figure 5). As shown in Table 2, the citations for each application are shown divided into the six groups covered in this analysis.

Table 12

(A)
CategoriesComparison of the subject patent application with each citationCitations cited by the subject patent applicationTotalPct. (%)
Between first classificationsSubsequent classifications of citationsGroup 1Group 2Group 3Group 4Group 5Group 6
Neighboring fieldNon-coincidence12,49712,14310,8929,0166,8575,16256,56799
CoincidenceNon-considered1479884754904531
Total12,64412,24110,9769,0916,9065,16257,020100
(B)
CategoriesBetween first classificationsSubsequent classifications of citationsCitations cited by the subject patent applicationTotalPct. (%)
Group 1Group 2Group 3Group 4Group 5Group 6
Neighboring fieldNon-coincidence4,6834,6934,1183,4692,6871,94021,59038
CoincidenceConsidered7,9617,5486,8585,6224,2193,22235,43062
Total12,64412,24110,9769,0916,9065,16257,020100

Result of subject application categorized into neighboring fields.

A patent application categorized into Neighboring Fields must have a citation whose ATC classification is different from that of the subject application, but at least one of whose IPC subclasses matches any of the IPC subclasses of the subject application. For a group of six citations satisfying this condition, the author searched for citations whose IPC subsection of the first classification showed a match between the subject application and the citations. As a result, it turned out that only 1% corresponded, as shown in Table 12A. Next, as a result of conducting a search that included the subsequent classifications of the citations, the matching rate of the IPC subsection increased from 1 to 62%, as shown in Table 12B. This is obviously due to considering subsequent classifications as well as the first classifications. This search result reveals that the IPC subclasses denoting the “near” category of Nemet (2012) partially overlap Same Field as well as Neighboring Fields defined in the proposed technique.

Potential impact on analytical results

From the abovementioned two comparison results, it is considered that the superclass denoting “far external” and IPC subclasses denoting “near” shown in Nemet (2012) are shifted toward the Same Field side more than the Distant Fields and Neighboring Fields defined in this proposal. This becomes clear by considering the subsequent classifications in the comparison, although it is not clear from the comparison of the first classifications alone.

Unlike Nemet and Johnson (2012) and Nemet (2012) showed that for energy technologies, the integration of technologically distant prior art (i.e., distant knowledge) has a stronger positive effect on knowledge flows than the integration of technologically near prior art (i.e., local knowledge). However, considering the possibility that the analysis results will shift from the distant prior art to the near prior art, there is a possibility that the results of Nemet (2012) will be higher in the Neighboring Fields than in the Distant Fields, as in the analysis results of this proposal.

Examination of backward citations excluded from analysis

As mentioned in Section Data, this study does not analyze utility models, general technical documents, and foreign patent documents as backward citations. This subsection examines the proportion of these non-targeted publications in each technical field. Figure 11 is bar graphs showing the proportion of these three types of non-targeted publications to the total for each ATC field. From these graphs, it can be seen that in some technical fields, the proportion of non-targeted publications to the total is relatively high. For example, in ATC 13 (organic chemistry) and ATC 16 (biotechnology), the proportion of citing general technical publications is comparatively high, reaching 10 and 29%, respectively. In ATC 13 (organic chemistry) and ATC 16 (biotechnology), the proportion of citing general technical publications is comparatively high, reaching 10 and 29%, respectively. In addition, in ATC 1 (agriculture), ATC 11 (container and wrapping), and ATC 21 (construction), the proportion of citing utility model patent publications is high, reaching 16, 19, and 10%, respectively. Therefore, the author holds off on drawing conclusions about the results of analysis in these five technical fields.

Figure 11

Conclusion

Summary

This study showcases a technique to categorize NetZero-related patent applications into three technical fields according to the degree of proximity between claimed invention and cited inventions by comparing technological classifications between the patent applications and the cited applications thereof. In showing this method, the author explains the modified technique proposed by the author as well as the existing methods used in previous studies.

The proposed technique is different from those of previous studies in that it is characterized by comparing the technical fields of not only the primary classification but also the subsequent classifications. This is made possible by using two patent classifications without having a specific classification corresponding to the middle hierarchy in between, rather than using three patent classifications with different hierarchies. This technique reduces the possibility that two applications, even if they are the same in their subsequent classification, will be judged as applications in different technical fields because they are in different classes in the primary classification.

Using the proposed technique, the author examined the impact on the subsequent patent application of NetZero-related patent applications filed in Japan. As a result of the analysis, the author found that approximately 33% of subject applications, whose technical field differs from the backward citations when comparing the primary classification only, match one of the subsequent classifications when comparing them in consideration of the subsequent classifications as well. The author then found that the 33% of subject applications had a greater impact on the subsequent patent applications than the remaining applications. The evaluation results show that, looking at the top 50 NetZero-related technical fields as a whole, IPC Sections B–D, G, and H demonstrate the mountain-shaped pattern (Pattern II) that satisfies the hypothesis. They show that the NB Regression coefficient increases as the technical field shifts from the Same Field to the Neighboring Field at the significance level of 1% or less but decreases as the technical field shifts from the Neighboring to Distant Field.

Practical implications

Japanese patent applications filed in 2008 were selected for this study for two reasons explained in Section Data and method for analysis. However, it is noted that 2008 was only selected for the initial analysis. There was a need to ensure sufficient time to obtain the backward citations necessary to measure the proximities of the patents and the forward citations necessary to measure the impact on the subsequent applications. The author believes that the trend of a mountain-shaped pattern that was obtained by this study will have a certain degree of continuity even if the patent filing year changes. Nevertheless, the need for multi-year surveys to supplement this single-year survey will be required.

Using the technique proposed in this article, an R&D manager can obtain information to consider whether to search distant technologies, neighboring technologies, or combinations within the same fields in terms of their impact on forward citations in that field of interest. According to the Japan Patent Office Annual Report published in 2022, 72% of patent applications filed in 2015 have already been requested for examination. Considering that it is possible to apply for accelerated examination for the target of green innovation, it is thought that a considerable number of backward citations have been accumulated even in patent applications filed in 2015.

The author believes that the proposed technique will be useful as support for an R&D manager in identifying/discovering technical fields to tackle the target issue in line with the COP 26 initiatives.

Contribution of this article

In modern society, where technology is becoming more and more complicated, it is difficult to define technology with one technological classification. An increasing number of patent applications are matching subsequent classifications, even though the two patents have different initial classifications.

This study makes contributions to the previous literature by showing the proposed technique, which makes it possible to reduce the possibility that two inventions common to any of the subsequent classifications cannot be distinguished even if the primary classifications are different.

Limitations

In this study, only Japanese patent publications are targeted as backward citations, and utility model patent publications, general technical publications, and foreign patent publications are not included in the analysis. However, since there are technical fields in which these non-targeted publications account for a relatively high percentage of backward citations, the author withholds conclusions about the five technical fields mentioned in the previous section. Furthermore, in this article, the author defined three technical fields, namely, “same,” “neighboring,” and “distant” in Section Hypothesis. However, it cannot be said that the definitions of these terms have been sufficiently defined and unified in comparison with multiple previous studies. These points remain to be addressed in a future study.

Future research

This study was conducted for patent applications filed in 2008 only. There is still room to examine the hypothesis for patent applications filed in a year other than 2008. The author believes that the proposed technique can also be applied to the analysis of foreign patent applications. However, since there are important differences across jurisdictions in citation rules and practice as mentioned by Jaffe (2017, p. 1360), it is necessary to pay close attention to these differences when applying it.12

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

The author confirms being the sole contributor of this work and has approved it for publication.

Acknowledgments

The early version of this paper is supported by Innovation Management and Policy Program at Hitotsubashi University. The author would like to thank Prof. Masaru Karube and his seminar students for useful discussions, and to Assistant Prof. Tohru Yoshioka Kobayashi for his continuous support and constructive advice. The author also appreciates their valuable and useful comments from anonymous reviewers, which greatly improved the clarity of the manuscript. The author would like to sincerely thank Mr. Jonathan Griffiths for the English.

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frma.2023.978249/full#supplementary-material

Footnotes

1.^The keywords used in the search are as follows. The actual search formula was prepared using Japanese keywords. “carbon neutral + global warming + de-carbonization + de-carbonizing + de-plasticization + COP26 + United Nations Framework Convention on Climate Change + net-zero emission + zero carbon emissions”.

2.^Although the annual number of patent application filed in Japan has remained at a high level of around 400,000 until 2008, the number of total patent applications in Japan decreased by 10.8% from 2008 to 2009 and has been gradually decreasing thereafter (see, e.g., JPO Annual Report, 2012, Part 1, p12, Fig. 1-1-1; JPO Status Report, 2022, Part 1, Chapter 1, p3, Figs. 1-1-4). Fujii and Managi (2016, p. 15) pointed out that the main reason for this dramatic decline is the economic recession triggered by the collapse of Lehman Brothers in 2008.

3.^Schumpeter identified five cases of new combinations: “(1) The introduction of a new good—or of a new quality of a good. (2) The introduction of a new method of production… (3) The opening of new market… (4) The conquest of a new source of supply of raw materials or half manufactured goods… (5) The carrying out of new organization of any industry….” (Shumpeter, 1934, p. 66).

4.^For example, Walsh (1995) pointed out that knowledge structures can restrict opportunities to acquire new knowledge and limit an individual's ability to comprehend information. In addition, focusing on core capabilities, Leonard-Barton (1992) pointed out the existence of core rigidities behind them. Tripsas and Gavetti (2000) found that the beliefs and dominant logic that Polaroid has accumulated in the past have created structural inertia, which has become a factor that hinders adaptation to technological change.

5.^For example, Fujifilm Corporation, a manufacturer of cameras and film, entered the cosmetics field in 2006 by paying attention to the similarity between film and skin, both of which are composed of mainly collagen. Idemitsu Kosan applied technology that had been cultivated in petrochemicals to a light-emitting material for organic EL, realizing high-efficiency, long-life pure blue light emission (commonly known as “Idemitsu Blue”), which was considered the most difficult to develop at the time. In another case, Seiko Epson has redefined the inkjet technology accumulated in the printer business as a micro liquid process technology and entered the organic EL display business using industrial production technology. In addition, Nitto Denko has been working on the development of “transdermal medication patches”, which are medical formulations, by making use of its advanced technological capabilities related to “adhesion technology” cultivated with adhesives. All of these cases have in common that they strive to develop new technology in different fields by making use of their own strengths.

6.^Although the context is different, a similar indication was made by Thompson and Fox-Kean (2005) regarding Jaffe et al. (1993), which found a corresponding “control” patent issued in the same primary U.S. patent class as the citing patent.

7.^Similar examples are found in the case of “Insecticides/Herbicides (IPC's Class A01N)” classified as ATC No. 13 in combination with organic chemistry (IPC Class C07) as well as the case of Explosives (IPC Class C06), which is separated from Chemistry (IPC Section C) and transferred to ATC No. 26 (Weapons and Explosives).

8.^It may be necessary to separately create an ATC table corresponding to the data of the IPC subclass in order to perform the filtering process because the ATC Nos information has been excluded from the IIP Patent Database since the data were extracted on August 31, 2017 (for unknown reasons).

9.^The JPO Examination Guidelines state that if there is a prior art document that the applicant finds useful, such as when making amendments, the examiner may include it in the prior art information (see Guidelines, Part I?, Chapter 2, Section 2.3). Although it does not serve as a basis for rejecting the claimed invention, it is considered to be useful information for determining the category of subject applications.

10.^In the United States, there is a duty to disclose to the U.S. Patent and Trademark Office any known prior art that is material to the patentability of any claim of a pending U.S. patent application. Failure to disclose relevant prior art during the prosecution of a patent application may lead to the patent being unenforceable (Code of Federal Regulations, Title 37: 37 C.F.R 1.56).

11.^It has been reported that the reason for this is that in the United States, “applicant citation” and “examiner citation” had not been distinguished as citation information contained in patent gazettes until 2001 (Yasukawa, 2017, p. 74).

12.^Especially in the United States, in addition to the difference mentioned in Note 11, it has been pointed out that the citation tendency of U.S. patent examiners had been different from that of Japanese examiners since the application publication system was introduced in November 2000 (Yasukawa, 2016, 2017). In addition, it has been reported that the U.S. Patent Classification (UPC) lacks some uniformity in the scale of technology as an issue (Hall and Trajtenberg, 2004, p. 4).

References

  • 1

    AharonsonB. S.SchillingM. A. (2016). Mapping the technological landscape: measuring technology distance, technological footprints, and technology evolution. Res. Policy45, 8196. 10.1016/j.respol.2015.08.001

  • 2

    AhujaG.Morris LampertC. (2001). Entrepreneurship in the large corporation: A longitudinal study of how established firms create breakthrough inventions. Strat. Manag. J. 22, 521543. 10.1002/smj.176

  • 3

    AlbertM. B.AveryD.NarinF.McAllisterP. (1991). Direct validation of citation counts as indicators of industrially important patents. Res. Policy20, 251259. 10.1016/0048-7333(91)90055-U

  • 4

    BennerM.WaldfogelJ. (2008). Close to you? Bias and precision in patent-based measures of technological proximity. Res. Policy37, 15561567. 10.1016/j.respol.2008.05.011

  • 5

    CarpenterM. P.NarinF.WoolfP. (1981). Citation rates to technologically important patents. World Patent Inform.3, 160163. 10.1016/0172-2190(81)90098-3

  • 6

    CohenW. M.LevinthalD. A. (1990). Absorptive capacity: a new perspective on learning and innovation. Admin. Sci. Quart.35, 128152. 10.2307/2393553

  • 7

    CotropiaC. A.LemleyM. A.SampatB. (2013). Do applicant patent citations matter?Res. Policy42, 844854. 10.1016/j.respol.2013.01.003

  • 8

    FlemingL. (2001). Recombinant uncertainty in technological search. Manag. Sci.47, 117132. 10.1287/mnsc.47.1.117.10671

  • 9

    FuK.ChanJ.CaganJ.KotovskyK.SchunnC.WoodK. (2013). The meaning of “near” and “far”: the impact of structuring design databases and the effect of distance of analogy on design output. J. Mech. Design135, 158. 10.1115/1.4023158

  • 10

    FujiiH.ManagiS. (2016). Research and development strategy for environmental technology in Japan: a comparative study of the private and public sectors. Technol. Forec. Soc. Change112, 293302. 10.1016/j.techfore.2016.02.012

  • 11

    FujiiH.ShirakawaS. (2015). Decomposition analysis of green chemical technology inventions from 1971 to 2010 in Japan. In: Munich, Personal RePEc Archive (“MPRA”) Paper No. 62790. Munich Personal RePEc Archive (MPRA). Available online at: https://mpra.ub.uni-muenchen.de/62790/

  • 12

    GarudR.RappaM. A. (1994). A socio-cognitive model of technology evolution: the case of cochlear implants. Org. Sci.5, 344362. 10.1287/orsc.5.3.344

  • 13

    GotoA.MotohashiK. (2005). Tokkyo dētabēsu no kaihatsu to inobēshon [Development of patent database and innovation]. Instit. Intellect. Property Forum.63, 4349 (in Japanese). Available online at: https://www.iip.or.jp/patentdb/pdf/bunken.pdf

  • 14

    GotoA.MotohashiK. (2007). Construction of Japanese Patent Database for Research on Japanese patenting activities. Res. Policy. 36, 14311442. 10.1016/j.respol.2007.06.005

  • 15

    HallB.TrajtenbergM. (2004). Uncovering GPTS with patent data. In: NBER Working Paper No. 10901. National Bureau of Economic Research, Inc. Available online at: www.nber.org/papers/w10901

  • 16

    HallB. H.JaffeA. B.TrajtenbergM. (2001). The NBER Patent Citation Data File: Lessons, Insights and Methodological Tools. National Bureau of Economic Research Working Paper N. 8498. Available online at: http://www.nber.org/papers/w8498

  • 17

    HargadonA.SuttonR. I. (1997). Technology brokering and innovation in a product development firm. Admin. Sci. Quart.42, 716749. 10.2307/2393655

  • 18

    HarhoffD.NarinF.SchererF. M.VopelK. (1999). Citation frequency and the value of patented inventions. Rev. Econ. Stat.81, 511515. 10.1162/003465399558265

  • 19

    HarriganK. R.Di GuardoM. C.MarkuE.VelezB. N. (2017). Using a distance measure to operationalise patent originality. Technol. Anal. Strat. Manag.29, 9881001. 10.1080/09537325.2016.1260106

  • 20

    HaščičiI.MigottoM. (2015). Measuring Environmental Innovation Using Patent Data, Organization for Economic Co-operation and Development (OECD) Environment Working Papers No. 89, pp. 1–59. Organization for Economic Co-operation and Development (OECD) Publishing, Paris. Available online at: https://www.oecd-ilibrary.org/environment/measuring-environmental-innovation-using-patent-data_5js009kf48xw-en; https://www.oecd.org/env/working-papers-on-environmental-innovation.htm

  • 21

    HegdeD.SampatB. (2009). Examiner citations, applicant citations, and the private value of patents. Econ. Lett.105, 287289. 10.1016/j.econlet.2009.08.019

  • 22

    HiroseM. (2021). Predicting the potential applicability to other technical fields through the linkage between backward and forward citations. Front. Res. Metrics Analyt. 6, 736687. 10.3389/frma.2021.736687

  • 23

    JaffeA. (2017). Patent citation data in social science research: overview and best practices. J. Assoc. Inform. Sci. Technol.68, 13601374. 10.1002/asi.23731

  • 24

    JaffeA. B.TrajtenbergM.HendersonR. (1993). Geographic localization of knowledge spillovers as evidenced by patent citations. Quart. J. Econ.108, 577598. 10.2307/2118401

  • 25

    JPO Annual Report (2012). Japan Patent Office (“JPO”) Annual Report. Available online at: https://www.jpo.go.jp/e/resources/report/nenji/2012/ (accessed July 1, 2022).

  • 26

    JPO Status Report (2022). Available online at: https://www.jpo.go.jp/e/resources/report/statusreport/2022/ (accessed July 1, 2022).

  • 27

    KatilaR.AhujaG. (2002). Something old, something new: a longitudinal study of search behavior and new product introduction. Acad. Manag. J.45, 11831194. 10.2307/3069433

  • 28

    KeijlS.GilsingV. A.KnobenJ.DuystersG. (2016). The two faces of inventions: the relationship between recombination and impact in pharmaceutical biotechnology. Res. Policy45, 10611074. 10.1016/j.respol.2016.02.008

  • 29

    KimS. K.ArthursJ. D.SahaymA.CullenJ. B. (2013). Search behavior of the diversified firm: the impact of fit on innovation. Strat. Manag. J.34, 9991009. 10.1002/smj.2038

  • 30

    KuhnJ. M. (2017). Measurement and causal inference in patent strategy (doctoral dissertation). University of California, Berkeley. Available online at: https://escholarship.org/uc/item/0791g94s

  • 31

    Leonard-BartonD. (1992). Core capabilities and core rigidities: a paradox in new product development. Strat. Manag. J.13, 111126. 10.1002/smj.4250131009

  • 32

    MarchJ. G. (1991). Exploration and exploitation in organizational learning. Org. Sci.2, 7187. 10.1287/orsc.2.1.71

  • 33

    MarchJ. G.SimonH.A. (1958). Organizations. New York, NY: Wiley.

  • 34

    McNameeR. C. (2013). Can't see the forest for the leaves: Similarity and distance measures for hierarchical taxonomies with a patent classification example. Res. Policy42, 855873. 10.1016/j.respol.2013.01.006

  • 35

    MeyerM. (2000). What is special about patent citations? Differences between scientific and patent citations. Scientometrics49. 93123. 10.1023/A:1005613325648

  • 36

    MiharaK. (2012). Baiotekunoroji bun'ya ni okeru tokkyo bunrui oyobi in'yo joho o shihyo to shita tokkyo no kachi hyoka ni kansuru ichikosatsu [Considerations on patent valuation based on patent classification and citation in biotechnological field]. J. Inform. Proces. Manag. 54. 738749 (in Japanese). 10.1241/johokanri.54.738

  • 37

    MillerD.J.FernM.J.CardinalL. B. (2007). The use of knowledge for technological innovation within diversified firms. Acad. Manag. J.50, 308326. 10.5465/amj.2007.24634437

  • 38

  • 39

    NelsonR. R.WinterS. G. (1982). The Schumpeterian tradeoff revisited. Am. Econ. Rev.72, 114132.

  • 40

    NemetG. F. (2012). Inter-technology knowledge spillovers for energy technologies. Energy Econ.34, 12591270. 10.1016/j.eneco.2012.06.002

  • 41

    NemetG. F.JohnsonE. (2012). Do important inventions benefit from knowledge originating in other technological domains?Res. Policy41, 190200. 10.1016/j.respol.2011.08.009

  • 42

    PheneA.Fladmoe-LindquistK.MarshL. (2006). Breakthrough innovations in the US biotechnology industry: the effects of technological space and geographic origin. Strat. Manag. J.27, 369388. 10.1002/smj.522

  • 43

    PoppD.NewellR. G. (2009). Where does energy RandD come from? Examining crowding out from environmentally-friendly RandD (No. w15423). Natl. Bureau Econ. Res. Appendix A, 5355. 10.3386/w15423

  • 44

    RosenkopfL.AlmeidaP. (2003). Overcoming local search through alliances and mobility. Manag. Sci.49, 751766. 10.1287/mnsc.49.6.751.16026

  • 45

    RosenkopfL.NerkarA. (2001). Beyond local search: boundary-spanning, exploration, and impact in the optical disk industry. Strat. Manag. J.22, 287306. 10.1002/smj.160

  • 46

    ShaneS. (2000). Prior knowledge and the discovery of entrepreneurial opportunities. Organiz. Sci.11, 448469. 10.1287/orsc.11.4.448.14602

  • 47

    ShaneS. (2001). Technological opportunities and new firm creation. Manag. Sci.47, 205220. 10.1287/mnsc.47.2.205.9837

  • 48

    ShumpeterJ.A. (1934). The Theory of Economic Development: An Inquiry into Profits, Capital, Credit, Interest, and the Business Cycle.Cambridge, MA: Harvard University Press.

  • 49

    StuartT. E.PodolnyJ. M. (1996). Local search and the evolution of technological capabilities. Strat. Manag. J. 17, 2138. 10.1002/smj.4250171004

  • 50

    ThompsonP.Fox-KeanM. (2005). Patent citations and the geography of knowledge spillovers: a reassessment. Am. Econ. Rev.95, 450460. 10.1257/0002828053828509

  • 51

    TrajtenbergMHendersonRJaffeA. (1997). University versus corporate patents: a window on the basicness of invention. J. Econ. Innov. New Technol.5, 1950. 10.1080/10438599700000006

  • 52

    TripsasM.GavettiG. (2000). Capabilities, cognition, and inertia: evidence from digital imaging. Strat. Manag. J. 21, 1147116. Available online at: http://www.iot.ntnu.no/innovation/norsi-pims-courses/Levinthal/Tripsas%20&%20Gavetti%20(2000).pdf

  • 53

    TsunodaA. (2016). Tokkyo bunrui ni tsuite, bunrui o mitsume naosu [Patent classification, classification, with a fresh eye]. Part 2 J. Inform. Sci. Technol.66, 266271. 10.18919/jkg.66.6_266

  • 54

    TushmanM. L. (1977). Special boundary roles in the innovation process. Admin. Sci. Quart.22, 587605. 10.2307/2392402

  • 55

    WadaT. (2010). Hatsumei-sha ni yoru senko tokkyo ninshiki to tokkyo koho in'yo [Recognition of prior patents by inventors and patent backward citations.]. In: DP-RIETI Discussion Paper Series 10-J-001, 1–27 (in Japanese). Research Institute of Economy, Trade and Industry (RIETI). Available online at: https://policycommons.net/artifacts/1558301/dp/2248107

  • 56

    WalshJ. P. (1995). Managerial and organizational cognition: notes from a trip down memory lane. Organiz. Sci.6, 280321. 10.1287/orsc.6.3.280

  • 57

    YamadaS. (2010). Shinsa-kan in'yo wa juyo ka - tokkyo kachi hanbetsu shihyo to shite no hi in'yo kaisu no yuyo-sei [Is Examiner Citation Important? - Usefulness of Number of Citations as an Indicator of Patent Value]. Econ. Res. 61, 203213 (in Japanes Fujiie). Available online at: https://hermesir.lib.hitu.ac.jp/hermes/ir/re/19610/?lang=1

  • 58

    YasukawaS. (2016). Tokkyo - Tokkyo-kan no in'yo joho ni kansuru kenkyu: Nichibei ni okeru shinsa-kan zenpo in'yo kensu no yuyo-sei ni tsuite [Study on Citation Information Between Patents: Usefulness of Examiner Forward Citations in Japan and the United States] (Doctoral dissertation). University of Tokyo (in Japanese), Tokyo, Japan. Available online at: https://repository.dl.itc.u-tokyo.ac.jp/record/49864/files/B18174.pdf

  • 59

    YasukawaS. (2017). Nichibei ni okeru shinsa-kan zenpo in'yo kensu no yuyo-sei ni kansuru kenkyu [Study on usefulness of statistics of the number of forward citations by examiners in Japan and the U.S.]. In: Patent, 70, No. 4, 73–87 (in Japanese). Japan Patent Attorneys Association (JPAA). Available online at: https://jpaa-patent.info/patent/viewPdf/2777

Summary

Keywords

patent analytics, NetZero, distant knowledge, backward citations, technical field

Citation

Hirose M (2023) Should an R&D manager refer to distant technical fields? The effectiveness of new combinations with knowledge from different technical fields through the quantitative analysis of patent data related to NetZero. Front. Res. Metr. Anal. 8:978249. doi: 10.3389/frma.2023.978249

Received

25 June 2022

Accepted

21 February 2023

Published

11 April 2023

Volume

8 - 2023

Edited by

Santiago Ruiz Navas, Ritsumeikan Asia Pacific University, Japan

Reviewed by

Igam Moaniba, International Monetary Fund, United States; Antonio K. W. Lau, Kyung Hee University, Republic of Korea

Updates

Copyright

*Correspondence: Masayuki Hirose

This article was submitted to Patent Analytics, a section of the journal Frontiers in Research Metrics and Analytics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics