Obstacle Detection Display for Visually Impaired: Coding of Direction, Distance, and Height on a Vibrotactile Waist Band

van Erp, Jan B. F.; Kroon, Liselotte C. M.; Mioch, Tina; Paul, Katja I.

doi:10.3389/fict.2017.00023

ORIGINAL RESEARCH article

Front. ICT, 13 October 2017

Sec. Human-Media Interaction

Volume 4 - 2017 | https://doi.org/10.3389/fict.2017.00023

Obstacle Detection Display for Visually Impaired: Coding of Direction, Distance, and Height on a Vibrotactile Waist Band

Jan B. F. van Erp^1,2*

Liselotte C. M. Kroon¹

Tina Mioch¹

Katja I. Paul³

¹Perceptual and Cognitive Systems, TNO, Soesterberg, Netherlands
²Computer Science, University of Twente, Enschede, Netherlands
³University of Groningen, Groningen, Netherlands

Electronic travel aids (ETAs) can potentially increase the safety and comfort of blind users by detecting and displaying obstacles outside the range of the white cane. In a series of experiments, we aim to balance the amount of information displayed and the comprehensibility of the information taking into account the risk of information overload. In Experiment 1, we investigate perception of compound signals displayed on a tactile vest while walking. The results confirm that the threat of information overload is clear and present. Tactile coding parameters that are sufficiently discriminable in isolation may not be so in compound signals and while walking and using the white cane. Horizontal tactor location is a strong coding parameter, and temporal pattern is the preferred secondary coding parameter. Vertical location is also possible as coding parameter but it requires additional tactors and makes the display hardware more complex and expensive and less user friendly. In Experiment 2, we investigate how we can off-load the tactile modality by mitigating part of the information to an auditory display. Off-loading the tactile modality through auditory presentation is possible, but this off-loading is limited and may result in a new threat of auditory overload. In addition, taxing the auditory channel may in turn interfere with other auditory cues from the environment. In Experiment 3, we off-load the tactile sense by reducing the amount of displayed information using several filter rules. The resulting design was evaluated in Experiment 4 with visually impaired users. Although they acknowledge the potential of the display, the added of the ETA as a whole also depends on its sensor and object recognition capabilities. We recommend to use not more than two coding parameters in a tactile compound message and apply filter rules to reduce the amount of obstacles to be displayed in an obstacle avoidance ETA.

Introduction

According to the World Health Organization (2014), 285 million people are estimated to be visually impaired worldwide: 39 million are blind and 246 million have low vision. Electronic travel aids (ETAs) have the potential to increase the mobility and with that the quality of life of the visually impaired. ETAs have two main components: (1) sensors to determine location and orientation and to detect and possibly identify objects in the environment and (2) displays to present information about for instance waypoints and obstacles. Since visual displays are of little or no use to this user population, work has been focused on displaying information through the sense of hearing and/or touch. In this paper, we focus on the design of a multisensory display for an ETA and not on its sensors. Sensor technology that is able to detect and identify obstacles beyond the range of the white cane is developing rapidly although it certainly is not perfect yet (Zeng et al., 2017a,b). Through a series of experiments, we aim to formulate recommendations that balance the information availability (provided by the ETA) and the information processing capabilities of the user. This balance is important because the temptation to present all information generated by the ETA may result in an increased threat of information overload of the user and reduce the potential of the ETA system.

Levels of Navigation

Electronic travel aids support navigation tasks. Wickens (Wickens, 1992; Prevett and Wickens, 1994) distinguished two levels of navigation: global awareness and local guidance. Global awareness is mainly concerned with routes to a target location and understanding the spatial layout of points of interest and landmarks. A GPS navigation system that takes you from location A to B supports this global awareness level of navigation. Local guidance is concerned with understanding the spatial structure of the immediate environment, maneuvering along a route, and interacting with objects alongside it. A lane departure warning system typically supports this local guidance level of navigation. Both navigation subtasks require a different display design (Roscoe, 1968; Wickens, 1992; Wickens and Prevett, 1995; Van Erp, 2007). Global awareness requires a world-referenced or north-up display with a 360° field of regard while local guidance requires an ego-referenced or heading-up display, a forward cone of regard and a high information update rate. Failing to pick up local guidance information correctly and timely may have serious consequences for tasks, such as keeping course, circumventing obstacles, and avoiding collisions with stationary and moving objects.

Tactile Navigation Displays

Potential bottlenecks to perceive and process local guidance information are visual and cognitive overload (Van Erp and Werkhoven, 2006; Elliott et al., 2010). This led researchers and developers to explore display approaches, for instance based on the sense of touch (i.e., haptic and tactile displays) to reduce the risk of visual or auditory overload. Our sense of touch continuously processes local guidance information, and starting in the 1990s, vibrotactile displays (e.g., vests or belts containing actuators that produce a localized vibration) were developed for a range of local guidance tasks. At that time, the potential for visually impaired users was also acknowledged. For instance by Ertan et al. (1998) who developed a system consisting of a 4 × 4 tactor array on the back displaying the four cardinal directions and a stop signal for navigation.

Successful tests and implementations are reported for soldiers (Elliott et al., 2010), pilots (Eriksson et al., 2006), and drivers (Van Erp and Van Veen, 2004). Apart from reducing the risk of visual and auditory information overload, these vibration belts and vests have the inherent advantage of being ego-referenced since the trunk may be considered as “the physical anchor of the internal representation of body orientation in space” (Karnath et al., 1991, 1993) and is critical in bodily self-consciousness (Blanke et al., 2015; Serino et al., 2015). Belts and vests can also easily implement a forward cone of regard thereby meeting important requirements for local guidance displays (Van Erp, 2005a).

Tactile ETAs for Visually Impaired Users

Tactile displays have also been employed for ETAs for the visually impaired, but not always in the form of a belt. Many follow the approach of sensory substitution, in this case converting visual information (camera images) to a “tactile picture.” First explorations by Bach-Y-Rita date back to the late 1960s (Bach-y-Rita et al., 1969). Experiments show that users are indeed able to learn to recognize simple, isolated objects but that the principle is of limited use for (fast) local guidance tasks and cluttered environments (Johnson and Higgins, 2006). Next to sensory substitution, local information can be displayed in an abstract or symbolic manner. Tang and Beebe (2006) used an electrotactile display implemented as a mouthpiece to present directional cues of a navigation system to the roof of the mouth in the form of arrow patterns. Pielot et al. (2012) developed and tested a pocket-based, single-actuator tactile navigation application that mapped navigation information to vibration patterns. Information on use was gathered from anonymous and presumably sighted users. The pocket-based approach was adopted in one-third of the trips and the authors mention its positive effects on reducing distraction (i.e., the threat of visual and/or cognitive overload). Zelek (2005) describes a glove with vibrating elements to present objects and the ground plane distilled from stereo camera images. Vibration intensity was used to provide distance cues. Shah et al. (2006), Bouzit et al. (2004), Kim et al. (2015), and Amemiya and Sugiyama (2009) also used the hands as display location. These hand-mounted displays can successfully support blind users to safely walk along a predefined route at their usual walking pace but also have several inherent disadvantages: the hands are not ego-referenced and information presentation may also interfere with other tasks, including simply holding objects and using the white cane (Pawluk et al., 2015). Also, displays for the hands are difficult to hide while users prefer a discreet system (Conradie et al., 2014). The same argument holds for displays mounted on the head or neck like the haptic radar system that provides localized haptic cues based on multiple sensors that can detect obstacles developed by Cassinelli et al. (2007).

Velazquez and Bazan (2010) developed an on-shoe tactile display and suggest that it may be useful for visually impaired users to present navigation information (see also Meier et al. (2015) for a recent discussion on foot-based displays). Palleja et al. (2010) used a tactile belt coupled to a “simulated white cane”: a LIDAR and accelerometers mounted on the forearm. A similar approach was taken by Kim et al. (2014) but based on stereo cameras and users pointing their finger. Bousbia-Salah and colleagues (Bousbia-Salah and Fezari, 2007; Bousbia-Salah et al., 2011) developed a system that combines different sensors and tactile and synthetic speech output. The tactile display consisted of only two actuators to warn for over-hanging objects. Flores et al. (2015) made an explicit comparison between a tactile display in the form of a belt and an auditory display evaluated by 10 blind participants and conclude that the belt results in closer path following but lower walking speeds. The directional information provided by the belt was rated very positive. In addition to providing directional information, Cosgun et al. (2014) added rotational information. They reasoned that directional information sec may not be sufficient in cluttered environments requiring fine maneuvers. So far, they tested different ways of displaying rotational information, but their approach has not been evaluated in a real maneuver task. Related to the presentation of navigation information is the use of a haptic belt to convey non-verbal communication cues. McDaniel et al. (2008) developed such a system and showed that the direction and distance of a conversation partner can be presented on a tactile belt using vibration location and duration. More at the level of global awareness, Karcher et al. (2012) tested a belt that continuously displayed the direction of north and tested this with a single blind participant after extended training. They concluded that the belt improved performance in several orientation tasks and increased the mobility and feelings of security of the participant. Heuten et al. (2008) tested a tactile belt for wayfinding presenting waypoint directions and deviations from the path.

In addition to the systems using tactile display technology, many ETAs implemented an auditory representation either through speech or through non-speech sonification (see Dakopoulos and Bourbakis, 2010, for a survey). Examples include the NavBelt based on binaural feedback to indicate the direction of travel (Borenstein, 1990; Shoval et al., 1998), seeing with sound based on frequency and amplitude variations (Meijer, 1992), the NAVI system based on stereo acoustic patterns (Sainarayanan et al., 2007), and a system based on 3D specialized sounds to represent obstacle distance and direction (Aguerrevere et al., 2004).

Commercial ETAs for Visually Impaired Users

The above-mentioned studies show the diversity in display approach for ETAs for visually impaired users. However, the research systems have not reached a level of technical maturity beyond their use in a laboratory environment. There are several commercially available ETA systems that implement auditory and/or tactile display technology, be it with restricted functionality. Stand-alone ETAs (i.e., that do not depend on for instance tags, markings, or beacons in the environment) include the UltraCane, the iGlasses, the K-Sonar Cane, and the Miniguide. The UltraCane has two ultrasonic sensors and two vibrating buttons located on the handle of the white cane. The iGlasses Ultrasonic Mobility Aid focuses on detecting obstacles above waist height but cannot detect drop-offs, e.g., stairs and holes. The K-Sonar Cane uses ultrasound sensors and can be attached on a common white cane and displays obstacles through sounds. Finally, the Miniguide is a hand-held ultrasonic obstacle detector that vibrates when an obstacle is detected withinin its range. Generally, ultrasonic sensors are low cost but may fail to detect small obstacles and holes, steps, and curbs. This may potentially be solved by infrared sensors, but they have a restricted field of view.

Challenges for ETAs for Visually Impaired Users

Generally, ETAs for visually impaired users face four challenges: to display more than one obstacle at a time, to increase the range substantially beyond that of the white cane (i.e., to 5 m or more), to provide a wide field of regard, and to detect and present ground-based as well as hanging objects. Focusing on the display module of the ETA, most relevant research questions are:

1. how can multiple objects be displayed with minimal risk of perceptual and cognitive overload?

2. how can the display present multiple information features of an object such as direction, distance, height, and object identity (ID) with minimal risk of perceptual and cognitive overload?

3. how can auditory and tactile presentation be combined?

We will approach these research questions from a user perspective and taking into account important user requirements for ETAs. These user requirements are presented in-depth elsewhere (Miao et al., 2011; Conradie et al., 2014; Lewis et al., 2015). Based on these documents, the critical user requirements for an ETA’s display module are the following:

• users are reluctant to rely heavily on audio for information presentation,

• users prefer to use audio for vital information only,

• users are worried about the risk of tactile overload,

• users do not like a continuous tactile signal and indicate that a filter is needed,

• users expect an ETA to be able to provide the direction and distance of an obstacle and preferably its height (especially hanging objects since these are difficult to detect using the white cane). Displaying object ID is not an explicitly user requirement, but may become relevant now sensor systems become available that go beyond obstacle detection and enable object recognition.

Although the overview above indicates that several body parts, including hands, feet and torso, may be feasible as location for a tactile ETA display, we adapt a tactile belt approach. The main reasons are the proven effectiveness in navigation applications, and the trunk’s unique role in spatial awareness as “the physical anchor of the internal representation of body orientation in space” (Karnath et al., 1991). Belt-like displays have also been employed before but often as a proof of concept only without going through a systematic display design process, and without addressing the threat of information overload. We describe the iterative development and tests of an ETA display involving naive participants, display design experts, and visually impaired end users. Point of departure is that the ETA is used by a visually impaired user in combination with the white cane. We start with designing the compound tactile signals to present all four parameters of interest (object direction, distance, height, and ID). The design is tested in Experiment 1. The results of Experiment 1 show that the compound tactile signals used in Experiment 1 carry in them a high risk of information overload. We test two approaches to off-load the tactile channel: by presenting part of the compound signal through the auditory channel (Experiment 2) and to reduce the amount of information presented by designing a minimalistic tactile signal (Experiment 3). The result of the minimalistic design of Experiment 3 is evaluated in the final experiment (Experiment 4) with end users.

Design of the Compound Tactile Coding

Information Chunks to Be Displayed and Display Parameters

Based on user requirements, best practices, and local navigation requirements, we set the boundary conditions for the design, operationalized as follows:

• refresh rate of the information display: 1 Hz or better

• direction: three levels (left, middle, right to cover the forward cone of regard)

• distance: four levels

• secondary parameter object height: three levels (e.g., below ground, ground-based, hanging)

• secondary parameter object ID: four levels (users were primarily interested in stairs, doors, unknown object, and sensor noise).

Tactile Display Parameters Available in Vibrator Motor Based Displays Worn on the Torso and Their Usefulness

Over the past decade, guidelines have become available, among others in ETSI (2002) guidelines and ISO (2009) norms (Van Erp and Kern, 2008; Sinclair et al., 2012). Below is a concise summary.

• Vibration intensity and frequency. These are generally of limited use because both are affected by actuator differences, attachment method, etc. ISO (2009) recommends that the number of levels should be limited to three.

• Actuator location. Useful parameter; the spatial acuity of the torso is high enough to distinguish at least 12 different locations horizontally (i.e. columns like clock hours) and at least 5 locations vertically (i.e., rings around the torso).

• Temporal pattern or on–off rhythm (resembling Morse-code like patterns). Very useful parameter because of the high temporal acuity of the tactile sense. According to ETSI (2002) and ISO (2009) guidelines, temporal parameters have a clear relation to for instance perceived urgency (Van Erp et al., 2015).

• Higher order spatial/temporal/intensity patterns. Potentially useful are, for instance, spatio-temporal patterns to elicit apparent motion and intensity-temporal patterns to create sweeps of increasing or decreasing intensity.

Mapping Information to Display Parameters

Point of departure is to map the primary ETA information (direction and distance) to the display parameters that are recommended as best perceived (actuator location and temporal pattern) and map the secondary ETA information (height and object ID) to the remaining display parameters. To allow multiple levels in both horizontal and vertical location, we use a 3 × 3 tactor display worn on the frontal side of the torso and with a 5-cm center-to-center distance between the tactors (Figure 1).

FIGURE 1

Figure 1. Placement of the tactile belt. White dots indicate the location of the tactors. The top row (1, 2, 3) represent hanging obstacles, the middle row (4, 5, 6) grounded obstacles, and the bottom row (7, 8, 9) gaps. The left column (1, 4, 7) represent obstacle to the right, the middle column (2, 5, 8) obstacle in front, and the right column (3, 6, 9) obstacles to the left.

The direction of the object is indicated by the horizontal position of the activated tactor (Figure 2). The height of the object is mapped to the different rows of the display, indicating a gap (bottom ring), ground-based object (middle ring), or hanging object (top ring).

FIGURE 2

Figure 2. The cone of regard of a possible electronic travel aid and with different object direction and distance categories.

Object ID is coded in the signal pattern resembling Morse-code, i.e., combinations of three short (100 ms) and/or long (300 ms) pulses. This study distinguishes 4 object ID’s which are selected by our focus group consisting of visually impaired users: stairs, doors, unknown object, and sensor noise. Of course, object ID can be given any meaning a system or users prefers (see Figure 3).

FIGURE 3

Figure 3. Design of the pattern for object identity.

Object distance is coded in the rate of repetition. Each object ID pattern is always repeated three times. The interval in between the repetitions depends on the distance, comparable to the auditory beeps of a parking assist. Four distance levels were implemented with intervals of 400 ms (“far”), 300 ms (“near”), 200 ms (“close”), and 100 ms (“very close”). See Figure 2 for a possible representation.

Experiment 1: Perceiving Tactile Compound Signals While Walking

In this experiment, we test the compound tactile signals that present all four parameters of interest (object direction, distance, height, and ID).

Methods

Participants

A total of 18 non-visually impaired volunteers participated (7 males, mean age 43.7 years SD 13.0). The following inclusion and exclusion criteria applied: age between 20 and 60 years old; no physical restrains with regard to walking; no difficulties with keeping balance; no known auditory or tactile deficits. All volunteers were also invited to participate in Experiment 2 to be run on the same day. Before engaging in the experiment, participants read and signed an informed consent. The experiment was approved by the Institutional Review Board (TNO TCPE, Soesterberg). Participants received a monetary reward according to the IRB norms, and their travel costs were reimbursed.

Apparatus

The evaluation was conducted in a room with a 5 m × 5 m free floor space, lined with a guiding strip. This strip could be detected with a white cane such that participants could walk in circles. Participants’ vision was completely blocked by goggles that were easy to put on and to take off to ensure safety and comfort during the evaluation. Participants had a standard white cane (also known as a “Hoover” cane, as used by visually impaired people) to detect the guiding strip on the floor.

The tactile patterns were presented on a 3 × 3 tactile display (Elitac Science Suit^®, Amsterdam, The Netherlands). The display consists of a strip of flexible and stretchy material that can be adjusted to fit each body and applies equal pressure to all tactors that are mounted on the inside. The tactors are circular shaped pancake motors with a diameter of 10 mm, housed in a casing of 39 mm × 13 mm × 6 mm. Vibration frequency was 128 Hz. The tactors were wired to a control unit of 80 mm × 50 mm × 21 mm that was mounted on the outside of the strip on the back of the participant. The control unit was connected via Bluetooth to the control laptop operated by the test leader. The tactors were ordered in a 3 × 3 grid with 5 cm distance center-to-center. Participants wore the belt over their first layer of clothes resulting in a vibration intensity that was easy to feel. The center tactor of the 3 × 3 grid was located just above the belly button.

Tactile Patterns

The design of the tactile pattern is described in the previous section. All possible combinations of object direction (3 levels), object height (3 levels), and object distance (4 levels) were tested (i.e., 36 cases). Object ID (4 levels) was randomly varied over these 36 cases to restrict the total number of cases because pilot studies showed that the duration using a full factorial design (with 144 cases) would be too long for participants to stay alert and concentrated. The order of the 36 cases was randomized for each participant.

Procedure

Participants first read a written instruction about the goal of the evaluation, the working of the tactile belt and the test procedures. Next, the test leader showed the participant all materials used in the evaluation and their functioning, and answered any remaining questions of the participant. Before donning the tactile display, a brief equilibrium test was conducted: the participant was asked to position his or her feet next to each other with a shoulder wide gap in between and stand still with closed eyes for 1 min.

Training of the Tactile Signals Separately

Under supervision of the test leader the participant was asked to put on the tactile belt over his/her first layer of clothes. The center tactor (tactor 5) was always placed right above the participant’s belly button, to ensure a similar tactor position between participants.

Next, the training for identifying object direction, height, distance, and ID was conducted. The levels of a signal dimension (for instance tactor location) were presented in logic order, than twice randomly, while the other signal variables (in this example signal interval and signal pattern) remained constant. The participant was asked to give a response after each signal by telling which object information he/she could identify, after which the test leader gave feedback. During this training, the participant stood still, without the vision blocking goggles and the white cane. The test leader also ensured with this test whether the participant was able to feel all tactors and whether the system was comfortable to the participant. In case the participant yield low correct response rates after training one signal dimension or if a participant was still unsure about his/her performance, the test leader could repeat the training once more. If the participant could not distinguish object information correctly in at least 75% of all cases after the training or could not feel a particular tactor at all, the participant would be asked to stop with the test at this point in time. However, this did not occur during the experiment.

Testing and Experiencing the Vision Blocking Goggles and the White Cane

The test leader asked the participant to walk around by using the white cane to detect the guiding strip on the floor; first without vision blocking goggles, then with. Only when the participant was comfortable walking around blinded the test leader proceeded with the last training step.

Training Tactile Signals Combined

During the last training all signals were presented in random order to the participants, such that the participant could train in recognizing object direction, distance, and ID at the same time, though, for the purpose of the training, still with feedback and without walking around blinded. Participants were presented with two randomly ordered sets of 18 signals.

Experiment

Participants walked around blinded with a white cane, while they were given all 36 compound signals. After 18 signals, a 5-min brake was given. The signals were started by the test leader one by one. Only after the participant indicated what he/she felt, by calling out loud object direction, height, distance, and ID (in any chosen order), the next signal was started by the test leader. There was a minimal interval of 5 s between signals. There was no time limit for the participant to respond. Each answer was registered by the test leader on the computer. If a participant could not identify an object feature “no response” was entered for that specific feature. The test leader also noted general observations of the participant’s behavior and remarks and comments made by the participant.

Results

One participant could not feel tactor 7 (object direction: right, object height: gap) during the test; these four cases [four object distances randomly combined with four object IDs: object (2×), noise (1×), stairs (1×)] where removed from the data set. We will present the results along the features of the compound signals: object direction and height (coded by tactor location), object distance, and object ID.

Tactor Identification (Coding Object Direction and Height)

Table 1 shows the confusion matrix for the individual tactors. The left column indicates the stimulus presented to the participants (tactor 1–9, see also Figure 1), the other columns the participants’ response (tactor 1–9, or “no response” in case the participant did not give an answer). Please note that this is the confusion matrix for the nine tactors, i.e., the combination of the information features object direction and object height.

TABLE 1

Table 1. Confusion matrix for tactor location (object direction and object height).

It can be seen that the percentages correct responses were between 42.1 and 76.1%, ample above chance performance (11%). An inter-rater reliability analysis using the Kappa statistic was performed to determine consistency among raters. Cohen’s Kappa shows a moderate to substantial consensus with a value of 0.61 (p = 0.00). In addition, the table shows which confusions were common. Participants mainly confused the tactors with those that are above or below the presented stimulus (marked dark gray in the table) but hardly with tactors to the left or the right (marked light gray). This means that the confusion was primarily between rows and not columns, or in other words: it seems more difficult to participants to determine the height of a tactor than the horizontal direction when the tactors are located at 5 cm distances on the belly. Furthermore, tactors in the middle column (tactor 2, 5, and 8) yielded better response rates than tactors in the outer columns. This effect is reversed for the rows: tactors in the outer rows (tactor 1–2–3, 7–8–9) yielded better response rates than tactors in the middle row (tactor 4–5–6). This may be due to the fact that the middle row simply had more neighboring tactors than the top and bottom rows (end of scale effect).

Object Distance Identification

Table 2 shows the confusion matrix for object distance. The results indicate that determining distance was difficult for the participants. The percentages correct responses were between 12.8 and 47.0% with chance performance at 25%. Cohen’s Kappa was 0.13 (p = 0.00), indicating only a slight agreement between the given stimulus and participants’ response.

TABLE 2

Table 2. Confusion matrix for object distance (coded through duration between signal repetitions).

It can be seen that the confusion between distance categories became less when the mutual differences in signal frequency became greater, i.e., “very close” got more confused with “close” (difference of 100 ms brakes between signals) than with “near” or “far” (difference of, respectively, 200 and 300 ms brakes between signals). The table shows that particularly the distance level “far” yielded bad response rates: in less than 13% of the cases participants correctly identified the “far” level. The results indicate that four distance categories with the current coding is too much. Based on the data and the observations, the use of two categories seems a better choice.

Object ID Identification

Table 3 shows the confusion matrix for object ID. It shows that the percentages correct responses were between 51.4 and 72.2% (with change level at 25%). Cohen’s Kappa provides insight in the consistency among raters and turned out to be 0.51 (p = 0.00), indicating a moderate agreement. IDs “Object,” “Noise,” and “door” which were all either homogenous signals or symmetric ones were understood best by participants.

TABLE 3

Table 3. Confusion matrix for object identity (ID) (signal pattern).

Observations by Test Leader

During the experiment, the test leader took notes of the participant behavior and remarks and registered additional instructions that had to be given. Most participants stopped walking before they could indicate the object features they felt. It seemed that identifying all four object features took a lot of mental processing.

Discussion

The scores for the tactor identification (65.6%) are ample above chance (11.1%). Since the rows and columns code different features of the object (direction and height, respectively), it is useful to look at the errors made along these two dimensions separately. This closer inspection shows that the majority of the incorrect replies are “within column” errors, i.e., errors with the correct direction but the wrong height. On average, object direction was correct for 90.3% of the responses and object height for 70.5% of the responses (for both, change level is 33.3%). The fact that tactors in the middle column (tactor 2–5–8) yield better response rates than tactors in the outer columns (tactor 1–4–7, 3–6–9), may suggest that one is more sensitive to vibrations given at the middle of the belly. This is in line with findings on spatial resolution (which is higher on the midsagittal plane than a few centimeters away from it, Van Erp, 2005b) and on direction perception which is substantially better for vibrations on the body midline because the body midline is an important anchor point in direction perception and spatial cognition (Van Erp, 2005a). There is no such anchor point on the torso for the vertical localization.

The average score for distance is low (33.2%) and only just above chance (25%). There appears to be a large bias in the responses because a number of participants did not make the distinction between “near” and “far,” as a result of which “near” was excessively mentioned and “far” was underrepresented in the data set. The fact that participants identified “far” poorly may be related to an interaction effect between distance coding (signal interval or speed of repetition) and object ID coding (signal pattern). The signal for “object” (3 ms × 100 ms) was often experienced as (very) “close,” even when given with long brakes in between the signals, i.e., it was experienced as a fast signal, even with a low speed of repetition. With the longer brakes, the whole duration of the three signals was still short in comparison to signals with a longer burst duration and shorter intervals. With the current test setup it was not possible to test for interaction effects to prove the existence of this relationship. The data leads to the recommendation that distinguishing two distance categories is feasible, but more signals may lead to categorizing the signals into two groups when presented as a compound message in combination with coding object ID by signal pattern.

The scores for object ID are all above chance (mean 61.7% correct, chance 25%). Inspection of the data does not reveal specific biases or confusions: the errors made seem to be relatively evenly distributed along the response categories. Albeit the average score of only about 60% correct, the scores are much higher than those for distance indicating that for the current compound message, the coding parameter temporal pattern is preferred over speed of repetition.

Cohen’s Kappa for object direction (tactor position) and object ID (signal pattern) yield moderate to substantial agreements indicating generalizability of the results over participants. The percentages correct indicate that there may still be room for improvement. It seems that the complex task of identifying all four types of information (object direction, height, distance, and ID) based on one compound signal and while walking blinded and using a white cane severely taxes the perceptual and cognitive resources and may carry the potential threat of information overload.

Conclusion

The objective of Experiment 1 was to investigate whether a compound signal presenting the core ETA information was usable, and if not what the boundary conditions are. Based on the performance and the observations made by the experimenter, we may conclude that the current compound signals may lead to substantial perceptual and/or cognitive processing demands. Users may have to slow down or even stop walking while processing the information and performance for specific dimensions may be too low to be usable. The results also confirm that performance may differ along coding dimensions: horizontal tactor location leads to best performance, while vertical location scores are substantially lower. With respect to temporal coding, the data show that on–off pattern is a solid coding principle but speed of repetition is not and the combination of both may introduce confusion. Slowing down the pattern and rate of presentation may increase identification performance but is not an option in the time-critical task of walking and avoiding obstacles. With an average walking speed of 1–1.5 m/s, an information refresh rate below one per second increases the risk that users cover the distance to an obstacle before it is presented. For example, a refresh rate of one per 2 s means that the user covers 2–3 m in between obstacle presentations.

A first solution to reduce the risk of overload is to off-load the tactile modality by presenting one or two of the four object features through the auditory modality. However, as visually impaired users already rely heavily on audition for communication and to pick up environmental cues, using the auditory channel to present ETA information may result in a new threat of (auditory) overload. The auditory signals should at the least not interfere with the main use of audition or the user should be able to switch the auditory presentation off without losing critical ETA information. The effect of coding distance and/or object ID through auditory signals is investigated in Experiment 2.

A second solution is to use the tactile channel only, but reduce the amount of displayed information, for instance by filtering out less relevant objects or by reducing the number of object features presented from the current four to two or three in a minimalistic tactile compound signal. Further optimization of this filter and the tactile compound signal is described in Experiment 3 (reporting the iterative design process with experts) and evaluated in Experiment 4 with visually impaired participants.

Experiment 2: Off-Loading the Tactile Channel Using Multimodal Presentation

The results of Experiment 1 show that the compound tactile signals carry in them a high risk of perceptual overload. In Experiment 2, we test the effect of off-loading the tactile channel by presenting part of the compound signal through the auditory modality.