Simpson J was of the opinion that s 79 accommodated the idea of an ad hoc expert.
41 Mr Nicholson QC, who appeared with Mr Stanton for the appellant, submitted that the evidence of Mr Chan (and also Sergeant Lee and Dr Gibbons) was irrelevant as the tapes were available to the jury and the jury could have made the necessary comparisons for themselves. He submitted that Smith v The Queen (2001) 206 CLR 650 supported this submission.
42 Butera and Leung, however, demonstrate that evidence of ad hoc expertise is admissible. The very many hours that Mr Chan spent listening to and working on the five tapes with a view to identifying the words spoken by M1 qualified him as an ad hoc expert in the characteristics of M1's voice.
43 In any event, as the words on the five tapes were spoken in Cantonese, it was appropriate for the jury to hear identification evidence from a person who understood the words spoken. As Simpson J observed in R v Leung at 414:
"Voice comparison is not necessarily a question for expert evidence, although it may be. If the two sets of tape-recordings in the present case had been in English, it would have been open to the Crown to have left it to the jury to make their own comparison and assessment of whether the voices on the DAT tapes (or any of them) corresponded to either of the voices on the police tapes. That course theoretically remained open but would have left the jury with a task immeasurably more difficult, given the reasonable assumption that no member of the jury understood either of the Chinese languages involved. The jury would, truly, have been comparing voices only, without the intrusion of language and speech patterns that are part of voice identification".
44 In my opinion, Mr Chan was qualified to say whether, in his opinion, the voice of the appellant, as heard in the record of interview tape, was the same voice as the voice Mr Chan had identified as M1 on the five tapes. His evidence (and that of Sergeant Lee and Dr Gibbons) was not irrelevant.
45 In submitting that the evidence of Mr Chan should not have been admitted, Mr Nicholson submitted that Mr Chan's evidence "had the following weaknesses":
"[a] When performing his task of comparing the tapes, he did not think of the possibility that what was recorded on the tape was a compilation of different passages of speech from different locations. He conceded the probability of him being mistaken as to the context surrounding his interpretation (T.474).
[b] He was not a voice recognition expert (T.475).
[c] A year may have elapsed between the time when he was engaged in interpreting the tapes and the time when he was asked to give his opinion on the similarity issue.
[d] When he listened to those tapes again one year later, he may have listened to them only once (T.477).
[e] The opinion he gave was an 'ordinary man's opinion' as to the sameness of the voices on the tapes (T.477)
[f] He would not say there were any special features of the voice on the record of interview tape which would enable him with certainty to say the voice was different from those of other confident male Cantonese speakers with deep tonal voices (T.477).
[g] He agreed that when seeking to understand the language in a conversation it is highly desirable to have both sides of the conversation, instead of one side only as in a telephone conversation. He further agreed that because of the nature of the language, Cantonese, there are some ambiguities which can exist without both sides of the conversation being heard (T.479).
[h] He also agreed that it will sometimes be almost impossible to ascertain meaning without both sides of the conversation (T.480).
[i] The witness further agreed that people speaking on a telephone have a different type of speech from people speaking face to face (T.480).
[j] He possessed no training, knowledge or experience in comparing voices speaking in English and those speaking in Cantonese (T.485).:
46 Mr Nicholson submitted that, on these grounds, Shadbolt DCJ should have refused to admit the evidence because "its probative value is outweighed by the danger of unfair prejudice to the defendant" (to use the words of s 137 of the Evidence Act).
47 As regards the submission in paragraph [a] above, Mr Chan did not concede the probability of him being mistaken as to the context surrounding his interpretation. He said that there was a "possibility" of him being mistaken. In any event, in my view, the "context" in which the voices were spoken, while being of significance to the interpretation of words for translation purposes, is, I would think, far less relevant than the identification of the voices speaking the words concerned.
48 As regards paragraph [b] above, Mr Chan did not have to be a voice recognition expert to become an ad hoc expert within the meaning of R v Leung. In any event, his years of practising as an interpreter and translator would have been of great assistance to him as it meant he had considerable familiarity with the language spoken and with voices speaking Cantonese and English.
49 As regards paragraph [c], the relevance of the lapse of the year was a matter for the jury, and had little bearing on a discretionary exercise under s 137 of the Evidence Act. In any event, Mr Chan refreshed his memory immediately before the final identification exercise, by listening to the five tapes again.
50 Paragraph [d] has no merit in the present context. It raises a matter for the jury, it is not a matter that would have any relevant weight under s 137.
51 As regards paragraph [e], the fact that Mr Chan gave an "ordinary man's opinion" said nothing about whether he was duly qualified as an ad hoc expert as explained in R v Leung.
52 Paragraph [f] is essentially a jury matter, it is not a matter that would have any relevant weight under s 137.
53 Paragraph [g] is also essentially a jury matter and not one that would have any relevant weight under s 137. In any event, ambiguities in the language are irrelevant to an identification of the voice.
54 As regards paragraph [h], the ascertaining of meaning in the sense referred to was of minor significance in the voice identification exercise. Again, this was essentially a jury issue.
55 I do not understand what relevance paragraph [i] has with regard to the issue of Mr Chan's ability to identify the voices to which he listened.
56 As regards paragraph [j], Mr Chan acquired expertise in the voice of M1 by listening, over and over again, to the five tapes. It was not suggested to him that he could not apply that expertise in identifying the voice on the video tape which was speaking in English. In any event, I see no reason why the cross-lingual element in the comparison that Mr Chan was required to undertake detracted significantly from his ability to express a reliable opinion. Once more, this is, essentially, a jury matter.
57 In oral argument, Mr Nicholson submitted that Mr Chan could not acquire the relevant expertise because he had only undertaken a single act of comparison, namely, that between the voices he identified as M1 on the five tapes and the voice on the video tape. In my view this submission must be rejected. Once Mr Chan acquired the necessary expertise in being able to identify the voice of M1 he was able, as an ad hoc expert, to use that expertise in determining that the voice on the video tape was also that of M1.
58 Mr Nicholson also submitted, orally, that the evidence of Mr Chan was "tainted" as he must have known that the appellant was a suspect when he compared the appellant's voice on the record of interview with the voice of M1 on the five tapes.
59 In making this submission, Mr Nicholson relied on certain remarks made by Simpson J in R v Leung. Simpson J in that case drew attention to an aspect of voice identification evidence that she said might raise "real questions of propriety". Her Honour pointed out that a situation may arise in voice identification that is analogous to physical identification by photographs or by a police line up, "in which care must be taken not to suggest that a particular person is the suspect".
60 While I accept, with respect, that the analogy is capable in certain circumstances of being apt, it has, I think, limitations. It is one thing to produce persons not fundamentally dissimilar from a suspect in a photograph book or a line up. It is another, to produce voices not dissimilar to the voice to be identified. In the present case, for example, the police had five tapes where the identity of the voices was in dispute. Mr Chan had spent considerable time on the five tapes first distinguishing and then identifying, by the use of pseudonyms, the speakers of the words recorded therein. He had to use pseudonyms as the identity of the persons concerned was then unknown. Later, the police obtained a recording of the appellant's voice. They wished to determine whether the appellant's voice was one of the voices recorded in the five tapes. In this situation, I do not see how a means of identification akin to a line up could have been contrived.
61 It is further to be noted that voice identification evidence is to be distinguished from "visual identification evidence" within the meaning of s 114(1) of the Evidence Act. That is because voice identification evidence is not "based wholly or partly on what a person saw", the quoted words being an element of "visual identification evidence" as defined. As is noted by Odgers, Uniform Evidence Law, 5th Ed at para 1.3.9660, the admission of voice identification evidence turns on judicial discretion. See also R v Adler (2000) 116 A Crim R 38.
62 As Shadbolt DCJ noted in giving judgment in the voir dire relating to the admissibility of the voice identification evidence of Mr Chan and Mr Lee:
"The question then is not one which lies within ss 114 to 116 of the Evidence Act 1995 but rather within those sections dealing with opinion evidence, namely ss 76 to 80 which necessarily carry with [them] consideration[s] of discretion under ss 135, 137 and 165 and perhaps 192".
63 In my view, Shadbolt DCJ committed no error in admitting the evidence of Mr Chan.