Capturing the distinction between translated vs. original (i.e. non-translated) language varieties holds centre stage in corpus-based translation studies and related fields such as supervised machine learning. A similar question also holds for native vs. proficient non-native speakers' production. In both cases, the linguistic features that seem to be good indicators of the different language varieties appear to be genre-dependent. In the articles included in this volume, genre-controlled multilingual corpora are used to identify and measure two competing properties of both translational and non-native language varieties: (i) source (or native) language interference and (ii) normalization, which can be described as a tendency to fit into target-language standards. The topics addressed include linguistic features to uncover and quantify interference and normalization within a specific genre and across genres, the complementarity of comparable vs. parallel corpus data and experimental vs. corpus data to investigate interference and normalization, and the extraction of highly similar and homogeneous comparable and parallel corpora from multilingual resources such as Europarl. A wide range of genres are examined, e.g. research articles abstracts, parliamentary debates, administrative texts, fiction.
1. Interference and normalization in genre-controlled multilingual corpora: Introduction (by Lefer, Marie-Aude); 2. Using the Europarl corpus for cross-linguistic research (by Cartoni, Bruno); 3. Applying a multidimensional, register-sensitive approach to visualize normalization in translated and non-translated Dutch (by Delaere, Isabelle); 4. Close encounters of the third code: Quantitative vs. Qualitative analyses in corpus-based translation studies (by Loock, Rudy); 5. Identifying translation features in multi-word lexical units (by Sanchez, Maribel Tercedor); 6. Native and non-native English abstracts in contrast: A multidimensional move analysis (by Xiao, Richard)