Multifactor dimensionality reduction (MDR) is a statistical approach, also used in machine learning automatic approaches,[1] for detecting and characterizing combinations of attributes or independent variables that interact to influence a dependent or class variable.[2][3][4][5][6][7][8] MDR was designed specifically to identify nonadditive interactions among discrete variables that influence a binary outcome and is considered a nonparametric and model-free alternative to traditional statistical methods such as logistic regression.
The basis of the MDR method is a constructive induction or feature engineering algorithm that converts two or more variables or attributes to a single attribute.[9] This process of constructing a new attribute changes the representation space of the data.[10] The end goal is to create or discover a representation that facilitates the detection of nonlinear or nonadditive interactions among the attributes such that prediction of the class variable is improved over that of the original representation of the data.
^Ritchie, Marylyn D.; Hahn, Lance W.; Moore, Jason H. (1 February 2003). "Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity". Genetic Epidemiology. 24 (2): 150–157. doi:10.1002/gepi.10218. ISSN1098-2272. PMID12548676. S2CID6335612.
^Cite error: The named reference :1 was invoked but never defined (see the help page).
^Moore, Jason H. (1 January 2010). "Detecting, Characterizing, and Interpreting Nonlinear Gene–Gene Interactions Using Multifactor Dimensionality Reduction". Computational Methods for Genetics of Complex Traits. Advances in Genetics. Vol. 72. pp. 101–116. doi:10.1016/B978-0-12-380862-2.00005-9. ISBN978-0-12-380862-2. ISSN0065-2660. PMID21029850.
^Moore, Jason H.; Gilbert, Joshua C.; Tsai, Chia-Ti; Chiang, Fu-Tien; Holden, Todd; Barney, Nate; White, Bill C. (21 July 2006). "A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility". Journal of Theoretical Biology. 241 (2): 252–261. Bibcode:2006JThBi.241..252M. doi:10.1016/j.jtbi.2005.11.036. PMID16457852.
^Michalski, R (February 1983). "A theory and methodology of inductive learning". Artificial Intelligence. 20 (2): 111–161. doi:10.1016/0004-3702(83)90016-4.