Since the introduction of physicochemical descriptors to derive useful QSAR (quantitative structure-activity relationship) models, some regression methods have been applied to linearly correlate dependent (bioactivities) and independent variables. Multiple linear regression (MLR) has been widely used when the number of samples (rows) exceed the amount of descriptors (columns), whilst partial least squares (PLS) is the most commonly applied regression method in 3D QSAR (e.g. CoMFA and related methods), where a large number of descriptors are generated. The recently implemented MIA-QSAR (Multivariate Image Analysis applied to QSAR) method is a especial (not only) case in which the descriptors (pixels) for each active compound result in a three-way array after grouping samples to give a data set. Such array may be properly treated by using N-way methods, such as multilinear PLS (N-PLS) and parallel factor analysis (PARAFAC). However, these methods have not been appropriately explored in QSAR studies, despite their supposed advantages over well established methods. Thus, this review formally details the MIA-QSAR approach prior to presenting two promising multimode methods to be applied on MIA descriptors, namely N-PLS and PARAFAC. Also, the suitability of such methods is discussed in terms of application to a case study (a series of anti-HIV compounds) and comparison to traditional (bilinear) PLS and docking studies.
Keywords: QSAR (quantitative structure-activity relationship), Multiple linear regression (MLR), partial least squares (PLS), parallel factor analysis (PARAFAC), multimode methods, anti-HIV compounds)