Hi Kim,
The design matrix X that you've obtained from the rowexch function in MATLAB represents the coded levels for each factor in your experimental design, including the interactions and possibly other polynomial terms, depending on the options you've set for the experimental design. Here's a breakdown of how to interpret the columns of X:
- Column 1 (Constant Term): This column is filled with ones and represents the intercept term in the model. It is included so that the model can fit a mean value for the response variable when all other factors are at their reference levels (usually coded as zero).
- Column 2-4 (Categorical Factor Levels): Since you have a categorical factor with 3 levels, these columns represent the dummy coding for the categorical factor. Dummy coding is a way to include categorical variables in regression models by converting the categories into binary columns. Since you have 3 levels, you need 2 columns to represent them (since one level can be inferred when the other two are zero). However, it seems there is a discrepancy because you have three columns filled with zeros and ones, which might be an error or a misunderstanding. Normally, for 3 levels, you would have only 2 columns for dummy coding.
- Column 5-7 (Continuous Factors): These columns represent the coded levels for the continuous factors. The coding is usually done to center the factors around zero to simplify the interpretation of main effects and interactions. The coding might be -1, +1 for a two-level factor or could include other levels if more than two levels are used.
- Column 8-10 (Interactions and Higher-Order Terms): These columns typically represent the interaction terms and possibly quadratic or higher-order terms for the continuous factors. Interaction terms are the product of the coded levels of two factors and represent the combined effect of those factors on the response variable. Quadratic or higher-order terms represent the non-linear effects of the factors on the response variable.
To clarify, the expected structure for your design matrix might look something like this:
- Column 1: Constant term (intercept)
- Columns 2-3: Dummy-coded categorical factor (2 columns for 3 levels)
- Columns 4-6: Continuous factors (3 columns for 3 factors)
- Columns 7-9: Two-factor interactions (if you're including interactions between all pairs of factors, you'd expect 3 choose 2 = 3 interaction terms)
- Additional columns: Any higher-order terms or additional interactions specified in the design
To resolve the discrepancy and correctly interpret your design matrix, ensure that the design is specified correctly in MATLAB and that you understand the coding scheme used for the categorical and continuous factors.