Why is the R-value 0.95 for raw data but drops to 0.85 for normalized data in ANN?

1 view (last 30 days)
Why is the R-value 0.95 for raw data but drops to 0.85 for normalized data in ANN ? does it happen or am I doing something wrong

Accepted Answer

akshatsood
akshatsood on 10 Dec 2023
Edited: akshatsood on 10 Dec 2023
I understand that you are observing an R-value of 0.95 for raw data, which drops to 0.85 for normalized data in your Artificial Neural Network (ANN). I would like to clarify whether these values are from the training dataset or the testing dataset or the validation dataset.
In general, the higher R-value associated with raw data might indeed suggest overfitting, potentially stemming from inadequate feature scaling. By normalizing the data, you are likely achieving more effective feature scaling, which can help prevent overfitting. Consequently, the decrease in the R-value for your Artificial Neural Network model when using normalized data could signify a more robust and generalizable model. This shift in R-value may indicate that the model is now better at capturing underlying patterns within the data, leading to improved generalization.
I hope this helps.
  4 Comments
akshatsood
akshatsood on 10 Dec 2023
Edited: akshatsood on 10 Dec 2023
You can eliminate the data points located at or near the origin by using logical indexing. This method involves extracting a specific portion of the data based on defined conditions, effectively filtering out the unwanted points. Additionally, I recommend reviewing the following articles that go deep into identifying and cleaning outliers.
I hope this helps.

Sign in to comment.

More Answers (0)

Categories

Find more on Sequence and Numeric Feature Data Workflows in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!