Feature/implementation of mimca #163

JulienRoussel77 · 2024-09-27T15:47:03Z

No description provided.

… after docstring summaries.

hlbotterman

General comments:

typing in the functions (not only in docstring)
I do not understand the structure of the 3 files in imputations/mimca folder and utils. There seems to be some redundancies
some tests do not pass
remove comments
create Class

hlbotterman · 2024-09-30T09:41:42Z

qolmat/imputations/mimca/estim_ncpMCA.py

+from tqdm import tqdm
+
+
+def moy_p(V, weights):


do not hesitate to give more explicit name for functions

hlbotterman · 2024-09-30T09:54:39Z

qolmat/imputations/mimca/estim_ncpMCA.py

+    missing_indices = rng.choice(total_values, n_missing, replace=False)
+    row_indices = missing_indices // n_cols
+    col_indices = missing_indices % n_cols
+    for i in range(n_missing):


use a vectorization to avoid loop/
Suggestion :
...
missing_indices = rng.choice(total_values, n_missing, replace=False)
row_indices, col_indices = np.unravel_index(missing_indices, (n_rows, n_cols))
data.values[row_indices, col_indices] = np.nan
return data

hlbotterman · 2024-09-30T09:57:17Z

qolmat/imputations/mimca/estim_ncpMCA.py

+    return df_reconstructed
+
+def imputeMCA(
+    don,


rename "don" ? Not very an explicit name...

hlbotterman · 2024-09-30T09:58:15Z

qolmat/imputations/mimca/estim_ncpMCA.py

+    dict
+        Dictionary containing:
+            - "tab_disj": Disjunctive coded table after imputation.
+            - "completeObs": Complete dataset with missing values imputed.


rename completeObs ? usually, variable names in python do not contain capital letter, but _ if multiple "words"

hlbotterman · 2024-09-30T09:59:06Z

qolmat/imputations/mimca/estim_ncpMCA.py

+            - "completeObs": Complete dataset with missing values imputed.
+
+    """
+    don = pd.DataFrame(don)


why pd.DataFrame(don) if don is already a DataFrame (specified as such in docstring) ?

hlbotterman · 2024-09-30T10:13:10Z

qolmat/imputations/mimca/imputer_mca.py

+        if (
+            not pd.api.types.is_numeric_dtype(don[col])
+            or don[col].dtype == "bool"
+        ):  # noqa: E501


no need of E501

hlbotterman · 2024-09-30T10:14:36Z

qolmat/imputations/mimca/imputer_mca.py

+            - "completeObs": Complete dataset with missing values imputed.
+
+    """
+    # Ensure the data is a DataFrame


suggestion: since it is expected a DataFrame in input, check the type of "don". If not dataframe, report in a log and convert it.

hlbotterman · 2024-09-30T10:15:21Z

qolmat/imputations/mimca/imputer_mca.py

+        Z = Z.subtract(Z_mean, axis=1)
+        Zscale = Z.multiply(np.sqrt(M), axis=1)
+
+        print("Centered and scaled data (Zscale):")


not fan of print.
Use logging instead

hlbotterman · 2024-09-30T10:16:19Z

qolmat/imputations/mimca/mimca.py

@@ -0,0 +1,665 @@
+import numpy as np


add general documentation for the file.

hlbotterman · 2024-09-30T10:21:04Z

qolmat/imputations/mimca/mimca.py

+
+    """
+    if verbose:
+        print(f"{print_msg}...", end="", flush=True)


logging instead of print

hlbotterman · 2024-09-30T12:12:23Z

qolmat/utils/algebra.py

+    else:
+        row_w = np.array(row_w, dtype=float)
+        row_w /= row_w.sum()
+    ncp = int(min(ncp, X.shape[0] - 1, X.shape[1]))


why shape[0] - 1 ?
I think your tests do not passed because of that.

Yasser Zidani and others added 5 commits September 23, 2024 16:58

test commit

6219054

modifying gitignore

80f82e4

✨🔧 Adapt code to comply with Ruff linter (D205) by adding blank lines…

edb1a38

… after docstring summaries.

✅ Adding MIMCA and Estim_ncpMCA codes to the branch --

524901c

🧪 Add unit tests for svdtriplet function in algebra.py

89de45c

JulienRoussel77 marked this pull request as ready for review September 27, 2024 15:47

hlbotterman reviewed Sep 30, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/implementation of mimca #163

Feature/implementation of mimca #163

JulienRoussel77 commented Sep 27, 2024

hlbotterman left a comment •

edited

Loading

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

hlbotterman Sep 30, 2024

Feature/implementation of mimca #163

Are you sure you want to change the base?

Feature/implementation of mimca #163

Conversation

JulienRoussel77 commented Sep 27, 2024

hlbotterman left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hlbotterman left a comment •

edited

Loading