Description
Modin version checks
-
I have checked that this issue has not already been reported.
-
I have confirmed this bug exists on the latest released version of Modin.
-
I have confirmed this bug exists on the main branch of Modin. (In order to do this you can follow this guide.)
Reproducible Example
import os
os.environ['MODIN_STORAGE_FORMAT'] = 'Native'
os.environ['MODIN_ENGINE'] = 'Native'
from modin.config import Backend
assert Backend.get() == "Pandas", Backend.get()
Issue Description
If StorageFormat
or Engine
has not already been set, either by Modin's automatic engine choice or by the user, and MODIN_STORAGE_FORMAT
or MODIN_ENGINE
, respectively, has been set to a valid choice, Modin will take the config value from the respective environment variable. However, when it does this, it doesn't know to update Engine
accordingly.
Similarly, MODINE_BACKEND
does set the modin.config.Backend
variable, but it doesn't propagate the new backend to Engine
or StorageFormat
.
This issue blocks #7459 because the modin integration tests use environment variables to set execution.
Expected Behavior
Setting MODIN_BACKEND
before Modin or the user has chosen an execution should set the appropriate storage format and engine. Likewise, settingMODIN_STORAGE_FORMAT
or MODIN_ENGINE
before Modin or the user has chosen an execution should set Backend
to the correct value.
Error Logs
No response
Installed Versions
INSTALLED VERSIONS
commit : c114e7b
python : 3.9.21
python-bits : 64
OS : Darwin
OS-release : 24.3.0
Version : Darwin Kernel Version 24.3.0: Thu Jan 2 20:24:23 PST 2025; root:xnu-11215.81.4~3/RELEASE_ARM64_T6020
machine : arm64
processor : arm
byteorder : little
LC_ALL : None
LANG : en_US.UTF-8
LOCALE : en_US.UTF-8
Modin dependencies
modin : 0.32.0+19.gc114e7b0
ray : 2.40.0
dask : 2024.8.0
distributed : 2024.8.0
pandas dependencies
pandas : 2.2.3
numpy : 1.26.4
pytz : 2024.2
dateutil : 2.8.2
pip : 24.2
Cython : None
sphinx : 5.3.0
IPython : 8.18.1
adbc-driver-postgresql: None
adbc-driver-sqlite : None
bs4 : 4.12.3
blosc : None
bottleneck : None
dataframe-api-compat : None
fastparquet : 2024.11.0
fsspec : 2024.12.0
html5lib : None
hypothesis : None
gcsfs : None
jinja2 : 3.1.5
lxml.etree : 5.3.0
matplotlib : 3.9.4
numba : None
numexpr : 2.10.2
odfpy : None
openpyxl : 3.1.5
pandas_gbq : 0.26.1
psycopg2 : 2.9.10
pymysql : None
pyarrow : 19.0.0
pyreadstat : None
pytest : 8.3.4
python-calamine : None
pyxlsb : None
s3fs : 2024.12.0
scipy : 1.13.1
sqlalchemy : 2.0.37
tables : 3.9.2
tabulate : 0.9.0
xarray : 2024.7.0
xlrd : 2.0.1
xlsxwriter : None
zstandard : None
tzdata : 2024.2
qtpy : None
pyqt5 : None