torch.load(..., weights_only=True) currently raises a Deprecation warning + [proposal] weights_only=True should become default for safe legacy-loading pickles #52181
Comments
+1 for building on standardized formats. Zarr may be nicer than HDF5 because it's better in cloud environments and doesn't need such heavyweight libraries to support it. The one interesting thing I noticed in https://pytorch.org/docs/stable/notes/serialization.html is that when saving multiple tensors together, their view relationships get preserved. That will require something a little custom, perhaps.
Advantages of HDF5: mature and has bindings for all languages; libhdf5 is relatively easy to compile from source and has no dependencies; natively supports hierarchical named sub-arrays.
There also seems to be some confusion about what this is for. Local checkpointing makes perfect sense, and for that pickling is just fine. But pickling isn't guaranteed to be portable across Python versions, so it shouldn't be used for publishing models for download like that.
Does this matter? The current use cases for it...
Exchange somewhat matters, since at times people will be re-implementing models in, say, TFv2 or whatever new flavor of JAX, and want to consume older weights without depending on the other framework (i.e. h5py is a less intrusive dependency than full PyTorch). I fought this a lot when consuming Caffe weights from PyTorch :( or TFv1 weights from PyTorch. I wish frameworks thought more about this interop :(
Yes, that makes sense. Data exchange certainly matters. But then you wouldn't want features like storing the state of modules or preserving view relationships (or odd strides, etc.). You'd just want one or more tensors in a well-optimized and portable binary format. Import/export capability for both HDF5 and Zarr would be nice for that.
No, it doesn't really. Imagine a proprietary app we don't trust running in a heavily sandboxed environment, and our own app consuming the model produced by that proprietary app. In this case it would be a sandbox escape.
People will still use pickle if they have the opportunity, at least because it is the laziest possible solution to the problem of storing models and app settings. Deprecated or not, if it works for them, they will use it. So the only sensible way, if we want to get rid of the insecurity at all, is to make it unusable for them: either make them a little less lazy, or let them go and trouble the users of other frameworks' ecosystems with their insecure code.
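To make the risk concrete, here is a minimal, self-contained sketch (plain Python, nothing PyTorch-specific) of why loading an untrusted pickle is code execution: the `__reduce__` hook lets the payload name any callable to be invoked at load time.

```python
import pickle

class Payload:
    def __reduce__(self):
        # Unpickling calls the returned callable with the given args.
        # Here it's a harmless eval; a real attack could name os.system.
        return (eval, ("1 + 1",))

blob = pickle.dumps(Payload())
result = pickle.loads(blob)  # eval("1 + 1") runs during load
print(result)  # prints 2
```

The callable's name (`builtins.eval` here) is stored right in the byte stream, which is also why an allowlist of importable globals is enough to block this class of payload.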
That example has nothing to do with local checkpointing. It's model exchange, for which indeed there is ONNX.
Well, these "local checkpoints" end up as the weights files for all torchvision models... whereas those could have used ONNX...
This still sounds confusing. "Weights files" contain, I assume, data. ONNX is "neural network exchange": it's for storing models. Pickling has yet again different tradeoffs; it can capture state for PyTorch models in ways that ONNX cannot (even aside from ONNX's incompleteness, because ONNX has to work cross-library while pickling can be PyTorch-specific). It looks to me like, to disentangle this, we need to clearly separate these three things.
I guess it shouldn't capture the whole state, but only a precisely defined subset of state that can be restored without executing any Turing-complete code stored within the "checkpoint" (and that cannot be escalated to execution of Turing-complete code; no Turing-complete code should be stored there at all). If someone has to store Turing-complete code that must be executed within the data even for such cases, it means they are too lazy to implement the functionality properly. If one has no other choice than to do that, it feels like something is wrong with the system. So, is there any real necessity ("it absolutely cannot be done without Turing-complete deserialization, even if we spent a year of full-time work on redesigning the code and writing auxiliary code that currently is not needed because...")?
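A minimal sketch of the "data only, no code" idea this comment argues for: if the checkpoint is restricted to plain values (here JSON; the field names are invented for illustration), restoring it never executes anything from the file.

```python
import json

# Hypothetical declarative checkpoint: plain data only, no callables.
state = {"layers": [{"name": "fc1", "shape": [784, 128]}], "step": 42}

blob = json.dumps(state)
restored = json.loads(blob)  # pure data decoding, no code execution
assert restored == state
```

The restoring side then maps these plain values onto objects with its own, already-trusted code, instead of letting the file pick which code runs.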
@KOLANICH as I pointed out in my first comment, it's about things like "when saving multiple tensors together, their view relationships get preserved". Turing completeness is beside the point.
You made your point: you dislike it. That doesn't negate the point that these relationships still need to be preserved.
That made the pretrained models available on the net (produced by some large corps, BTW) completely useless without the additional effort of reverse-engineering the pickle files to make sure there are no backdoors within them.
I wonder how complex the relations are and whether they really can be preserved, e.g. by keeping them in a JSON-like dict serialized into some serialization format for JSON-like dicts and restored with custom code.
I absolutely agree. I just want to know how complex the task is and how much work it involves. I have also created a framework that may (or may not) simplify the task.
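As a rough sketch of what this comment suggests, view relationships could indeed be flattened into a JSON-like manifest: every tensor records which storage it reads plus (offset, shape, stride), so two views of one buffer simply point at the same storage id. All names here are invented for illustration, not PyTorch's actual format.

```python
import json

manifest = {
    "storages": {"s0": {"dtype": "float32", "numel": 12}},
    "tensors": {
        # Two entries sharing one buffer: a 3x4 matrix and its transpose.
        "weight":   {"storage": "s0", "offset": 0, "shape": [3, 4], "stride": [4, 1]},
        "weight_t": {"storage": "s0", "offset": 0, "shape": [4, 3], "stride": [1, 4]},
    },
}

restored = json.loads(json.dumps(manifest))
# The view relationship survives the round trip as plain data.
assert restored["tensors"]["weight"]["storage"] == restored["tensors"]["weight_t"]["storage"]
```

The loader would allocate each storage once and materialize every tensor entry as a view over it, so no Turing-complete code needs to ride along in the file.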
When you pay attention to it, there's indeed a lot of it around. I searched all repos for other formats and asked a few people. There's not much, only a couple of examples using HDF5 like https://github.com/pytorch/fairseq/blob/89a4d2bc70fd680c4768803d20707ef65df89b0f/examples/wav2vec/wav2vec_featurize.py#L95, and a set of issues and Discourse posts with discussions around correct usage.
It looks like you're using JSON - that's not a great alternative, it'll be way too slow compared to binary formats.
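A quick size comparison backs this up: a raw binary buffer (which is essentially what a tensor payload is) is considerably more compact than the same floats as JSON text, before even counting parsing cost. The numbers depend on the values chosen; this is just an illustration.

```python
import array
import json
import pickle

floats = [float(i) for i in range(100_000)]

as_json = json.dumps(floats).encode()
as_pickle = pickle.dumps(floats, protocol=pickle.HIGHEST_PROTOCOL)
as_binary = array.array("d", floats).tobytes()  # 8 bytes per float64

print(len(as_json), len(as_pickle), len(as_binary))
# The binary form is exactly 800,000 bytes; the JSON text is larger,
# and decoding it back also costs far more CPU than a bulk binary read.
```

Formats like HDF5 and Zarr store the raw buffer (optionally compressed) and keep only the metadata as structured text, getting the best of both.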
Right now this isn't actionable; it needs deciding what to do first.
And the .pth extension is extra bad, see #14864.
Not quite. The framework...
For serialization it entirely relies on...
It was a big mistake to give the format its own extension. If it had been given...
TBH, I usually use TensorFlow (because it has built-in complex numbers). When I had to use one pretrained model and it turned out the code was for PyTorch, I only detected that the files were pickle because I have a lot of experience dealing with Python projects and know that when most projects need to ship some data with code, or to serialize state, they just rely on pickle instead of developing a format with serialization and parsing routines.
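The "you only recognize it as a pickle if you've seen one before" point can be made mechanical: a bare pickle stream (protocol 2+) starts with the PROTO opcode 0x80 followed by the protocol number, so a .pth file that is a raw pickle is easy to fingerprint. (The newer torch.save output is a zip archive containing a pickle, so it starts with "PK" instead.)

```python
import pickle

blob = pickle.dumps({"weights": [1.0, 2.0]}, protocol=2)

# PROTO opcode (0x80) followed by the protocol version byte.
assert blob[:2] == b"\x80\x02"
```

A dedicated, self-describing extension (or magic string) would have made this obvious without insider knowledge.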
I am currently organizing a NeurIPS competition, in which participants might be submitting PyTorch models to our evaluation server for the leaderboard. Is there a secure way of serializing/loading untrusted PyTorch models? Are there alternatives to pickle, which can be insecure? Is there a PyTorch model format we can insist upon that is secure? I believe that ONNX and TensorFlow use protobuf, which avoids the security issues of pickling; that's an alternative to HDF5 to consider. I see that this is related to #52596.
https://github.com/CensoredUsername/picklemagic may be helpful for implementing your own loader for .pth files in a controlled way.
@KOLANICH thank you, but that repo is five years old with a single contributor. It's hard to trust abandonware that might not even work with the latest pickle format.
I have recently used it, though not for PyTorch files. Tooling dated this year didn't work; that library did.
@KOLANICH have you used it recently? |
In March. In some Python package a very suspicious pickle was inlined... though it turned out to be benign.
Kaitai Struct has a Rust target (a C++ target is also present and is thought to be secure, though I'm not sure anyone has really checked it; the C target is very immature; and none of the targets (except the Java one, to some minor extent) currently support serialization).
Might be good to have safe mode default for pytorch 2.0 |
Is weights_only=True default in PyTorch 2.0 now? |
related #97495 |
Putting in a request for weights_only=True support for OrderedDict. Edit: looks like collections.OrderedDict is supported but typing.OrderedDict is not... the joys of user-submitted models.
Okay, I went down the rabbit hole on this. It looks like typing.OrderedDict was used in state_dict() in earlier versions of torch like 1.9. Note that this was first changed to collections.OrderedDict and then finally just Dict. Many issues are seemingly somewhat related to this: #72778. So in summary, right now I don't think safe loading is compatible with models trained using torch==1.9, and safe loading will fail on floats for torch<2.0.0. Adding typing.OrderedDict to the allowed pickle types would enable backwards compatibility with models trained using torch==1.9.
Adding typing.OrderedDict to the allowed list shouldn't be too hard. Send us a patch?
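The general shape of such an allowlist can be sketched with the standard library's `pickle.Unpickler.find_class` hook, which is the same mechanism `weights_only=True` builds on. This is a simplified illustration, not PyTorch's actual implementation; the class name and the allowlist contents here are made up.

```python
import collections
import io
import pickle

class WeightsOnlyUnpickler(pickle.Unpickler):
    # Globals permitted during load; everything else is rejected.
    ALLOWED = {
        ("collections", "OrderedDict"),
        ("typing", "OrderedDict"),  # the alias seen in torch==1.9 state_dicts
    }

    def find_class(self, module, name):
        if (module, name) in self.ALLOWED:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"blocked global: {module}.{name}")

def safe_loads(data: bytes):
    return WeightsOnlyUnpickler(io.BytesIO(data)).load()

# An allowed container round-trips fine...
ok = safe_loads(pickle.dumps(collections.OrderedDict(a=1.0)))

# ...while a pickled callable is rejected at load time.
try:
    safe_loads(pickle.dumps(eval))
    blocked = False
except pickle.UnpicklingError:
    blocked = True
```

Extending backward compatibility then amounts to adding one more `(module, name)` pair to the allowlist, which is why the fix suggested above is small.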
I hope that TorchServe loads models with weights_only=True. Otherwise, one untrusted model can gain access to the server (e.g. one team's model can steal private data from another team's model).
Use the same strategy as for the unsafe pickler, i.e. use a dummy `torch.serialization.StorageType` to represent legacy typed storage classes during deserialization. Add a `_dtype` property to be able to use it for both new- and legacy-format deserialization. Parametrize `test_serialization_new_format_old_format_compat`.

Add a regression test to validate that loading legacy modes can be done without any warnings.

Before the change:
```
% python test_serialization.py -v -k test_serialization_new_format_old_format_compat_
test_serialization_new_format_old_format_compat_cpu (__main__.TestBothSerializationCPU) ... ok
test_serialization_new_format_old_format_compat_safe_cpu (__main__.TestBothSerializationCPU) ... /Users/nshulga/git/pytorch/pytorch/torch/_utils.py:836: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  return self.fget.__get__(instance, owner)()
ok

----------------------------------------------------------------------
Ran 2 tests in 0.116s

OK
```

Without the change, but with the test updated to catch warnings:
```
% python test_serialization.py -v -k test_serialization_new_format_old_format_compat_
test_serialization_new_format_old_format_compat_weights_only_False_cpu (__main__.TestBothSerializationCPU) ... ok
test_serialization_new_format_old_format_compat_weights_only_True_cpu (__main__.TestBothSerializationCPU) ... FAIL

======================================================================
FAIL: test_serialization_new_format_old_format_compat_weights_only_True_cpu (__main__.TestBothSerializationCPU)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/nshulga/git/pytorch/pytorch/torch/testing/_internal/common_utils.py", line 2536, in wrapper
    method(*args, **kwargs)
  File "/Users/nshulga/git/pytorch/pytorch/torch/testing/_internal/common_device_type.py", line 415, in instantiated_test
    result = test(self, **param_kwargs)
  File "/Users/nshulga/git/pytorch/pytorch/test/test_serialization.py", line 807, in test_serialization_new_format_old_format_compat
    self.assertTrue(len(w) == 0, msg=f"Expected no warnings but got {[str(x) for x in w]}")
AssertionError: False is not true : Expected no warnings but got ["{message : UserWarning('TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()'), category : 'UserWarning', filename : '/Users/nshulga/git/pytorch/pytorch/torch/_utils.py', lineno : 836, line : None}"]

To execute this test, run the following from the base repo dir:
    python test/test_serialization.py -k test_serialization_new_format_old_format_compat_weights_only_True_cpu

This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0

----------------------------------------------------------------------
Ran 2 tests in 0.109s

FAILED (failures=1)
```

Fixes the problem reported in #52181 (comment).

Pull Request resolved: #113614
Approved by: https://github.com/kit1980, https://github.com/albanD
Of course, the real problem there is that execution of internet-published code is the desired scenario. But I think this should at least push PyTorch to make weight loading safe by default... and maybe have an audit of where PyTorch loads unsafe pickles, at least to force the user to opt in via a mandatory argument.
Also quite in favor of something like this ^
Given the recent popularity of the "backdoor" theme in general, hoping this happens...
@mikaylagawarecki @msaroufim So this issue can now be closed as completed? :) If this is merged:
When I tried to upload the pickle file to the HF Space for my ensemble model, on prediction it shows an HF picklescan error. Can you provide any solutions? This is my requirements.txt file: joblib...
@Jannat20242NSU that looks like an issue that should be filed in the respective Hugging Face repository that shows the picklescan error.
Closing as complete since this is now the default.
Maybe an option for torch.load that doesn't allow arbitrary unpickling, and thus arbitrary code execution?
Yes, one should not load/run code from unknown locations, but sometimes intermediate controls could be good: e.g. allowing only known types to be loaded, such as tensors (and not model instances or other things), bypassing the generic unpickling mechanism.
(I've long been a proponent of standardized formats for weight storage such as HDF5, but this didn't get traction.)
cc @ezyang @gchanan @zou3519 @mruberry @nairbv @NicolasHug @vmoens @jdsgomes @bdhirsh @jbschlosser @anjali411 @ailzhang
Also, the popularity of the HuggingFace hub (and the existing torch.hub) makes this more acute. At some point we will have a malicious model uploaded there that becomes popular on Twitter, e.g. because it composites very cute cats into existing images. The malicious model can at least hijack some precious GPU compute, and at worst take over institute/company computer networks.