- How to make type cast for python custom class
- 6 Answers 6
- Example
- When is it OK to do this?
- Why is this Dangerous?
- Safer Alternative
- Python Type Conversion
- Python Implicit Type Conversion
- Example 1: Converting integer to float
- Explicit Type Conversion
- Example 2: Addition of string and integer Using Explicit Conversion
- Key Points to Remember
- Table of Contents
How to make type cast for python custom class
This is an experiment. I want to insert a class instance to MongoDB. But can’t do it directly, so i had to convert it to a string.. Now, i need to convert it back.
You are looking at the wrong conversion, what you want is object serialization , do not use str(..) for that.
If you want to recreate the object from the string, you should be using its repr not str form. If you want to put it in a database, why not have a table where each row is an instance and each field is an instance attribute?
6 Answers 6
For those who are looking for overriding conversion builtins such as int(obj) , float(obj) , and str(obj) , see Overload int() in Python. You need to implement __int__ , __float__ , or __str__ on the object.
To answer the question, one way of doing this is by «abusing» __repr__ in combination with eval() . Let’s first have a look at the __repr__ docs (emphasis: mine):
Called by the repr() built-in function to compute the “official” string representation of an object. If at all possible, this should look like a valid Python expression that could be used to recreate an object with the same value (given an appropriate environment). If this is not possible, a string of the form <. some useful description. >should be returned. The return value must be a string object. If a class defines __repr__() but not __str__() , then __repr__() is also used when an “informal” string representation of instances of that class is required.
This is typically used for debugging, so it is important that the representation is information-rich and unambiguous.
With this in mind, we know that it is recommended to return a string from __repr__ which can be used with eval() . This is implied by the statement that the value «should look like a valid Python expression».
Example
Here is an example which uses this. The example also overrides __eq__ , but only for convenience for the print-outs. And for completeness we also add a value to the instance.
The example creates a new instance. Then the value is converted to a string using __repr__ (by using the repr() function. Next that string value is passed to eval() which will evaluate the string and return the result. The result will be a new instance of the same class and is stored in second_instance . We also print out the id() to visualise that we have indeed two different instances. Finally we show that first_instance == second_instance is indeed True :
class MyClass: def __init__(self, value): self.value = value def __eq__(self, other): return isinstance(self, MyClass) and self.value == other.value def __repr__(self): return '%s(%r)' % (self.__class__.__name__, self.value) first_instance = MyClass(123) print('First instance: repr=%r, % (first_instance, id(first_instance))) stringified = repr(first_instance) print('Stringified: %r' % stringified) second_instance = eval(stringified) # . DANGEROUS (see below) . print('Second instance: repr=%r, % (second_instance, id(second_instance))) print('First == Second: %r' % (first_instance == second_instance))
When is it OK to do this?
This is 100% acceptable if absolutely everything going into eval() is under your control! This means:
- The scope in which eval() is called is under your control
- No place in the evaluated string should contain data coming from outside sources. Outside sources include:
- Database values
- User-Input
- Data read from disk
- . basically any I/O
Keeping all this in mind and guaranteeing that at no point in the future of the project I/O will end up in an eval() call is almost impossible. As such I strongly recommend avoiding this in important production code as it opens up nasty security holes.
For code not running in production, this is absolutely acceptable. For example unit-tests, personal utility scripts, e.t.c. But the risk should always be taken into consideration.
Why is this Dangerous?
- The code passed into eval() is executed inside the Python process calling it, with the same privileges. Example: You read a value from a DB where multiple users have access and you eval() it. In that case, another user may inject code via the database and that code will run as your user!
- Using eval() when the values come from outside sources opens up the possibility of code-injections.
- It is not guaranteed that repr() will return a valid Python expression. This is only a recommendation by the docs. Hence the call to eval with __repr__ is prone to run-time errors.
- In the example above, the scope calling eval() needs to «know» about the class MyClass (it must be imported). It only looks for the name. So if by pure chance that same name exists in the scope, but pointing to another object, you will call something else unintentionally and may run into weird bugs. Granted, this is an edge-case.
Safer Alternative
Use one of the many available serialisation options. The most popular, and simplest one to use is to convert the object to/from JSON strings. The above example could be made safe like this:
import json class MyClass: @staticmethod def from_json(document): data = json.loads(document) instance = MyClass(data['value']) return instance def __init__(self, value): self.value = value def __eq__(self, other): return isinstance(self, MyClass) and self.value == other.value def __repr__(self): return '%s(%r)' % (self.__class__.__name__, self.value) def to_json(self): data = < 'value': self.value >return json.dumps(data) first_instance = MyClass(123) print('First instance: repr=%r, % (first_instance, id(first_instance))) stringified = first_instance.to_json() print('Stringified: %r' % stringified) second_instance = MyClass.from_json(stringified) print('Second instance: repr=%r, % (second_instance, id(second_instance))) print('First == Second: %r' % (first_instance == second_instance))
This is only marginally more difficult but much safer.
The same approach can be used with other serialisation methods. Popular formats are:
- XML
- YAML
- ini/cfg files
- pickle (note that this uses bytes instead of text as serialisation medium).
- MessagePack (note that this uses bytes instead of text as serialisation medium).
- Custom Implementation
- .
Python Type Conversion
In programming, type conversion is the process of converting data of one type to another. For example: converting int data to str .
There are two types of type conversion in Python.
- Implicit Conversion — automatic type conversion
- Explicit Conversion — manual type conversion
Python Implicit Type Conversion
In certain situations, Python automatically converts one data type to another. This is known as implicit type conversion.
Example 1: Converting integer to float
Let’s see an example where Python promotes the conversion of the lower data type (integer) to the higher data type (float) to avoid data loss.
integer_number = 123 float_number = 1.23 new_number = integer_number + float_number # display new value and resulting data type print("Value:",new_number) print("Data Type:",type(new_number))
In the above example, we have created two variables: integer_number and float_number of int and float type respectively.
Then we added these two variables and stored the result in new_number .
As we can see new_number has value 124.23 and is of the float data type.
It is because Python always converts smaller data types to larger data types to avoid the loss of data.
- We get TypeError , if we try to add str and int . For example, ’12’ + 23 . Python is not able to use Implicit Conversion in such conditions.
- Python has a solution for these types of situations which is known as Explicit Conversion.
Explicit Type Conversion
In Explicit Type Conversion, users convert the data type of an object to required data type.
We use the built-in functions like int() , float() , str() , etc to perform explicit type conversion.
This type of conversion is also called typecasting because the user casts (changes) the data type of the objects.
Example 2: Addition of string and integer Using Explicit Conversion
num_string = '12' num_integer = 23 print("Data type of num_string before Type Casting:",type(num_string)) # explicit type conversion num_string = int(num_string) print("Data type of num_string after Type Casting:",type(num_string)) num_sum = num_integer + num_string print("Sum:",num_sum) print("Data type of num_sum:",type(num_sum))
Data type of num_string before Type Casting: Data type of num_string after Type Casting: Sum: 35 Data type of num_sum:
In the above example, we have created two variables: num_string and num_integer with str and int type values respectively. Notice the code,
num_string = int(num_string)
Here, we have used int() to perform explicit type conversion of num_string to integer type.
After converting num_string to an integer value, Python is able to add these two variables.
Finally, we got the num_sum value i.e 35 and data type to be int .
Key Points to Remember
- Type Conversion is the conversion of an object from one data type to another data type.
- Implicit Type Conversion is automatically performed by the Python interpreter.
- Python avoids the loss of data in Implicit Type Conversion.
- Explicit Type Conversion is also called Type Casting, the data types of objects are converted using predefined functions by the user.
- In Type Casting, loss of data may occur as we enforce the object to a specific data type.
Table of Contents