API/BUG: support for operations with custom objects / object dtype

Consider the following dummy example of a custom Python object that supports some arithmetic operations:

```python
class MyObject:
    def __init__(self, val):
        self.val = val
    def __add__(self, other):
        if hasattr(other, "dtype"):
            return NotImplemented
        return MyObject(self.val + other)
    def __radd__(self, other):
        if hasattr(other, "dtype"):
            return NotImplemented
        return MyObject(other + self.val)
    def __repr__(self):
        return f"<MyObject({self.val})>"
```
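To make the role of the `hasattr(other, "dtype")` check concrete, here is a minimal pure-Python sketch of the deferral mechanism. `ToyContainer` is a hypothetical stand-in for an array or Series, not a pandas or NumPy class. The only assumption is that the container exposes a `dtype` attribute and implements the operation element-wise. `MyObject` is repeated so the snippet runs on its own.

```python
class MyObject:
    """Same class as above, repeated so this sketch is self-contained."""
    def __init__(self, val):
        self.val = val
    def __add__(self, other):
        if hasattr(other, "dtype"):
            return NotImplemented
        return MyObject(self.val + other)
    def __radd__(self, other):
        if hasattr(other, "dtype"):
            return NotImplemented
        return MyObject(other + self.val)
    def __repr__(self):
        return f"<MyObject({self.val})>"


class ToyContainer:
    """Hypothetical array-like; it only mimics having a `dtype` attribute."""
    dtype = "object"

    def __init__(self, values):
        self.values = list(values)

    def __add__(self, other):
        # container + scalar: apply the scalar operation to each element
        return ToyContainer(v + other for v in self.values)

    def __radd__(self, other):
        # scalar + container: Python only calls this after the scalar's
        # __add__ has returned NotImplemented
        return ToyContainer(other + v for v in self.values)


box = ToyContainer([1, 2])
left = MyObject(10) + box   # MyObject.__add__ defers, ToyContainer.__radd__ runs
right = box + MyObject(10)  # ToyContainer.__add__ runs, using MyObject.__radd__ per element
print(left.values)
print(right.values)
```

Returning `NotImplemented` for anything with a `dtype` is what lets the container, rather than the scalar, decide how to assemble the result.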

Working with such objects in pandas containers generally works, i.e. we defer to the scalar operation and assemble the results:

```python
import numpy as np
import pandas as pd

# operations between a numeric Series and a MyObject scalar / array / Series
arr = np.array([MyObject(1), MyObject(2)])
ser = pd.Series([1, 2])

print(ser + arr[0])
print(ser + arr)
print(ser + pd.Series(arr))
print(arr[0] + ser)
print(arr + ser)
print(pd.Series(arr) + ser)
```

When the other operand contained strings, this worked in pandas 2.x as well:

```python
# operation of scalar/arr/series with string (object) other
arr = np.array([MyObject("a"), MyObject("b")])
ser = pd.Series(["1", "2"])

print(ser + arr[0])
print(ser + arr)
print(ser + pd.Series(arr))
print(arr[0] + ser)
print(arr + ser)
print(pd.Series(arr) + ser)
```

However, this case no longer works in pandas 3.0 with the default `str` dtype.

We specifically fixed this for `pathlib.Path` objects in the scalar case (https://github.com/pandas-dev/pandas/issues/61940 / https://github.com/pandas-dev/pandas/pull/62229), but then got a report for such objects in a Series (https://github.com/pandas-dev/pandas/issues/63832). While we could "fix" this again specifically for Path objects, it is a more general issue that affects any generic Python object.

I think in general pandas has always been quite flexible in supporting custom objects, and IMO we should continue to do that (yes, you can define ExtensionDtypes for full control over handling of custom objects, but that is often overkill). Once we detect something we cannot infer to a non-object dtype, we can use a slower element-wise code path. IMO we should keep doing that for newer dtypes as well (such as the new `str` dtype).
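As a rough illustration of what such an element-wise code path amounts to, here is a pure-Python sketch. `elementwise_op` is a hypothetical helper written for this issue and is not how pandas implements it internally. It simply applies the scalar operation pairwise (broadcasting a scalar operand) and collects the results in a list.

```python
import operator


class MyObject:
    """Same class as in the example at the top, repeated to keep this runnable."""
    def __init__(self, val):
        self.val = val
    def __add__(self, other):
        if hasattr(other, "dtype"):
            return NotImplemented
        return MyObject(self.val + other)
    def __radd__(self, other):
        if hasattr(other, "dtype"):
            return NotImplemented
        return MyObject(other + self.val)
    def __repr__(self):
        return f"<MyObject({self.val})>"


def elementwise_op(op, left, right):
    """Hypothetical fallback: apply `op` per element, broadcasting a scalar."""
    left_is_seq = isinstance(left, (list, tuple))
    right_is_seq = isinstance(right, (list, tuple))
    if left_is_seq and right_is_seq:
        if len(left) != len(right):
            raise ValueError("operands must have the same length")
        return [op(a, b) for a, b in zip(left, right)]
    if left_is_seq:
        return [op(a, right) for a in left]
    if right_is_seq:
        return [op(left, b) for b in right]
    return op(left, right)


strings = ["1", "2"]
objs = [MyObject("a"), MyObject("b")]

# str + MyObject: str.__add__ returns NotImplemented, so MyObject.__radd__ is used
res_scalar = elementwise_op(operator.add, strings, objs[0])
res_pairwise = elementwise_op(operator.add, strings, objs)
# MyObject + str: handled directly by MyObject.__add__
res_reflected = elementwise_op(operator.add, objs[0], strings)
print(res_scalar, res_pairwise, res_reflected)
```

The point of the sketch is that nothing here depends on the element type being known in advance: each pair goes through Python's normal operator dispatch, which is slower but works for any object that defines the operation.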
