IncRef:

```cpp
refcount.fetch_add(1, std::memory_order_relaxed);
```

DecRef:

```cpp
if (refcount.load(std::memory_order_relaxed) == 1 ||
    refcount.fetch_sub(1, std::memory_order_release) == 1) {
    std::atomic_thread_fence(std::memory_order_acquire);
    delete this;
}
```
Is it safe to compare the refcount to 1, and immediately delete the object without an actual release operation? The reasoning is that the only time the refcount can be 1 is if the current thread has the only reference.
I’m not sure if `delete this` is ever safe; I know `this` isn’t allowed to be null. But you could rework this to be not a member function, so it’s `ptr->refcount` and `delete ptr`.
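A minimal sketch of that rework, assuming a hypothetical `Obj` type with the same refcount field. The `destroyed` member is a demo-only hook so deletion can be observed; it is not part of the question's code.

```cpp
#include <atomic>

struct Obj {
    std::atomic<int> refcount{1};
    bool* destroyed = nullptr;  // demo-only hook to observe deletion
    ~Obj() { if (destroyed) *destroyed = true; }
};

// Non-member version of the question's DecRef: no `delete this`;
// the caller passes the pointer explicitly.
void DecRef(Obj* ptr) {
    if (ptr->refcount.load(std::memory_order_relaxed) == 1 ||
        ptr->refcount.fetch_sub(1, std::memory_order_release) == 1) {
        std::atomic_thread_fence(std::memory_order_acquire);
        delete ptr;
    }
}
```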
I think this has a problem if another thread can create another reference with `IncRef` while you’re in the middle of a `DecRef`. If you always did `refcount--`, their `fetch_add` would see the old value as `0` and know that another thread has already deleted, or is about to delete, the object, and that they were too late in trying to get a reference. Without that, they have no way to distinguish that case from simply being the second reference.

But if that’s impossible, e.g. because new references are created by an existing owner and then given to another thread, then yes, I think this is safe.
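A sketch of that always-decrement scheme, with hypothetical names: since `DecRef` unconditionally decrements, a racing incrementer can detect the dying-object case by seeing the old value `0`. (Recovering from that case is nontrivial; real designs typically use a CAS loop or a weak count, which this sketch omits.)

```cpp
#include <atomic>

struct Obj {
    std::atomic<int> refcount{1};
};

// Returns false if the object was already on its way to deletion.
// fetch_add returns the value *before* the increment, so seeing 0
// means some DecRef has already dropped the last reference.
bool TryIncRef(Obj* ptr) {
    if (ptr->refcount.fetch_add(1, std::memory_order_relaxed) == 0) {
        // Too late; a real implementation would have to back out here.
        return false;
    }
    return true;
}
```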
The `acquire` fence is necessary in case another thread just wrote to the object and then did a `DecRef`, to make sure the stuff they did to the object’s members happens-before the `delete`. And yes, using an `acquire` fence means the load can be `relaxed` instead of `acquire`, and the RMW can be just `release` rather than `acq_rel`. On some ISAs (like AArch64) it might be more efficient to use `acquire` and `acq_rel` operations and avoid a separate fence.
You’re optimizing for the last-reference case by avoiding an RMW there, at the cost of making other calls slower. That might or might not be good, depending on your use-case.
A couple extra instructions to load+branch before you RMW+branch, and that’s another branch that needs to be predicted correctly.
Potentially you get the cache line into MESI Shared state with the read-only access, but then the RMW also misses in cache and has to do a read-for-ownership (RFO) to get it into MESI Exclusive/Modified state, so you have two off-core communications, hopefully pipelined with each other if the `==1` branch predicts correctly. (Other than that, a load right before an RMW is nothing to worry about on typical CPUs.)