Refactor global allocator to safer `TrackingAllocator` #205

ianks · 2023-05-01T19:08:00Z

Previously, the RbAllocator implementation could deadlock and/or segfault when dealloc was called after the Ruby VM had shut down. In particular, this can happen when thread-local destructors run, since they run after the Ruby VM has been shut down. Once the VM is shut down, calls to rb_gc_adjust_memory_usage will fail, so we need to avoid making them.
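A minimal sketch of the kind of guard this implies, not the code from this PR. `VM_AVAILABLE` and `report` are hypothetical stand-ins for the real VM probe and for rb_gc_adjust_memory_usage, so the example runs without Ruby:

```rust
use std::alloc::{GlobalAlloc, Layout, System};
use std::sync::atomic::{AtomicBool, AtomicIsize, Ordering};

/// Hypothetical stand-in for "the Ruby VM is up and usable".
static VM_AVAILABLE: AtomicBool = AtomicBool::new(false);
/// Hypothetical stand-in for the byte count Ruby's GC has been told about.
static REPORTED: AtomicIsize = AtomicIsize::new(0);

/// Hypothetical replacement for rb_gc_adjust_memory_usage: it does nothing
/// when the VM is unavailable instead of calling into a dead VM.
fn report(delta: isize) {
    if VM_AVAILABLE.load(Ordering::Acquire) {
        REPORTED.fetch_add(delta, Ordering::Relaxed);
    }
}

/// Delegates to the system allocator and reports sizes when it is safe to.
pub struct TrackingAllocator;

unsafe impl GlobalAlloc for TrackingAllocator {
    unsafe fn alloc(&self, layout: Layout) -> *mut u8 {
        let ptr = unsafe { System.alloc(layout) };
        if !ptr.is_null() {
            report(layout.size() as isize);
        }
        ptr
    }

    unsafe fn dealloc(&self, ptr: *mut u8, layout: Layout) {
        // Always free the memory; only the reporting step is conditional.
        unsafe { System.dealloc(ptr, layout) };
        report(-(layout.size() as isize));
    }
}
```

Such a type would be installed with `#[global_allocator] static ALLOC: TrackingAllocator = TrackingAllocator;`. A dealloc that arrives after shutdown (for example from a thread-local destructor) still frees the memory but skips the report.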

Bullet Points

Use some hidden symbols (ruby_current_vm_ptr and ruby_current_ec) to check the availability of the Ruby VM before attempting to report memory usage.
Rename RbAllocator to TrackingAllocator to communicate the implementation a bit more accurately.
Add a new ManuallyTracked<T> type which allows RAII-style allocation tracking outside of std::alloc::GlobalAlloc. This is useful when you want to track memory usage from direct calls to mmap(2), for example.
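To illustrate the RAII idea in the last point, here is a rough sketch under stated assumptions. The method names `wrap` and `get`, the `adjust_memory_usage` helper, and the two statics are all hypothetical; the real type in this PR calls into Ruby and may have a different API:

```rust
use std::sync::atomic::{AtomicBool, AtomicIsize, Ordering};

/// Hypothetical stand-in for the VM availability probe.
static VM_AVAILABLE: AtomicBool = AtomicBool::new(true);
/// Hypothetical stand-in for Ruby's view of externally allocated bytes.
static REPORTED: AtomicIsize = AtomicIsize::new(0);

/// Hypothetical stand-in for rb_gc_adjust_memory_usage.
/// Returns true only if the adjustment was actually reported.
fn adjust_memory_usage(delta: isize) -> bool {
    if !VM_AVAILABLE.load(Ordering::Acquire) {
        return false;
    }
    REPORTED.fetch_add(delta, Ordering::AcqRel);
    true
}

/// Reports `size` bytes on creation and gives them back on drop.
pub struct ManuallyTracked<T> {
    item: T,
    /// Bytes that were really reported, so drop never subtracts
    /// more than was added.
    reported: isize,
}

impl<T> ManuallyTracked<T> {
    pub fn wrap(item: T, size: usize) -> Self {
        let delta = size as isize;
        let reported = if adjust_memory_usage(delta) { delta } else { 0 };
        Self { item, reported }
    }

    pub fn get(&self) -> &T {
        &self.item
    }
}

impl<T> Drop for ManuallyTracked<T> {
    fn drop(&mut self) {
        if self.reported != 0 {
            adjust_memory_usage(-self.reported);
        }
    }
}
```

Wrapping a handle to an mmap'd region in something like `ManuallyTracked::wrap(region, len)` would then keep the reported total in step with the lifetime of the mapping.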

Caveats

Unfortunately, this solution will not work with Ruby 3.3, since ruby_current_vm_ptr is no longer publicly exported (it was private to begin with). I've opened an issue on the Ruby bug tracker to add official support for the needed behavior, though. Please chime in on the issue so we can have reliable Rust memory tracking in the next Ruby release!

dylanahsmith · 2023-05-03T19:23:18Z

crates/rb-sys-test-helpers/src/ruby_test_executor.rs

+        R: Send + 'static,
+    {
+        self.run(|| {
+            trigger_full_gc!();


Is this just for the tracking allocator tests? Couldn't this just be used in those tests?

dylanahsmith · 2023-05-03T19:39:04Z

crates/rb-sys-test-helpers/src/ruby_test_executor.rs

-            for closure in receiver {
-                closure();
+            // Wait for the main thread to finish setting up Ruby
+            while unsafe { STATE.load(Ordering::Acquire) != 2 } {


From the std::sync::Once::call_once docs:

This method will block the calling thread if another initialization routine is currently running.

so there shouldn't be any need for this redundant synchronization.
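A small standalone sketch of that guarantee, independent of the test executor code (the statics and the `observe_after_init` function are made up for the example): every thread that returns from `call_once` sees the value written by the one initializer that ran, with no extra spin loop.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Once;
use std::thread;
use std::time::Duration;

static INIT: Once = Once::new();
static VALUE: AtomicUsize = AtomicUsize::new(0);

/// Spawns four threads that race to initialize, then returns what each
/// thread observed right after `call_once` returned.
fn observe_after_init() -> Vec<usize> {
    let handles: Vec<_> = (0..4)
        .map(|_| {
            thread::spawn(|| {
                INIT.call_once(|| {
                    // Slow initializer: the losing threads block here
                    // until it finishes.
                    thread::sleep(Duration::from_millis(50));
                    VALUE.store(42, Ordering::Relaxed);
                });
                VALUE.load(Ordering::Relaxed)
            })
        })
        .collect();
    handles.into_iter().map(|h| h.join().unwrap()).collect()
}
```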

dylanahsmith · 2023-05-03T19:40:46Z

crates/rb-sys-test-helpers/src/ruby_test_executor.rs

@@ -66,7 +77,7 @@ impl RubyTestExecutor {
        F: FnOnce() -> R + Send + 'static,
        R: Send + 'static,
    {
-        let (result_sender, result_receiver) = mpsc::sync_channel(1);
+        let (result_sender, result_receiver) = mpsc::sync_channel(8);


Why 8? This channel is only used in this single run call, so it will only have a single send to it.

sync_channel has some strange behaviors on Rust 1.54, so I was using this as a robustness test. Will remove.
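For reference, a simplified sketch of the single-send pattern being discussed. The `run_on_worker` function is hypothetical and spawns a fresh thread rather than reusing the executor's thread, but the channel usage has the same shape: one send, one receive, so a capacity of 1 is enough.

```rust
use std::sync::mpsc;
use std::thread;

/// Runs `f` on another thread and hands its result back over a
/// bounded channel that carries exactly one message.
fn run_on_worker<F, R>(f: F) -> R
where
    F: FnOnce() -> R + Send + 'static,
    R: Send + 'static,
{
    let (result_sender, result_receiver) = mpsc::sync_channel(1);
    let handle = thread::spawn(move || {
        // The only send on this channel; with capacity 1 it never blocks.
        result_sender.send(f()).expect("receiver dropped");
    });
    let result = result_receiver.recv().expect("worker exited without sending");
    handle.join().expect("worker panicked");
    result
}
```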

crates/rb-sys-test-helpers/src/utils.rs

dylanahsmith · 2023-05-03T20:14:39Z

crates/rb-sys-test-helpers/src/ruby_test_executor.rs

            static INIT: Once = Once::new();
+            static mut STATE: AtomicU8 = AtomicU8::new(0);


Why is more synchronization needed now? It looks like moving this initialization code into executor.run(|| { that precedes the executor being returned means that we shouldn't even need the INIT, let alone the extra STATE synchronization

I was having some major synchronization headaches on Rust 1.54 (but not on stable). I threw everything in the book at it to finally make these tests stable, since libruby doesn't always use atomic loads for these values.

My thinking is that since these are test helpers, it's better to have too much synchronization than not enough, to avoid flakiness at almost all costs. That being said, I'm planning on circling back to clean up the unnecessary stuff.

dylanahsmith · 2023-05-03T20:19:09Z

crates/rb-sys-test-helpers/src/utils.rs

+            let result = $e;
+            let after = unsafe { rb_sys::rb_gc_stat(id) };
+
+            $crate::trigger_full_gc!();


Shouldn't this be triggered before the first rb_gc_stat?

dylanahsmith · 2023-05-03T20:23:04Z

crates/rb-sys-test-helpers/src/utils.rs

+        pub static CAPTURE_LOCK_INIT: std::sync::Once = std::sync::Once::new();
+        pub static mut CAPTURE_LOCK: Option<std::sync::Mutex<()>> = None;
+
+        CAPTURE_LOCK_INIT.call_once(|| unsafe {
+            CAPTURE_LOCK.replace(std::sync::Mutex::new(()));
+        });


Won't each use of this macro end up with a separate lock? What is this lock protecting? Isn't #[ruby_test] already keeping multiple Ruby tests from running concurrently?
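The first question can be checked with a toy macro (hypothetical names, not the macro from this PR): a `static` declared inside a macro body is a distinct item at every expansion site, so two call sites never contend on the same mutex.

```rust
use std::sync::Mutex;

/// Toy macro that declares its lock inside the expansion.
macro_rules! local_lock {
    () => {{
        static LOCK: Mutex<()> = Mutex::new(());
        &LOCK
    }};
}

/// Two call sites, two expansions, two separate statics.
fn lock_a() -> &'static Mutex<()> {
    local_lock!()
}

fn lock_b() -> &'static Mutex<()> {
    local_lock!()
}
```

Holding the guard from `lock_a()` does not stop `lock_b().try_lock()` from succeeding. A lock meant to serialize every use of the macro would have to live in a single static outside the macro body.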

dylanahsmith · 2023-05-03T20:29:53Z

crates/rb-sys/src/utils.rs

+    let ret = !crate::hidden::ruby_current_vm_ptr.is_null();
+
+    #[cfg(any(ruby_lte_2_4, ruby_gt_3_2))]
+    let ret = crate::rb_cBasicObject != 0;


Where does Ruby set rb_cBasicObject = 0? I only see it getting set in Init_class_hierarchy. Hopefully your upstream PR will make this unnecessary anyways.

Yeah, this doesn't actually inform us about a shutdown VM, only if it has started or not. Half-measure.

cc ruby/ruby#7783

Also, Ruby doesn't explicitly set it to 0 anywhere; instead, we are relying on the fact that the default initialization value of a static variable is zero in C.

ianks added 3 commits May 1, 2023 21:39

Add hidden probe to check Ruby VM availability

6ec258f

Refactor GlobalAllocator to be safer TrackingAllocator

97a2c60

Add ManuallyTracked<T> for RAII memory reporting

b36114d

ianks force-pushed the allocator-check-vm branch from 6d5798a to b36114d Compare May 2, 2023 01:40

ianks marked this pull request as ready for review May 2, 2023 01:41

ianks added 2 commits May 1, 2023 22:50

Add integration tests

7afc12f

No impl Deref on ManuallyTracked, dial in atomic ordering

4d940bf

ianks force-pushed the allocator-check-vm branch from c1c9eb5 to 4d940bf Compare May 2, 2023 05:36

ianks added 3 commits May 2, 2023 08:47

Keep track of Ruby adjusted bytes in ManuallyTracked

d59bb03

Verbose tests in CI

c4db7a4

Test synchronization

d02bd6f

ianks force-pushed the allocator-check-vm branch from 3d26c77 to d02bd6f Compare May 2, 2023 17:53

ianks added 2 commits May 2, 2023 13:55

Add back multithread assertion

468755d

Use Arc to make ManuallyTracked cloneable

d12999e

ianks mentioned this pull request May 3, 2023

Report memory usage to the Ruby GC bytecodealliance/wasmtime-rb#187

Merged

3 tasks

dylanahsmith reviewed May 3, 2023

ianks added 4 commits May 4, 2023 14:25

Document and test is_ruby_vm_started

b167812

Remove redundant startup synchronization in test executor

a95b9b6

Set sync channel back to 1

f282d90

Remove extraneous capture lock

3ff04ec

ianks requested a review from dylanahsmith May 4, 2023 18:43

ianks merged commit b536c1e into main May 6, 2023

ianks deleted the allocator-check-vm branch May 6, 2023 02:11
