
gh-135953: Profile a module or script with sampling profiler #136777

Merged: 14 commits into python:main on Aug 11, 2025

Conversation

@lkollar lkollar commented Jul 19, 2025

Add -m and filename arguments to the sampling profiler to launch the specified Python program in a subprocess and start profiling it. Previously only a PID was accepted; attaching to a running process is now done by passing -p PID.
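
A minimal sketch of the flow described above, under stated assumptions: the option names come from the description, but the function and target names below are illustrative, not the PR's actual code. Build the interpreter command for the requested module or script, launch it with subprocess.Popen, then profile it by PID just as the existing -p path does.

```python
# Illustrative sketch only; names here are made up, not taken from the PR.
import subprocess
import sys

def launch_target(module=None, script=None, args=()):
    """Launch the requested module or script in a child interpreter."""
    if module is not None:
        cmd = [sys.executable, "-m", module, *args]
    else:
        cmd = [sys.executable, script, *args]
    return subprocess.Popen(cmd)

proc = launch_target(module="my_app")  # hypothetical target module
try:
    pass  # attach the sampling profiler to proc.pid here, as the -p path does
finally:
    if proc.poll() is None:
        proc.terminate()
```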

@lkollar lkollar force-pushed the sample-module branch 2 times, most recently from 5a351da to 1bf048a, July 19, 2025 14:07
@lkollar lkollar marked this pull request as ready for review July 19, 2025 14:59
@pablogsal pablogsal added the skip news and 🔨 test-with-buildbots labels Jul 29, 2025
@bedevere-bot

🤖 New build scheduled with the buildbot fleet by @pablogsal for commit 1bf048a 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F136777%2Fmerge

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

@bedevere-bot bedevere-bot removed the 🔨 test-with-buildbots label Jul 29, 2025
@pablogsal pablogsal requested a review from Copilot July 29, 2025 16:20

@Copilot Copilot AI left a comment

Pull Request Overview

This PR extends the sampling profiler to support profiling modules and scripts by launching them in subprocesses, in addition to the existing PID-based profiling. The change improves the usability of the profiler by allowing users to profile Python programs from startup rather than only attaching to existing processes.

  • Adds -m and script filename arguments as mutually exclusive alternatives to the existing -p PID option
  • Updates argument parsing to handle the new target modes with proper validation
  • Implements subprocess management with graceful termination handling

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

Files reviewed:

  • Lib/profile/sample.py: adds argument parsing for module and script modes; implements subprocess launching and management
  • Lib/test/test_sample_profiler.py: updates existing CLI tests to use the -p flag and adds comprehensive test coverage for the new module/script functionality
Comments suppressed due to low confidence (1)

Lib/test/test_sample_profiler.py:1637

  • The test uses contextlib.chdir() to change directories for module discovery, but doesn't test the case where the module is not found or the directory change fails. Consider adding a test case for module resolution errors.
            contextlib.chdir(tempdir.name),
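
A hedged sketch of what such a test could look like. It assumes the profiler CLI is reachable as python -m profile.sample (inferred from the Lib/profile/sample.py path in this PR), the target module name is hypothetical, and the exact error-reporting behaviour is not taken from the PR, so the assertion only checks that the missing-module error surfaces on stderr.

```python
# Illustrative sketch only. Assumes the CLI entry point is "profile.sample"
# (inferred from the file path in this PR); not verified against the PR itself.
import subprocess
import sys
import unittest

class TestModuleResolutionErrors(unittest.TestCase):
    def test_profiling_nonexistent_module_reports_error(self):
        result = subprocess.run(
            [sys.executable, "-m", "profile.sample",
             "-m", "module_that_does_not_exist_xyz"],
            capture_output=True,
            text=True,
            timeout=30,
        )
        # The child interpreter's "No module named ..." message is expected
        # to reach the captured stderr one way or another.
        self.assertIn("No module named", result.stderr)

if __name__ == "__main__":
    unittest.main()
```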

Comment on lines 736 to 739
```python
if not(args.pid or args.module or args.script):
    parser.error(
        "You must specify either a process ID (-p), a module (-m), or a script to run."
    )
```

Copilot AI Jul 29, 2025

The condition not(args.pid or args.module or args.script) is redundant since the mutually exclusive group is already marked as required=True. The argparse library will automatically enforce that one of these options is provided.

Suggested change:

```diff
-if not(args.pid or args.module or args.script):
-    parser.error(
-        "You must specify either a process ID (-p), a module (-m), or a script to run."
-    )
+# The mutually exclusive group already enforces that one of these arguments is required.
```

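The reviewer's point can be seen with a few lines of argparse (illustrative only; the option names mirror the PR, the rest is a sketch): with required=True on the mutually exclusive group, parse_args itself rejects a missing target, so a manual check like the one above never fires.

```python
# Minimal demonstration of the reviewer's point; not the PR's actual parser.
import argparse

parser = argparse.ArgumentParser()
group = parser.add_mutually_exclusive_group(required=True)
group.add_argument("-p", "--pid", type=int)
group.add_argument("-m", "--module")
group.add_argument("script", nargs="?")  # a positional with nargs="?" may join the group

# parser.parse_args([]) exits with an error along the lines of:
#   error: one of the arguments -p/--pid -m/--module script is required
args = parser.parse_args(["-m", "my_app"])
print(args.module)  # -> "my_app"
```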

```python
process = subprocess.Popen(cmd)

try:
    exit_code = process.wait(timeout=0.1)
```

Copilot AI Jul 29, 2025

The hardcoded timeout of 0.1 seconds is a magic number. Consider defining this as a named constant or making it configurable, as some programs may have longer startup times.

Suggested change:

```diff
-exit_code = process.wait(timeout=0.1)
+exit_code = process.wait(timeout=DEFAULT_PROCESS_WAIT_TIMEOUT)
```

```python
if process.poll() is None:
    process.terminate()
    try:
        process.wait(timeout=2)
```

This comment was marked as outdated.
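
For context, the quoted hunk is the usual terminate-then-kill shutdown pattern. A self-contained sketch with named timeouts follows; the constant names are assumptions echoing the review suggestion above, not identifiers from the PR.

```python
# Illustrative sketch; the constant names are assumptions, not from the PR.
import subprocess
import sys

STARTUP_WAIT_TIMEOUT = 0.1   # how long to wait for an immediate child exit
TERMINATE_TIMEOUT = 2.0      # grace period after terminate() before kill()

def shutdown(process):
    """Stop a child process, escalating from SIGTERM to SIGKILL if needed."""
    if process.poll() is None:                 # still running?
        process.terminate()                    # polite request first
        try:
            process.wait(timeout=TERMINATE_TIMEOUT)
        except subprocess.TimeoutExpired:
            process.kill()                     # force it down
            process.wait()

proc = subprocess.Popen([sys.executable, "-c", "import time; time.sleep(60)"])
try:
    proc.wait(timeout=STARTUP_WAIT_TIMEOUT)    # did it die right away?
except subprocess.TimeoutExpired:
    pass                                       # still running, as expected
shutdown(proc)
```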

@AA-Turner AA-Turner changed the title [gh-135953] Profile a module or script with sampling profiler gh-135953: Profile a module or script with sampling profiler Aug 4, 2025

lkollar commented Aug 5, 2025

Thanks @AA-Turner, I think I addressed all your comments.

lkollar added 11 commits August 6, 2025 14:26
Add `-m` and `filename` arguments to the sampling profiler to launch the
specified Python program in a subprocess and start profiling it.
Previously only a PID was accepted; attaching to a running process is now
done by passing `-p PID`.
These args are already mutually exclusive, but we need to check that at
least one of them has been passed.
In this case the subprocess will go into zombie state until we can poll
it. We can simply assume this is the case if it's still detected as
running when we get a ValueError.
Improve the return value check to be able to raise a ProcessLookupError
when the remote process is not available.

Mach uses composite error values in which the high bits identify the
originating system and subsystem. We can use the err_get_code function to
mask off those high bits, which makes our error checking more robust when
the subsystem bits are set. For example, when the process is in a zombie
state we can get KERN_NO_SPACE (0x3), but the actual return value is
0x10000003 because the subsystem bits are set, so we need err_get_code to
extract the plain error code.

This also improves how KERN_INVALID_ARGUMENT is handled, distinguishing a
generic invalid-argument error from a process that is no longer accessible.
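
The masking the commit message describes corresponds to Mach's err_get_code macro from <mach/error.h>, which keeps only the low bits of the composite value. The actual change is in C; the Python below is only a sketch of the arithmetic.

```python
# Sketch of the arithmetic only; the real change lives in the C code.
KERN_NO_SPACE = 0x3

def err_get_code(err):
    # Mach's err_get_code keeps the low 14 bits; the high bits encode
    # the originating system and subsystem.
    return err & 0x3FFF

raw = 0x10000003                 # composite value seen for a zombie process
assert err_get_code(raw) == KERN_NO_SPACE
```
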
@pablogsal pablogsal requested review from ambv and 1st1 as code owners August 8, 2025 12:34
@pablogsal pablogsal force-pushed the sample-module branch 3 times, most recently from 758af04 to 289d868, August 8, 2025 13:31
@pablogsal pablogsal added the 🔨 test-with-buildbots label Aug 8, 2025
@bedevere-bot

🤖 New build scheduled with the buildbot fleet by @pablogsal for commit 0338ee1 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F136777%2Fmerge

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

@bedevere-bot bedevere-bot removed the 🔨 test-with-buildbots label Aug 8, 2025
@pablogsal

@lkollar Seems test_sample_target_module has a race since it is failing on some buildbots:

AssertionError: 'slow_fibonacci' not found in 'Captured 10001 samples in 1.00 seconds
Sample rate: 10000.94 samples/sec
Error rate: 99.10%
Profile Stats:
       nsamples   sample%  tottime (ms)    cumul%  cumtime (ms)  filename:lineno(function)
          90/90     100.0         9.000     100.0         9.000  sample.py:96(SampleProfiler.sample)
           0/90       0.0         0.000     100.0         9.000  sample.py:549(sample)
           0/90       0.0         0.000     100.0         9.000  sample.py:590(wait_for_process_and_sample)
           0/90       0.0         0.000     100.0         9.000  sample.py:790(main)
           0/90       0.0         0.000     100.0         9.000  test_sample_profiler.py:1640(TestSampleProfilerIntegration.test_sample_target_module)
           0/90       0.0         0.000     100.0         9.000  case.py:613(TestCase._callTestMethod)
           0/90       0.0         0.000     100.0         9.000  case.py:667(TestCase.run)

Legend:
  nsamples: Direct/Cumulative samples (direct executing / on call stack)
  sample%: Percentage of total samples this function was directly executing
  tottime: Estimated total time spent directly in this function
  cumul%: Percentage of total samples when this function was on the call stack
  cumtime: Estimated cumulative time (including time in called functions)
  filename:lineno(function): Function location and name

Summary of Interesting Functions:

Functions with Highest Direct/Cumulative Ratio (Hot Spots):
  1.000 direct/cumulative ratio, 100.0% direct samples: sample.py:(SampleProfiler.sample)

Functions with Highest Call Frequency (Indirect Calls):
  90 indirect calls, 100.0% total stack presence: sample.py:(sample)
  90 indirect calls, 100.0% total stack presence: sample.py:(wait_for_process_and_sample)
  90 indirect calls, 100.0% total stack presence: sample.py:(main)

Functions with Highest Call Magnification (Cumulative/Direct):

@pablogsal

pablogsal commented Aug 9, 2025

why is the sample.py code in the results? Also somehow we have Error rate: 99.10%????

@pablogsal

Oh, I think this is because we are sampling before the other process calls execv, so we see the forked one! We may need some sync mechanism between the profiler and the profilee.
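
One possible shape for such a mechanism (illustrative only, not necessarily what this PR ended up doing, and POSIX-only as written): have the launched child write a byte to an inherited pipe once its interpreter is up and about to run the target, and have the profiler block on that byte before it starts sampling.

```python
# Illustrative only: a readiness handshake over a pipe (POSIX-only sketch).
import os
import subprocess
import sys

read_fd, write_fd = os.pipe()

# The child signals readiness just before running the target module
# ("my_app" is a hypothetical name), so the profiler never samples the
# not-yet-exec'd interpreter.
child_code = (
    f"import os, runpy; os.write({write_fd}, b'1'); os.close({write_fd}); "
    "runpy.run_module('my_app', run_name='__main__')"
)
proc = subprocess.Popen(
    [sys.executable, "-c", child_code],
    pass_fds=(write_fd,),        # keep the write end open in the child
)
os.close(write_fd)               # parent only reads
os.read(read_fd, 1)              # blocks until the child is ready
os.close(read_fd)
# ... safe to start sampling proc.pid from here ...
proc.wait()
```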

@pablogsal pablogsal added the 🔨 test-with-buildbots label Aug 9, 2025
@bedevere-bot

🤖 New build scheduled with the buildbot fleet by @pablogsal for commit 3847fff 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F136777%2Fmerge

If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.

@bedevere-bot bedevere-bot removed the 🔨 test-with-buildbots label Aug 9, 2025
@pablogsal pablogsal merged commit 4497ad4 into python:main Aug 11, 2025
129 checks passed