-
Notifications
You must be signed in to change notification settings - Fork 51
feat: Support audio_transcribe with partial ordering #1908
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
78dcbf0
to
abc6dae
Compare
4b2927f
to
5560902
Compare
@@ -2488,6 +2488,11 @@ def join( | |||
) | |||
if result is not None: | |||
return result | |||
|
|||
# For block identify joins with null indices, perform cross join |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This doesn't seem desirable. If df1 is n rows and df2 is m rows, won't this end up with n x m rows?
result = df.to_pandas(ordered=False) | ||
|
||
assert "transcribed_text" in result.columns | ||
assert len(result) > 0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The number of rows in result
should be exactly equal to the number of rows in audio_mm_df_partial_ordering
.
feat: Support audio transcription with partial ordering
This change also fixes a related issue where Block.join would fail on joins with null indexes when operating in this partial ordering mode.
b/430572560