You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+1-2Lines changed: 1 addition & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -79,8 +79,7 @@ to uncompress and untar. The directory structure and how the code samples are or
79
79
80
80
The 4 benchmark datasets, Project_CodeNet_C++1000, Project_CodeNet_C++1400,
81
81
Project_CodeNet_Python800, and Project_CodeNet_Java250 are included in the
82
-
full dataset and are available separately in the "Archive Dataset File" column of the table in the "Get this Dataset"
83
-
section in our [data repository](https://developer.ibm.com/exchanges/data/all/project-codenet/).
82
+
full dataset.
84
83
They can be used for code classification and code similarity research as a replacement of or in addition to the dataset [POJ-104](https://sites.google.com/site/treebasedcnn/).
85
84
86
85
To expedite AI for code research using graph neural networks, we have included the simplified parse tree (SPT) representation of the code samples for each benchmark dataset. They are available in the "Archive SPT File" column of the table in the "Get this Dataset" section in our [data repository](https://developer.ibm.com/exchanges/data/all/project-codenet/).
0 commit comments