Skip to content

Commit c68a00c

Browse files
committed
took out URLs to derived datasets
1 parent 160840a commit c68a00c

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -79,8 +79,7 @@ to uncompress and untar. The directory structure and how the code samples are or
7979

8080
The 4 benchmark datasets, Project_CodeNet_C++1000, Project_CodeNet_C++1400,
8181
Project_CodeNet_Python800, and Project_CodeNet_Java250 are included in the
82-
full dataset and are available separately in the "Archive Dataset File" column of the table in the "Get this Dataset"
83-
section in our [data repository](https://developer.ibm.com/exchanges/data/all/project-codenet/).
82+
full dataset.
8483
They can be used for code classification and code similarity research as a replacement of or in addition to the dataset [POJ-104](https://sites.google.com/site/treebasedcnn/).
8584

8685
To expedite AI for code research using graph neural networks, we have included the simplified parse tree (SPT) representation of the code samples for each benchmark dataset. They are available in the "Archive SPT File" column of the table in the "Get this Dataset" section in our [data repository](https://developer.ibm.com/exchanges/data/all/project-codenet/).

0 commit comments

Comments
 (0)