Skip to main content
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

851 to 900 of 1,859 Results
Sep 3, 2021
Neergaard, Karl David; Xu, Hongzhi; Huang, Chu-Ren, 2021, "Database of Word Level Statistics - Mandarin", https://hdl.handle.net/11272.1/AB2/VJDPA0, Abacus Data Network, V1
Abstract Introduction Database of Word Level Statistics - Mandarin was developed by The Hong Kong Polytechnic University. It provides lexical characteristics of a descriptive and statistical nature for words and nonwords of Mandarin Chinese. It is designed for researchers particu...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 279.4 MB - MD5: bc033c1519a63f05d509aee06c1bf5b2
Data
ISO disc image including all documentation and data
Plain Text - 1.4 KB - MD5: 44fce89c1a2390a7b64f0c8222088855
Documentation
File manifest
Sep 3, 2021
Knight, Kevin; Badarau, Bianca; Baranescu, Laura; Bonial, Claire; Bardocz, Madalina; Griffitt, Kira; Hermjakob, Ulf; Marcu, Daniel; Palmer, Martha; O'Gorman, Tim; Schneider, Nathan, 2021, "Abstract Meaning Representation (AMR) Annotation Release 3.0", https://hdl.handle.net/11272.1/AB2/82CVJF, Abacus Data Network, V1
Abstract Introduction Abstract Meaning Representation (AMR) Annotation Release 3.0 was developed by the Linguistic Data Consortium (LDC), SDL/Language Weaver, Inc., the University of Colorado's Computational Language and Educational Research group and the Information Sciences Ins...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 263.5 MB - MD5: e2ffa11c9d6bbb3a183cfa1b5679183b
Data
ISO disc image including all documentation and data
Plain Text - 309.8 KB - MD5: 18700a77dc8421f185bd209d3a582f4f
Documentation
File manifest
Sep 3, 2021
Sluyter-Gaethje, Henny; Bourgonje, Peter; Stede, Manfred, 2021, "Penn Discourse Treebank Version 2.0 - German Translation", https://hdl.handle.net/11272.1/AB2/1AXWBN, Abacus Data Network, V1
Abstract Introduction Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 110.7 MB - MD5: c64b172f3b16a3c9bcd0ad3a5985f548
Data
ISO disc image including all documentation and data
Plain Text - 402 B - MD5: 1b3ee9f19976a27d4dc11d43b5bc1551
Documentation
File manifest
Sep 3, 2021
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010", https://hdl.handle.net/11272.1/AB2/VAZOSD, Abacus Data Network, V1
Abstract Introduction TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2010 TAC KBP Surprise Slot Filling track, the only y...
Optical Disc Image - 2.4 MB - MD5: eeef8698d6282fb7a9e2cd45f23ea691
Data
ISO disc image including all documentation and data
Plain Text - 7.5 KB - MD5: 5bc4cd812c6b2118a94e246039efa1ee
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Sep 3, 2021
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP English Sentiment Slot Filling -- Comprehensive Training and Evaluation Data 2013-2014", https://hdl.handle.net/11272.1/AB2/MRZALN, Abacus Data Network, V1
Abstract Introduction TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2013 and 2014 TAC KBP Sentiment Slot Filling tracks....
Optical Disc Image - 6.7 MB - MD5: ca74664d38a50ac97c892eb5b40b6c23
Data
ISO disc image including all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Plain Text - 46.6 KB - MD5: 40f20b324c900c1e96c63724058747f0
Documentation
File manifest
Sep 3, 2021
Daza, Angel; Frank, Anette, 2021, "X-SRL: Parallel Cross-lingual Semantic Role Labeling", https://hdl.handle.net/11272.1/AB2/DNOJP9, Abacus Data Network, V1
Abstract Introduction X-SRL: Parallel Cross-lingual Semantic Role Labeling was developed by Heidelberg University, Department of Computational Linguistics and the Leibniz Institute for the German Language (IDS). It consists of approximately three million words of German, French a...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 187.7 MB - MD5: bb20fcbcdcc91f337cd6abf1da2ac7e8
Data
ISO disc image including all documentation and data
Plain Text - 1.4 KB - MD5: df77a0ce35a7b2680597ceff5eb176bb
Documentation
File manifest
Sep 3, 2021
Arase, Yuki; Tsujii, Junichi, 2021, "ESPADA", https://hdl.handle.net/11272.1/AB2/ANSK9Z, Abacus Data Network, V1
Abstract Introduction ESPADA (Extended Syntactic Phrase Alignment DAtaset) consists of annotated parse trees and alignment on English sentential paraphrases extracted from machine translation evaluation corpora. It extends SPADE (LDC2018T09) by adding new annotated data for train...
Sep 3, 2021 - ESPADA
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Sep 3, 2021 - ESPADA
Optical Disc Image - 37.8 MB - MD5: 492cc273177c347c7b0dce46317401b8
Data
ISO disc image including all documentation and data
Sep 3, 2021 - ESPADA
Plain Text - 138.7 KB - MD5: 8632c4fd65fa44c7b53778b162bc6a9b
Documentation
File manifest
Sep 3, 2021
Tracey, Jennifer; Delgado, Dana; Chen, Song; Strassel, Stephanie, 2021, "BOLT Chinese SMS/Chat Parallel Training Data", https://hdl.handle.net/11272.1/AB2/O3JTA9, Abacus Data Network, V1
Abstract Introduction BOLT Chinese SMS/Chat Parallel Training Data was developed by the Linguistic Data Consortium and consists of approximately 1.8 million tokens of Chinese SMS/Chat data collected for the DARPA BOLT program along with their corresponding English translations Th...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 123.1 MB - MD5: fee8120c3058cf7520c61b373c3d7fcf
Data
ISO disc image including all documentation and data
Plain Text - 657.5 KB - MD5: 1445dc08c56a8ad9adbdda7777138eb5
Documentation
File manifest
Sep 3, 2021
Li, Bin; Xiao, Liming; Liu, Yihuan; Wen, Yuan; Song, Li; Chun, Jayeol; Feng, Minxuan; Zhou, Junsheng; Qu, Weiguang; Xue, Nianwen, 2021, "Chinese Abstract Meaning Representation 2.0", https://hdl.handle.net/11272.1/AB2/LVQEZJ, Abacus Data Network, V1
Abstract Introduction Chinese Abstract Meaning Representation (CAMR) 2.0 was developed by Brandeis University and Nanjing Normal University and is comprised of semantic representations of a set of approximately 20,000 Chinese sentences from Chinese Treebank (CTB) 8.0 (LDC2013T21)...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 74.1 MB - MD5: 9ee5119e2feec0341f2784e1d223b269
Data
ISO disc image including all documentation and data
Plain Text - 764 B - MD5: bb6b1b6a756a7a77ca9bbab7763fe253
Documentation
File manifest
Sep 3, 2021
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle; Micciulla, Linnea; Pradhan, Sameer; Ramshaw, Lance, 2021, "BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/DXWM3B, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech was developed by Raytheon BBN Technologies and consists of co-reference annotation on Egyptian Arabic discussion forum (DF), SMS/Chat and conversational tele...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 14.3 MB - MD5: 14b71777c64773eda3d06a8c4318a689
Data
ISO disc image including all documentation and data
Plain Text - 72.5 KB - MD5: b5b960eb1d365df8580674ac4c0e7c6f
Documentation
File manifest
Sep 2, 2021
Mena, Carlos Daniel Hernández, 2021, "LibriVox Spanish", https://hdl.handle.net/11272.1/AB2/AHBO1C, Abacus Data Network, V1
Abstract Introduction LibriVox Spanish consists of approximately 73 hours of Spanish read speech and transcripts. The audio data was taken from Spanish audiobooks developed by LibriVox, a non-profit project that creates audiobooks from public domain works. The transcripts were de...
Sep 2, 2021 - LibriVox Spanish
Optical Disc Image - 2.8 GB - MD5: 0b0babe581f6f299b8fbd6e6f77cef41
Data
ISO disc image including all documentation and data: disc 2
Sep 2, 2021 - LibriVox Spanish
Optical Disc Image - 2.1 GB - MD5: a7d31d1a95c8234a2e4af4f4001c93cb
Data
ISO disc image including all documentation and data: disc 1
Sep 2, 2021 - LibriVox Spanish
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Sep 2, 2021 - LibriVox Spanish
Plain Text - 995.0 KB - MD5: 7c38d2c5bec779f89eb0453995dd17bb
Documentation
File manifest: disc 1
Sep 2, 2021 - LibriVox Spanish
Plain Text - 1.4 MB - MD5: 58c2dbe6bdacba6a16753ada6d0f1936
Documentation
File manifest: disc 2
Sep 2, 2021
Ding, Hongwei; Liao, Sishi; Zhan, Yuqing; Yuan, Jiahong; Liberman, Mark, 2021, "Global TIMIT Mandarin Chinese", https://hdl.handle.net/11272.1/AB2/2CCXH8, Abacus Data Network, V1
Abstract Introduction Global TIMIT Mandarin Chinese was developed by the Linguistic Data Consortium and Shanghai Jiao Tong University and consists of approximately five hours of read speech and transcripts in Mandarin Chinese. The Global TIMIT project aimed to create a series of...
Optical Disc Image - 440.6 MB - MD5: 9359f9fc2cf74be907ed324a66fcd183
Data
ISO disc image including all documentation and data
Plain Text - 1.0 MB - MD5: 2531f274481423a4df0066c1105322c0
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =