German FrameNet

Friday, May 18, 2007

File structure & To-do

The file structure of the German Framenet is currently organized as follows:
  • current/remote_client/heartofgold/hog Contains the files of the Hear of Gold engine. This includes the chunker and tokenizer. It also includes some of the xsl transformation files such as toFrameNet.xsl
  • current/remote_client/german-client Includes client utils such as FnDesktop
  • current/german German version of the FN database
  • current/english English version of the FN database
  • current/fnSystem Complete FN database and JBoss that Jisup emailed to us
  • current/corpus corpus files including:
    • bin utils
    • cfg some scripts and config files such as header/footers & DTD files
    • cqp CQP engine version 3.0
    • doc some documentation drafts
    • lib includes OpenSP SGML to XML comverter
    • other tar files of the original corpora as they came on the CDs
    • raw uncompressed corpora (AFP, APWS, DPA)
    • tagger IMS tree-tagger
    • txt plain-text version of corpora (i.e., without tags)
    • xml XML version of the corpora
  • current/sandbox misc files

Major To-do items:
  • Incorporate the H.O.G. engine
  • Solve the FarinaImport.sh error
  • Create a few more scripts to plug-in different pipeline components

1 Comments:

  • Hi
    sorry that my question doesn't relate with this post.
    Could you please say me how can i have (download or buy) the FrameNet Desktop?
    Thanks

    By Blogger Unknown, at 1:28 PM  

Post a Comment

<< Home