<?xml version="1.0" encoding="utf-8"?>
<!-- generator="FeedCreator 1.7.2-ppt DokuWiki" -->
<?xml-stylesheet href="http://geek.kyloo.net/software/lib/exe/css.php?s=feed" type="text/css"?>
<rdf:RDF
    xmlns="http://purl.org/rss/1.0/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel rdf:about="http://geek.kyloo.net/software/feed.php">
        <title>Qin Gao's Softwares  chaski</title>
        <description></description>
        <link>http://geek.kyloo.net/software/</link>
        <image rdf:resource="http://geek.kyloo.net/software/lib/images/favicon.ico" />
        <dc:date>2010-05-26T09:38:06-06:00</dc:date>
        <items>
            <rdf:Seq>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:configure?rev=1259433666&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:download?rev=1268062169&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:faq?rev=1259433629&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:install?rev=1259433558&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:output_structure?rev=1259433689&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:overview?rev=1259433425&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:release_note_chaski?rev=1268062280&amp;do=diff"/>
                <rdf:li rdf:resource="http://geek.kyloo.net/software/doku.php/chaski:tutorial?rev=1259434869&amp;do=diff"/>
            </rdf:Seq>
        </items>
    </channel>
    <image rdf:about="http://geek.kyloo.net/software/lib/images/favicon.ico">
        <title>Qin Gao's Softwares </title>
        <link>http://geek.kyloo.net/software/</link>
        <url>http://geek.kyloo.net/software/lib/images/favicon.ico</url>
    </image>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:configure?rev=1259433666&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-11-28T11:41:06-06:00</dc:date>
        <title>Configuration of Chaski</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:configure?rev=1259433666&amp;do=diff</link>
        <description>Chaski as a Jar


Chaski provides several useful scripts to make training easy; these will be described later. Leaving the scripts aside, Chaski is packed into a single jar file. It can only be executed with Hadoop version 0.20.1 or later.


	*  log4j.properties: the configuration file for the Java logging system.
	*  pkglist.txt: lists the available actions of Chaski and their entry points.</description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:download?rev=1268062169&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-03-08T08:29:29-06:00</dc:date>
        <title>Download Chaski</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:download?rev=1268062169&amp;do=diff</link>
        <description>Dependencies


Chaski must be run on Hadoop; however, it can also be run on a local machine with a pseudo-cluster Hadoop installation. The most recent Chaski releases use the Hadoop 0.20.1 API and DO NOT support any previous version of Hadoop.

The Chaski package usually contains all of its dependency jars, and after building, the package should be able to run directly on Hadoop, i.e. you do not need to add third-party jars to the CLASSPATH; all the jar files not included in the Hadoop distribution will be unjar-ed and re-…</description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:faq?rev=1259433629&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-11-28T11:40:29-06:00</dc:date>
        <title>FAQ for Chaski</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:faq?rev=1259433629&amp;do=diff</link>
        <description></description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:install?rev=1259433558&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-11-28T11:39:18-06:00</dc:date>
        <title>Installation of Chaski</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:install?rev=1259433558&amp;do=diff</link>
        <description>This page describes the installation of Chaski.

1. Download the latest release


Please go to the download page and download the latest release.

If you only want to run phrase extraction using Chaski, you can simply proceed. If you also want to run distributed word alignment, you need to download and compile the MGIZA package (see download).</description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:output_structure?rev=1259433689&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-11-28T11:41:29-06:00</dc:date>
        <title>Output Structure of Chaski/MGIZA</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:output_structure?rev=1259433689&amp;do=diff</link>
        <description>This section describes the typical output on HDFS, so you know where to find what, what can be deleted, and what needs to be saved.

The structure tree under the $ROOT directory is shown in tree view in the following file.



All directory names in italics are temporary and can be deleted safely after training.</description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:overview?rev=1259433425&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-11-28T11:37:05-06:00</dc:date>
        <title>Chaski</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:overview?rev=1259433425&amp;do=diff</link>
        <description>Chaski is a distributed toolkit for machine translation. It contains the following tools:


	*  Distributed word clustering. It can build word classes for billion-word corpora.
	*  Distributed word alignment. Using the newest version of overview, it is able to train word alignment models on the cluster in hours instead of days.
	*  Distributed phrase extraction. Phrase extraction for a large corpus turns out to be slow and to require huge disk space or memory, the …</description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:release_note_chaski?rev=1268062280&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-03-08T08:31:20-06:00</dc:date>
        <title>Chaski Release Notes</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:release_note_chaski?rev=1268062280&amp;do=diff</link>
        <description>0.2.5


Maintenance release; fixed bugs in detecting the existence of files on HDFS.

Download on SourceForge

0.2.4


Maintenance release, fixed bugs that may affect interaction with LoonyBin.

Download on SourceForge

0.2.3


Maintenance release, fixed bugs:</description>
    </item>
    <item rdf:about="http://geek.kyloo.net/software/doku.php/chaski:tutorial?rev=1259434869&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-11-28T12:01:09-06:00</dc:date>
        <title>Chaski Tutorial</title>
        <link>http://geek.kyloo.net/software/doku.php/chaski:tutorial?rev=1259434869&amp;do=diff</link>
        <description>The most common uses of Chaski are word alignment and phrase extraction. This tutorial covers both.

0. Prepare Chaski


Download Chaski here.

Follow the instructions here to install Chaski.

The rest of the tutorial explains two alternative pipelines: running full training starting from word alignment, and running only phrase extraction.</description>
    </item>
</rdf:RDF>
