{"id":931,"date":"2008-10-21T12:55:23","date_gmt":"2008-10-21T19:55:23","guid":{"rendered":"http:\/\/islemaster.wordpress.com\/?p=78"},"modified":"2014-03-17T00:39:55","modified_gmt":"2014-03-17T07:39:55","slug":"to-do-from-ccsc-nw-08","status":"publish","type":"post","link":"https:\/\/www.bradleycbuchanan.com\/b\/to-do-from-ccsc-nw-08\/","title":{"rendered":"To-do from CCSC-NW 08"},"content":{"rendered":"<p>Here is a list of the potential changes to SVMTrainer that were suggested to me during this weekend&#8217;s conference.<\/p>\n<p><strong>Searcher<\/strong><\/p>\n<ul>\n<li>Implement conditions on acceptable web document sizes to optimize document retrieval time<\/li>\n<li>Try using a small initial search as a seed to get other search terms and expand the diversity of my training set &#8211; Yahoo! Term Extraction might be good for this, too.<\/li>\n<\/ul>\n<p><strong>WordFilter<\/strong><\/p>\n<ul>\n<li>Try integrating WordNet into the WordFilter class<\/li>\n<li>Find a use for Yahoo! Term Extraction<\/li>\n<\/ul>\n<p><strong>WebDocument<\/strong><\/p>\n<ul>\n<li>Implement parallelism in the retrieval of search results and the retrieval of web documents<\/li>\n<li>Implement a document retrieval timeout and a URL blacklist to prevent hanging on bad downloads<\/li>\n<\/ul>\n<p><strong>Other<\/strong><\/p>\n<ul>\n<li>Investigate the use of SVMstruct for categorization\/ranking problems in multiple dimensions<\/li>\n<li>Start independently checking the accuracy of trained sets by holding out 10% of results for categorization rather than training<\/li>\n<li>Learn about Xi Alpha estimates and what exactly they mean<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Here is a list of the potential changes to SVMTrainer that were suggested to me during this weekend&#8217;s conference. 
Searcher Implement conditions on acceptable web document sizes to optimize document retrieval time Try using a small initial search as a seed to get other search terms and expand the diversity of my training set &#8211;&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8],"tags":[88,41],"class_list":["post-931","post","type-post","status-publish","format-standard","hentry","category-programmer","tag-ccsc-nw","tag-svm-trainer"],"_links":{"self":[{"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/posts\/931","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/comments?post=931"}],"version-history":[{"count":1,"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/posts\/931\/revisions"}],"predecessor-version":[{"id":1223,"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/posts\/931\/revisions\/1223"}],"wp:attachment":[{"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/media?parent=931"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/categories?post=931"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bradleycbuchanan.com\/b\/wp-json\/wp\/v2\/tags?post=931"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}