CS652 Spring 2004 - Projects
Literature Projects
| Programming Project
| Research Project
Literature Projects
- Student Papers
- Choose paper(s) on the schedule list.
- Presenters.
(1) Each presentation should be about 20-25 minutes in length,
to be followed by 10 minutes or so of discussion.
(2) Powerpoint presentations are required. (We will post them on our class
web site.)
(3) Be sure to present the main ideas and results of the paper.
(4) The total score for each presentation is 60 points.
- 10 points for the overview -- introduce the work and state the goals and
objectives of the authors;
- 25 points for the approach -- explain the basic problem
and how the authors attacked and solved the problem;
- 15 points for the conclusion -- interpret the results and give your
assesment of the strengths, weakness, and possible future work;
- 10 points for answers to questions -- answer the questions as posed by the
class.
- Listeners. (1) Prepare two good questions about the paper in advance, write
them down, and print them out. (2) Add at least one more question during the
presentation. (3) We'll use these questions for class discussion.
These questioins
are due on the date the paper is presented, 5 points for each set of questions
for a paper. (Presenters get these points "for free.")
- Instructor and Guest Lecture Papers
- Optional Reading (You are only responsible for the material in the .ppt
presentation.)
- Participation Responsibilities
- For each presentation, write 2 good discussion questions while listening.
- Grading: Hand in questions. (We will grade them on the basis of 5 points
for each set: 3 points for questions that are really good, are asked, and lead
to good discussion; 2 points for thoughful questions; and 1 point for any
reasonable question.)
Programming Project
There is only one programming project in this course. It, however, is split
into two parts.
1. Build a data-extraction ontology for a domain. (100 points)
- Choose a data extraction application domain. Be sure that there are at least
8 web pages from different web sites available that you can use for experimentation.
- Create a data extraction ontology for your application. First use one Web
page to build your initial ontology. Then tune your ontology on several
(3-5) additional Web pages from different sites until you are satisfied that
it works well. (Keep a log of how long it takes you to create your initial
ontology and how much time it takes you to tune your ontology.)
- Test your application ontology on 2 different "randomly chosen" pages that
are applicable to your ontology from sites not yet used. Compute recall and precisions
ratios for each lexical object sets specified in your ontology.
- Adjust your ontology one "last" time. Test your application ontology on the
last 2 different "randomly chosen" pages and record the precision and recall ratios.
And also, run your final ontology on all of the web pages you have collected
and record the overall precision and recall ratios.
- Write a report to analysis your results and discuss the reasons of your
false positive and true negative results, if any of them is applicable. Also
keep a log of any other interesting problems encountered.
- Electronically submit your application ontology for use in the demo. Also,
submit one or two HTML pages that you think describing the ability of your
ontology the best. Note that please don't send any page that takes a long time
to be extracted.
2. Generate (or write) an OWL ontology for a selected web page in the chosen
domain and annotate a web page based on the OWL ontology. (100 points)
- Write an OWL ontology for your chosen domain.
- Choose a Web page that you have tested for your data-extraction ontology.
- Run your data-extraction ontology to get the result table.
- You need to write a program that fulfills the follows
- be able to automatically insert a pair of class tags for each extracted
data instance in the Web page;
- be able to automatically insert the URI of the tagged data instance
into the created OWL ontology;
- be able to correctly represent the tuples in your result table in
the OWL ontology.
- The program should take three inputs:
- OWL domain ontology
- target Web page
- extracted data table for the target Web page
- For evaluation, we will test your program (1) using a page we select
in your chosen domain and (2) using a classmate's Web page, associated data table,
and his OWL domain ontology.
Research Project
Choose a topic, write a "thesis proposal," and present your ideas to the class.
The topic could be a research problem presented in class or something different.
Use the CS department's MS thesis format found in the on-line graduate
information, except that your outline should be for a journal paper, and you do
not need a signature page, a schedule, a list of artifacts, or a list of
delimitations. Hand in your written proposal on the date specified in the
schedule. Prepare a PowerPoint presentation 12-15 minutes in length to be
presented in class.
Proposals and presentations will be graded on the basis of 100 points,
awarded as follows.
- 25 points for the introduction -- clear motivation for the problem
(2-3 pages in the write-up and as the introductory part of the
presentation)
- 15 points for the thesis statement -- clear, concise statement of
the problem being solved. (1 paragraph; one slide)
- 40 points for methods -- an explanation stating how the problem
is to be solved and the technical details needed to make the solution
viable. (5-7 pages; the body of your presentation)
- 10 points for the contributions. (list 3-4; make a slide for these)
- 10 points for the outline -- make this be a two-level outline
for a research paper you could publish in a CS journal (in making your
outline, consider how papers are written for our discipline and why,
and make your outline conform to this standard).