| Course code: | 2001WETPDB |
| Study domain: | Computer Science |
| Semester: | 2nd semester |
| Contact hours: | 45 |
| Credits: | 6 |
| Study load (hours): | 168 |
| Contract restrictions: | No contract restriction |
| Language of instruction: | English |
| Exam period: | Exam in the 2nd semester |
| Tutor(s): | Martin Theobald, Bart Goethals |
1. Prerequisites
At the start of this course the student should have acquired the following competences:
- General knowledge of the use of a PC and the Internet
Specific prerequisites for this course:
- Databases
- Basic programming skills, preferably in Java
2. Learning outcomes
Main objectives: analysis and design of extended projects in computer science.
Students carry out a project individually. They gain experience in the thorough analysis of a large practical database management problem and in developing a practical solution.
3. Course contents
This year, we will implement a focused crawler that automatically classifies Web pages according to their textual content. The goal of the project is to build a specialized search engine for researchers' homepages and their publications in the Computer Science domain.
We will employ the following tools:
- Java JDK 7
- Apache Lucene
- SVM-light
- Apache PDFBox
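To make the idea of a focused crawler concrete, here is a minimal sketch of its core loop, written to compile under JDK 7. It is not the project's actual design: the in-memory `web` map stands in for HTTP fetching, the keyword-fraction `relevance` function stands in for a trained classifier such as SVM-light, and all class names, method names, URLs, and the threshold value are assumptions made for illustration. The one property it shows is that links are followed only from pages judged on-topic, and that the frontier is ordered by the score of the linking page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.PriorityQueue;
import java.util.Set;

class FocusedCrawlerSketch {

    /** Hypothetical in-memory page: its text and its outgoing links. */
    static final class Page {
        final String text;
        final List<String> links;
        Page(String text, String... links) {
            this.text = text;
            this.links = Arrays.asList(links);
        }
    }

    /** Frontier entry: a URL plus the priority inherited from the linking page. */
    static final class Candidate {
        final String url;
        final double priority;
        Candidate(String url, double priority) {
            this.url = url;
            this.priority = priority;
        }
    }

    /** Toy stand-in for a classifier: fraction of tokens found in the topic vocabulary. */
    static double relevance(String text, Set<String> topicTerms) {
        int hits = 0, total = 0;
        for (String token : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (token.isEmpty()) continue;
            total++;
            if (topicTerms.contains(token)) hits++;
        }
        return total == 0 ? 0.0 : (double) hits / total;
    }

    /** Returns the URLs judged relevant, in the order they were accepted. */
    static List<String> crawl(Map<String, Page> web, String seed,
                              Set<String> topicTerms, double threshold, int maxPages) {
        PriorityQueue<Candidate> frontier =
            new PriorityQueue<Candidate>(11, new Comparator<Candidate>() {
                @Override
                public int compare(Candidate a, Candidate b) {
                    return Double.compare(b.priority, a.priority); // highest priority first
                }
            });
        Set<String> seen = new HashSet<String>();
        List<String> accepted = new ArrayList<String>();
        frontier.add(new Candidate(seed, 1.0));
        seen.add(seed);
        int fetched = 0;
        while (!frontier.isEmpty() && fetched < maxPages) {
            Candidate next = frontier.poll();
            Page page = web.get(next.url);
            if (page == null) continue;          // dead link: nothing to fetch
            fetched++;
            double score = relevance(page.text, topicTerms);
            if (score < threshold) continue;     // off-topic: do not follow its links
            accepted.add(next.url);
            for (String link : page.links) {
                if (seen.add(link)) frontier.add(new Candidate(link, score));
            }
        }
        return accepted;
    }

    /** Hypothetical topic vocabulary for researcher homepages. */
    static Set<String> demoTopic() {
        return new HashSet<String>(
            Arrays.asList("research", "publications", "homepage", "professor"));
    }

    /** Hypothetical five-page web; page d is reachable only through off-topic page b. */
    static Map<String, Page> demoWeb() {
        Map<String, Page> web = new HashMap<String, Page>();
        web.put("http://example.org/a", new Page("computer science research publications homepage",
                "http://example.org/b", "http://example.org/c"));
        web.put("http://example.org/b", new Page("cooking recipes and travel photos",
                "http://example.org/d"));
        web.put("http://example.org/c", new Page("professor homepage with publications list",
                "http://example.org/e"));
        web.put("http://example.org/d", new Page("professor homepage research"));
        web.put("http://example.org/e", new Page("research publications"));
        return web;
    }

    public static void main(String[] args) {
        System.out.println(crawl(demoWeb(), "http://example.org/a", demoTopic(), 0.3, 10));
    }
}
```

In this toy web, page d is never fetched even though its text is on-topic, because the only link to it comes from a page that was rejected. That is the trade-off of a hard focus rule; a softer variant could enqueue links from off-topic pages at a reduced priority instead of dropping them.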
4. Teaching method
Class contact teaching:
- Laboratory sessions
- Skills training
Personal work:
- Assignments: in group
- Project-based work: in group
Facilities for working students
Classroom activities:
- Exercise sessions: free to choose the group division
Individual work:
- In group: individual alternative assignment possible
5. Assessment method and criteria
Presentation
6. Study material
Required reading
Depending on the subject of the project.
Optional reading
The following study material can be studied on a voluntary basis: Depending on the subject of the project.
7. Contact information
Last update: 19/02/2013 17:17 martin.theobald