Browsing by Author "Bourennani, Farid"
Now showing 1 - 2 of 2
- Results Per Page
- Sort Options
Item Integration of heterogeneous data types using self organizing maps(2009-07-01) Bourennani, Farid; Zhu, Ying; Pu, KenWith the growth of computer networks and the advancement of hardware technologies, unprecedented access to data volumes become accessible in a distributed fashion forming heterogeneous data sources. Understanding and combining these data into data warehouses, or merging remote public data into existing databases can significantly enrich the information provided by these data. This problem is called data integration: combining data residing at different sources, and providing the user with a unified view of these data. There are two issues with making use of remote data sources: (1) discovery of relevant data sources, and (2) performing the proper joins between the local data source and the relevant remote databases. Both can be solved if one can effectively identify semantically-related attributes between the local data sources and the available remote data sources. However, performing these tasks manually is time-consuming because of the large data sizes and the unavailability of schema documentation; therefore, an automated tool would be definitely more suitable. Automatically detecting similar entities based on the content is challenging due to three factors. First, because the amount of records is voluminous, it is difficult to perceive or discover information structures or relationships. Second, the schemas of the databases are unfamiliar; therefore, detecting relevant data is difficult. Third, the database entity types are heterogeneous and there is no existing solution for extracting a richer classification result from the processing of two different data types, or at least from textual and numerical data. We propose to utilize self-organizing maps (SOM) to aid the visual exploration of the large data volumes. The unsupervised classification property of SOM facilitates the integration of completely unfamiliar relational database tables and attributes based on the contents. In order to accommodate heterogeneous data types found in relational databases, we extended the term frequency – inverse document frequency (TF-IDF) measure to handle numerical and textual attribute types by unified vectorization processing. The resulting map allows the user to browse the heterogeneously typed database attributes and discover clusters of documents (attributes) having similar content. iii The discovered clusters can significantly aid in manual or automated constructions of data integrity constraints in data cleaning or schema mappings for data integration.Item Leadership based multi-objective optimization with applications in energy systems.(2013-12-01) Bourennani, Farid; Rahnamayan, Shahryar; Naterer, Greg F.Multi-objective optimization metaheuristics (MOMs) are powerful methods for solving complex optimization problems but can require a large number of function evaluations to find optimal solutions. Thus, an efficient multi-objective optimization method should generate accurate and diverse solutions in a timely manner. Improving MOMs convergence speed is an important and challenging research problem which is the scope of this thesis. This thesis conducted the most comprehensive comparative study ever in MOMs. Based on the results, multi-objective (MO) versions of particle swarm optimization (PSO) and differential evolution (DE) algorithms achieved the highest performances; therefore, these two MOMs have been selected as bases for further acceleration in this thesis. To accelerate the selected MOMs, this work focuses on the incorporation of leadership concept to MO variants of DE and PSO algorithms. Two complex case studies of MO design of renewable energy systems are proposed to demonstrate the efficiency of the proposed MOMs. This thesis proposes three new MOMs, namely, leader and speed constraint multi-objective PSO (LSMPSO), opposition-based third evolution step of generalized DE (OGDE3), and multi-objective DE with leadership enhancement (MODEL) which are compared with seven state-of-the-art MOMS using various benchmark problems. LSMPSO was found to be the fastest MOM for the problem undertaken. Further, LSMPSO achieved the highest solutions accuracy for optimal design of a photovoltaic farm in Toronto area. OGDE3 is the first successful application of OBL to a MOM with single population (no-coevolution) using leadership and self-adaptive concepts; the convergence speed of OGDE3 outperformed the other MOMs for the problems solved. MODEL embodies leadership concept into mutation operator of GDE3 algorithm. MODEL achieved the highest accuracy for the 30 studied benchmark problems. Furthermore, MODEL achieved the highest solution accuracy for a MO optimization problem of hydrogen infrastructures design across the province Ontario between 2008 and 2025 considering electricity infrastructure constraints.