Does order matter? Effect of order in group recommendation

Akshita Agarwal1, Manajit Chakraborty1,∗, C. Ravindranath Chowdary1
Department of Computer Science & Engineering, Indian Institute of Technology (BHU), Varanasi 221 005, India

Expert Systems With Applications 82 (2017) 115–127

Article history: Received 11 November 2016; Revised 9 March 2017; Accepted 29 March 2017; Available online 5 April 2017.
Keywords: Recommender systems; Order in recommendation; Group recommendation; Hungarian method; Least Misery

Abstract

Recommendation Systems (RS) are gaining popularity and are widely used for dealing with information on education, e-commerce, travel planning, entertainment etc. Recommender Systems are used to recommend items to user(s) based on the ratings provided by other users as well as the past preferences of the user(s) under consideration. Given a set of items from a group of users, Group Recommender Systems generate a subset of those items within a given group budget (i.e. the number of items to have in the final recommendation). Recommending to a group of users based on the ordered preferences provided by each user is an open problem. By order, we mean that the user provides a set of items that he would like to see in the generated recommendation along with the order in which he would like those items to appear. We design and implement algorithms for computing such group recommendations efficiently. Our system recommends items based on modified versions of two popular recommendation strategies: Aggregated Voting and Least Misery. Although the existing versions of Aggregated Voting (i.e. the Greedy Aggregated Method) and Least Misery perform fairly well in satisfying individuals in a group, they fail to gain significant group satisfaction. Our proposed Hungarian Aggregated Method and Least Misery with Priority improve the overall group satisfaction at the cost of a marginal increase in time complexity. We evaluated the scalability of our algorithms using a real-world dataset. Our experimental results, evaluated using a self-established metric, substantiate the efficiency of our approach. © 2017 Elsevier Ltd. All rights reserved.

∗ Corresponding author. E-mail addresses: [email protected] (A. Agarwal), [email protected] (M. Chakraborty), [email protected] (C.R. Chowdary). 1 All authors contributed equally; names are listed in alphabetical order. http://dx.doi.org/10.1016/j.eswa.2017.03.069 0957-4174/© 2017 Elsevier Ltd. All rights reserved.

1. Introduction

Recommender Systems are a subclass of information filtering systems that predict items for a user or a group of users based on their prior preferences. This information can be obtained either explicitly, by collecting users' ratings, or implicitly, by monitoring the users' behavior (Bobadilla, Ortega, Hernando, & Bernal, 2012). Recommender Systems are often classified based on their design or filtering technique: Content-based (Lops, De Gemmis, & Semeraro, 2011), Context-based (Adomavicius & Tuzhilin, 2015; Chakraborty, Agrawal, Shekhar, & Chowdary, 2015), Collaborative (Ricci, Rokach, & Shapira, 2011), Demographic (Pazzani, 1999) and Knowledge-based (Trewin, 2000). Group Recommender Systems are a subclass of general Recommender Systems; here, the emphasis is on satisfying the needs of a group of users rather than individuals. While a number of techniques have been applied to group recommendation, ranging from Collaborative Filtering (Baltrunas, Makcinskas, & Ricci, 2010; Ghazarian & Nematbakhsh, 2015; O'connor, Cosley, Konstan, & Riedl, 2001) to Critiquing-based approaches (McCarthy et al., 2006), it still continues to attract the community because of its varied applications in Social Networks (Cantador & Castells, 2012; Gartrell et al., 2010), E-commerce websites (Sarwar, Karypis, Konstan, & Riedl, 2000; Schafer, Konstan, & Riedl, 2001) etc.

In this paper, we first model the user satisfaction measure and show that introducing order in group recommendation has a positive effect. Consequently, we propose a system which not only takes user preferences into account but also considers the order in which the user would like them to be presented to her. To the best of our knowledge, this is the first attempt that takes order into account for group recommendation. To prove the efficacy of our proposed system, we adopt two widely known consensus functions for recommendation: Aggregated Voting and Least Misery. Additionally, we provide approximations (the Hungarian Aggregated Method and the Least Misery Method with Priority) of the above two consensus functions to suit our purposes. While the GReedy Aggregated Method (GRAM) aims to select the best item to maximize the satisfaction of the group, it is still far from providing a perfect solution. Instead, the Hungarian Aggregated Method (HAM) offers a maximum-satisfaction combinatorial assignment solution. On the other hand, while the Least Misery Method (LMM) tends to maximize the minimum satisfaction of the group, we show with

the help of experiments that the Least Misery Method with Priority (LMMP) provides better group satisfaction (as high as 30%). We provide a detailed analysis of the effect of three essential variables, namely Group Budget, Group Size and Number of Items, pertaining to the inclusion of order in group recommendation. Group budget (also known as space budget (Amer-Yahia, Roy, Chawlat, Das, & Yu, 2009)) is the maximum number of items to be recommended, i.e. the size of the subset selected from the total set of available items. A comprehensive set of experiments conducted on the real-world MovieLens dataset (GroupLens, 2015) corroborates our claim.

1.1. Motivation

Most Group Recommender Systems suggest a list of items to users based on some consensus function. While generating items relevant to user preferences in the recommended list is a prime requirement, the order in which they appear in the list is important as well. It has been observed through experimentation (as explained later in Section 3.2) that when the satisfaction measure is modeled as a similarity function between a user's preference list and the generated recommendation list, there exists a positive correlation between the two. We studied this correlation between users' preferences and the generated recommendation list and found that order plays a major role in determining user satisfaction when there is a strict group budget. By order we mean that the user provides the set of items that he would like to see in the generated recommendation along with the order in which he would like those items to appear. As such, we propose a system which incorporates the notion of order in addition to the existing functionality of recommending to users of a group. This is particularly useful in Recommender Systems for domains like travel planning, where along with providing preferences the users would also like to visit places in a particular order, depending on convenience, weather, transportation etc. The idea can also be used for planning schedules in educational institutions, where the students and teachers would like to suggest an order in which their classes are held or in which the examinations are conducted.

Example 1. Suppose there are six subjects in the curriculum: Computer Science, Mathematics, Physics, Chemistry, Geography and English. For a particular day, the students have to choose four subjects to be taught and also suggest the order in which they would like the classes to be held. Some possible choices are listed in Table 1.

Table 1. Example of schedule choices.

Student      Choice of schedule
Student 1    Computer Science → Mathematics → Physics → Chemistry
Student 2    Mathematics → Geography → Computer Science → English
Student 3    Physics → Chemistry → Computer Science → Mathematics
...          ...

The problem of selecting four subjects and arranging them in the order in which they would be held, based on some consensus function, is quite challenging, especially if the number of students is large. Now consider similar constraints on a problem which may involve many more variables, such as recommending an itinerary to a group of travelers; clearly, the problem complexity increases manifold. Satisfying each and every user is as important as meeting the satisfaction of the whole group. Another situation could be family members planning a Harry Potter movie marathon. While some members might like to watch the critically acclaimed "Prisoner of Azkaban" first, another member might like to start with "Goblet of Fire", but a Harry Potter purist may insist on watching the series chronologically, starting with "Philosopher's Stone". Certainly, this is a conundrum, and picking the right order that satisfies everybody's mood is a tough choice. Herein lies the possibility of a recommender system incorporating order to accommodate such variance in choices.

1.2. Our contributions

Our contributions are two-fold:

• We posit the idea of introducing order in group recommendation and study its effect on group satisfaction.
• We propose intelligent approximation algorithms for consensus functions to suit our requirements and analyze them thoroughly using three variables: group size, group budget and number of items.

The rest of the paper is organized as follows. Section 2 gives an overview of related work in the existing literature. Section 3 offers an insight into our proposed model. The algorithms and their modified versions are presented in Section 4. Section 5 provides the details of the experimental setup. Results and analysis pertaining to the experimentation are presented in Section 6. Finally, Section 7 concludes the article.

2. Related Work

2.1. Group Recommendation

Movie recommendation has been an active research area for the past few decades, but in recent years there has been an inclination towards building group recommender systems for movies (Christensen & Schiaffino, 2011) to cater to the needs of families, friends etc. Pera and Ng (2013) build a group recommender for movies by the name of GroupReM. The novelty of this approach is that instead of using the conventional approach of Matrix Factorization over movie ratings, the system employs (personal) tags for capturing the contents of the movies considered for recommendation and the group members' interests. They formulate a three-pronged approach for the same:

• Employ a merging strategy to explore individual group members' interests in movies and create a profile that reflects the preferences of the group on movies.
• Use word-correlation factors to find movies similar in content.
• Consider the popularity of movies on a movie website.

Kagita, Pujari, and Padmanabhan (2015) utilize the virtual-user approach for recommendation generation. They make use of transitive precedence relations among items to generate a virtual user profile that represents the combined profile of the group. This strategy has been applied to two different recommendation techniques: Precedence Mining and Collaborative Filtering. Other existing virtual-user strategies take into consideration the set of common items consumed by the users, whereas the authors' strategy computes a fuzzy score for the items not consumed by all the users, which gives this method better precision and recall than the other virtual-user strategies. Kagita et al. (2015) also propose 'monotonicity', i.e. the degree to which a recommendation prevails when new information is added to the training set, as a measure for estimating the quality of a Group Recommender System. Villavicencio, Schiaffino, Diaz-Pace, and Monteserin (2016) present a multi-agent approach, called PUMAS-GR, for group recommendation. The novelty of their approach is that it leverages negotiation techniques in order to integrate recommendations


(previously) obtained for each group member into a list of recommendations for the group. Each user is represented by a personal agent that works on her behalf. The agents carry out a cooperative negotiation process based on the multilateral Monotonic Concession Protocol (MCP). The authors claim that this negotiation process can generate recommendations that satisfy the different group members more evenly than traditional group recommendation approaches, since it mirrors the way in which human negotiation seems to work. To achieve high-quality initial personalization, recommender systems must provide an efficient and effective process for new users to express their preferences. Chang, Harper, and Terveen (2015) propose that this goal is best served not by the classical method, where users begin by expressing preferences for individual items, but by letting new users begin with expressing their preferences for groups of items. The authors test this idea by designing and evaluating an interactive process where users express preferences across groups of items that are automatically generated by clustering algorithms. The paper contributes a strategy for recommending items based on these preferences that is generalizable to any collaborative filtering-based system. Ortega, Hernando, Bobadilla, and Kang (2016) introduce Matrix Factorization (MF) based Collaborative Filtering (CF) for group recommendations. The authors propose three original approaches to map the group of users to the latent factor space and compare the proposed methods in three different scenarios: when the group size is small, medium and large. They also compare the precision of the proposed methods with state-of-the-art group recommendation systems using KNN-based Collaborative Filtering. The authors further analyze group movie ratings on the MovieLens and Netflix datasets.
Kaššák, Kompan, and Bieliková (2016) focus on improving the quality of the very top of the Top-N recommendation list and its order in group recommendations. The authors achieve this by generating content-based and collaborative candidates for the group members. Next, these candidates (content-based and collaborative, separately) are aggregated to resolve conflicting preferences of group members. Finally, a hybrid recommender is applied to merge the content-based and collaborative group candidates and to generate the final recommendations.

2.2. Consensus functions

Amer-Yahia et al. (2009) introduce the semantics of group recommendation algorithms by means of a Consensus Function. The authors also introduce disagreement among group members as a factor that greatly influences the efficiency and effectiveness of the generated recommendation. The disagreement among the users on a particular item is taken into account by a consensus function that computes the recommendation score for an item as a weighted summation of the item relevance and the group disagreement for that item. Amer-Yahia et al. (2009) utilize the concept of user similarity to break ties, by giving priority to the user having maximum similarity with the other members of the group. Two disagreement models are proposed, and top-k threshold algorithms are given for two popular recommendation strategies: Aggregated Voting and Least Misery. The proposed algorithms were analyzed for quality and efficiency by performing experiments on the MovieLens dataset (MovieLens, 2009), and the utility of the disagreement models for obtaining effective and efficient results is established. Jameson and Smyth (2007) put forward two methods of generating group recommendations, viz. Virtual User and Recommendation Aggregation.
The first method combines the highest rated items by the individual users into a list that is recommended to the group, while the second aggregates the predicted ratings of the items for all the users and returns the set of items having the


highest aggregate ratings. The choice of method depends on the kind of application and the extent to which the users are able to modify their choices or view the other users' choices during the recommendation process. The authors also examine various existing recommender systems and depict the different ways in which users' preferences are analyzed and used for the group preference model. Our work adopts the recommendation aggregation consensus because of its flexibility. There are four basic stages in collaborative filtering algorithms at which individual users' data can be aggregated into group data: the similarity metric, establishing the neighborhood, the prediction phase, and the determination of recommended items. Our approach varies from this work in two respects, viz. (a) we do not employ collaborative filtering and (b) the preferences of users are captured implicitly in our approach, as we simply consider those movies in which the user is interested. Ortega, Bobadilla, Hernando, and Gutiérrez (2013) perform aggregation experiments in each of the four stages and present two fundamental conclusions:

1. The system accuracy does not vary significantly according to the stage at which the aggregation is performed.
2. The system performance improves notably when the aggregation is performed in an earlier stage of the collaborative filtering process.

The authors also provide a group recommendation similarity metric and demonstrate the convenience of tackling the aggregation of the group's users in the actual similarity metric of the collaborative filtering process. Conventional solutions to Group Decision Making (GDM) problems do not regard the disagreement of a few of the participants. Castro, Quesada, Palomares, and Martínez (2015) provide a method of seeking group recommendations that ensures a certain level of agreement among the group members.
For this, Consensus Reaching Processes (CRPs) have been proposed, wherein the preferences are iteratively brought closer to one another until a sufficient level of agreement is achieved. The task of recommendation here is divided into two major phases: a Recommendation phase and a Consensus phase. In the Recommendation phase, the k-nearest-neighbor Collaborative Filtering algorithm is applied to obtain individual recommendations, and the top-n commonly predicted items are presented as preferences. Finally, in the Consensus phase, a CRP is applied using an automatic consensus model, wherein automatic changes in preferences are made for the decision making using fuzzy preference relations. The proposed method was tested on the MovieLens 100K dataset (MovieLens, 1998), and the improvement over the baseline group recommendation techniques has been established. In most of the works mentioned above, a consensus is drawn, with or without collaborative filtering, that reflects the overall commonality of users based on their past preferences, but none of them account for the dissimilarity between the order of items that are recommended. Thus, we hypothesize that order is important, and in the sections that follow we establish the same. Roy, Thirumuruganathan, Amer-Yahia, Das, and Yu (2014) introduce the concept of a feedback box in group recommendation. The feedback box is used by a user when he is updating his preferences. Given the updated preferences, the feedback box generates a recommendation for the user that maximizes the satisfaction of the user in the henceforth generated recommendation. The use of the feedback box is examined for the two recommendation strategies, Aggregated Voting and Least Misery, in the presence of two different consensus functions: Overlap Similarity and Hamming Distance. The various proposed recommendation algorithms and feedback boxes are analyzed for efficiency using the MovieLens dataset.
The feedback box generates optimal results for the Least Misery case using Overlap Similarity as the consensus function. Performance is recorded mainly by varying three parameters: the number of users, the total number of items and the group budget. Our work draws inspiration from Roy et al. (2014), as we employ similar consensus functions embedded with the additional concept of order. We have adopted Overlap Similarity as the basis of our satisfaction measure2 and adapted it appropriately for our problem.

3. Model

Our user interaction model requires four inputs, viz. the number of users (n), the total number of items (m), the group budget (k) and a user preference vector (v) per user. Once the user preference vectors are acquired, they are fed into our system in the form of various matrices and vectors; the final output is a recommendation vector (r). As stated above, our work draws its framework from Roy et al. (2014), and hence we have used their models, with modifications to suit our requirements wherever necessary; a glimpse of this is provided in Section 3.3.

3.1. User preference model

A user provides a vector v of length k (equal to the group budget) where index i holds the item which she would like to appear at the ith position in the recommendation list.3 The content of vi is invariably the item_id of an item. We assume that each item provided in the preference vector is unique. The result of the Group Recommendation System is a vector r of length k, similar to the user preference vector.

Example 2. Say there is a total of 20 items (m = 20), of which each user needs to select 6 items to be listed in the preference vector. In that case a sample vector may look like this:

12 6 1 9 15 17

which implies that the user wants to see the item with item_id = 12 at the first position, the item with item_id = 6 at the second position, and so on. The recommendation list generated may be

15 12 1 9 6 17

which means that the items with item_ids 1, 6, 9, 12, 15 and 17 are selected based on the user satisfaction measure and the consensus function, and that the items would appear in the order stated.

3.2. User satisfaction measure

In the existing literature there are many techniques to measure the satisfaction of user preferences against the generated recommendation list, but none of them account for order. So, we propose a new measure called USO (User Satisfaction with Order), defined as:

uso_j = uso_j + f(x, a)                  (1)

f(x, a) = c + k / (x + 1)^(1/a)          (2)

where uso is a vector of length n whose elements are the user satisfaction scores computed by the above formula.4 For each user j, uso_j is the summation of the previous USO score and the new order satisfaction component f(x, a).5 Here k is the group budget; x is the absolute difference between the position at which the user wanted the item to appear and the position at which the item actually appears in the generated recommendation (x can take values between 0 and k − 1); a is a constant whose value is inversely proportional to the importance that we give to order while generating the recommendation; and c is a regularization factor, usually a constant value. The values of f(x, a) are stored in a score vector s.

The satisfaction of any user j at any state depends on the currently recommended list of items and their order. Initially, all users have a satisfaction score of zero. When the first item appears in the recommendation list, each user j expresses her satisfaction using f(x, a). In case the item was not in her initial preference vector v, the satisfaction score for that user j remains the same. This process is carried out for all n users involved. When the second item appears, each user again computes her satisfaction using f(x, a); this new satisfaction is inclusive of the satisfaction she received due to the item in the first position. The process is thus iterative and continues until the group budget is full. The procedure to calculate s is presented as Algorithm 1.

Algorithm 1: Generating score vector.
  Data: k, a
  Result: Score vector s
  1  for 0 ≤ i ≤ k − 1 do
  2      s_i ← c + k / (i + 1)^(1/a);
  3  end

Fig. 1. Effect of order on recommendation for k = 15.

2 Overlap Similarity was chosen as it considers the similarities between the generated recommendation and the user preferences.
3 Existing approaches require users to specify their requirements arbitrarily.
4 All the user satisfaction scores are initialized to zero.
5 s_j basically denotes user j's satisfaction with the order of items.
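As a compact sketch (our Python code, not part of the paper; names are ours), the score vector of Eq. (2)/Algorithm 1 and the per-user USO update of Eq. (1) can be written as:

```python
def score_vector(k, a, c=1.0):
    # s_i = c + k / (i + 1)^(1/a)   -- Eq. (2) / Algorithm 1
    return [c + k / (i + 1) ** (1.0 / a) for i in range(k)]

def uso(pref, rec, s):
    # Eq. (1): a user's USO score sums, over her preferred items that
    # made it into the recommendation, the score for the positional
    # deviation x = |wanted position - actual position|.
    total = 0.0
    for p, item in enumerate(pref):
        if item in rec:
            x = abs(p - rec.index(item))
            total += s[x]
    return total

s = score_vector(5, 2)  # k = 5, a = 2, c = 1
```

With k = 5, a = 2 and c = 1 this reproduces the score vector 6.0000, 4.5355, 3.8867, 3.5000, 3.2361 used in the examples below.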

The greater the value of a, the smaller the role of order in determining the recommendation, and vice versa. To demonstrate this, we plot the function f(x, a) for different values of x and a, with the group budget k = 15 and c = 1. The resultant plot is depicted in Fig. 1. From this figure, it is clear that for small values of a (say 0.5), a deviation of a single position in the recommendation produces a large gap in satisfaction. On the other hand, if the value of a becomes very large (say 100 or 1000), it hardly affects the satisfaction function.6 This substantiates our claim that order is of prime importance in group recommendation. Additional graphs are presented in Figs. 2 and 3. The satisfaction function f(x, a) basically represents the group budget normalized over the deviation of the recommendation list order from the user's expectation; f(x, a) gives us control over the extent to which order affects the recommendation.

6 In Fig. 1 the plots for a = 100 and a = 1000 almost overlap.

Fig. 2. Effect of order on recommendation for k = 5.
Fig. 3. Effect of order on recommendation for k = 100.

3.3. Consensus function

3.3.1. Aggregated Voting

Aggregated Voting is one of the popular recommendation consensus methods (Adomavicius & Tuzhilin, 2005; Kompan & Bieliková, 2014; Resnick & Varian, 1997). This method aims to compute the best group recommendation by considering items in such a fashion that the sum total of the individual satisfaction scores of all the users is maximized.

Example 3. Suppose there are n = 4 users and m = 5 items, out of which k = 2 have to be selected, and the preference vectors are:

User1: 2 1
User2: 4 1
User3: 2 1
User4: 3 5

In this case, the total number of possible recommendations is mPk = 20. By the aggregated method of voting, the recommendation list would be

2 1

as it provides maximum satisfaction to the group. Although the method is relatively simple, it incurs a computational cost of O(k^2 m^k). To avoid such a high computational complexity, we propose an intelligent approximation algorithm for aggregated voting in Section 4.2 that computes the recommendation efficiently in polynomial time.

3.3.2. Least Misery

This is another popular method (Masthoff, 2011; O'connor et al., 2001) that generates a recommendation such that the minimum satisfaction score among all the users is maximized. For Example 3, the Least Misery method would generate

3 1

as the final output. While this method ensures that every user is minimally satisfied, it performs poorly when the overall satisfaction of the group is considered. Moreover, in cases where the user base is large and the group budget is small, both the individual and the overall satisfaction of the group may be affected. This is because in Least Misery we try to maximize the minimum satisfaction of the group, and if the group budget is small then the overall satisfaction score would be much less than that generated by GRAM. Also, since we are aiming only for at least minimum satisfaction for every user, chances are that no one in the group is completely satisfied, e.g. when there are 1000 users while the group budget is 10. Suppose a user u selects an item t which does not appear in the rest of the user preferences. Now, if u is minimally satisfied at some stage, it would further reduce the satisfaction of the other users, since they never kept t in their preferences. To overcome such adversities, we propose a modified version of the Least Misery method in Section 4.4.

4. Algorithms

To understand the algorithms in this section, let us consider the following example.

Example 4. Number of users (n): 8; number of items (m): 9; group budget (k): 5; value of a: 2. The preferences as given by the users are:

User1: 2 5 6 7 8
User2: 5 8 7 4 3
User3: 8 2 3 7 9
User4: 8 4 3 7 2
User5: 2 3 5 1 6
User6: 5 6 7 8 2
User7: 7 8 2 5 4
User8: 9 7 8 3 4

Corresponding to these vectors, two matrices are generated: 1. the User Preference matrix U and 2. the User Satisfaction Score matrix S. In the former matrix, of size k × m, each element U_{i,j} represents the number of users who would like to see item j at position i in the recommendation list. The U so generated is presented in Table 2.

Table 2. User preference matrix U.

Position\Item_ID   1  2  3  4  5  6  7  8  9
1                  0  2  0  0  2  0  1  2  1
2                  0  1  1  1  1  1  1  2  0
3                  0  1  2  0  1  1  2  1  0
4                  1  0  1  1  1  0  3  1  0
5                  0  2  1  2  0  1  0  1  1

Table 3 gives the values of the score vector s.

Table 3. Score vector s for k = 5.

Index   0       1       2       3       4
Score   6.0000  4.5355  3.8867  3.5000  3.2361

Once the User Preference matrix and the score vector have been computed, we generate the User Satisfaction matrix. Here, S_{i,j} denotes the satisfaction value added to the group satisfaction when item j is chosen for the ith position (the User Satisfaction Score matrix S corresponding to the above U is shown in Table 4). For instance, S_{1,2} denotes the group satisfaction score when the 2nd item is chosen for the 1st position. As we can see from the User Preference matrix U, there are two users who want to see item no. 2 at position 1 (Users 1 and 5), one user (User 3) who wants to see item no. 2 at position 2, one user (User 7) who wants to see this item at position 3 and, finally, two users (Users 4 and 6) who want to see it at position 5. For each of the users {1, 5, 3, 7, 4, 6}, choosing item no. 2 at position 1 will entail adding their corresponding satisfaction components (s) to the group satisfaction. Users 1 and 5 are optimally satisfied when item no. 2 is selected for position 1, since there is no deviation from the expected position; hence, corresponding to these two users, we add s_0 = 6 to the group satisfaction score S_{1,2}. User 3 adds s_1 = 4.5355 to S_{1,2}, since there is a deviation of one position from her expectation, and so on. Hence, S_{i,j} is the sum of all the satisfaction component values added by each user for a particular item and a particular position:

S_{i,j} = Σ_{p=1}^{k} s_{|p−i|} × U_{p,j}

So, for the example under consideration:

S_{1,2} = s_0 × U_{1,2} + s_1 × U_{2,2} + s_2 × U_{3,2} + s_3 × U_{4,2} + s_4 × U_{5,2}
        = 6 × 2 + 4.53 × 1 + 3.89 × 1 + 3.50 × 0 + 3.23 × 2 = 26.89

Similarly, the other values of S are calculated; the final matrix is presented as Table 4.

Table 4. User satisfaction matrix S.

Position\Item_ID      1      2      3      4      5      6      7      8      9
1                  3.50  26.89  19.04  14.51  23.92  11.66  28.81  31.69   9.24
2                  3.88  26.61  22.46  16.88  23.49  14.04  31.27  32.99   8.04
3                  4.54  26.08  24.96  16.84  22.84  14.42  34.03  31.27   7.77
4                  6.00  24.49  23.49  18.96  21.42  12.96  34.46  29.84   8.03
5                  4.53  25.86  21.81  20.03  18.39  13.39  28.12  27.89   9.23

The generalized procedures to generate U and S are formalized as Algorithms 2 and 3 respectively.
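The two matrix-building procedures can be sketched in Python (our code, 0-indexed, not from the paper); on the Example 4 preferences it reproduces row 1 of Table 2 and the worked values S_{1,2} ≈ 26.89 and S_{4,7} ≈ 34.46:

```python
def score_vector(k, a, c=1.0):
    # s_i = c + k / (i + 1)^(1/a)   (Eq. (2) / Algorithm 1)
    return [c + k / (i + 1) ** (1.0 / a) for i in range(k)]

def preference_matrix(prefs, k, m):
    # Algorithm 2: U[p][j] counts users wanting item j+1 at position p+1
    U = [[0] * m for _ in range(k)]
    for v in prefs:
        for p, item in enumerate(v):
            U[p][item - 1] += 1
    return U

def satisfaction_matrix(U, s, k, m):
    # Algorithm 3: S[i][j] = sum_p s_{|p-i|} * U[p][j]
    return [[sum(s[abs(p - i)] * U[p][j] for p in range(k)) for j in range(m)]
            for i in range(k)]

# Example 4 preferences (8 users, m = 9 items, k = 5, a = 2)
prefs = [[2, 5, 6, 7, 8], [5, 8, 7, 4, 3], [8, 2, 3, 7, 9], [8, 4, 3, 7, 2],
         [2, 3, 5, 1, 6], [5, 6, 7, 8, 2], [7, 8, 2, 5, 4], [9, 7, 8, 3, 4]]
k, m = 5, 9
U = preference_matrix(prefs, k, m)
S = satisfaction_matrix(U, score_vector(k, 2), k, m)
# U[0] == [0, 2, 0, 0, 2, 0, 1, 2, 1]   (row 1 of Table 2)
# S[0][1] ≈ 26.89 (S_{1,2}); S[3][6] ≈ 34.46 (S_{4,7})
```

Table 4 shows these values rounded to two decimals.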

4.1. GReedy Aggregated Method (GRAM)

This method uses a greedy approach (Black & Pieterse, 2005) to compute the recommendation. It operates in k iterations, where at each step it finds a position p and an item j such that selecting item j for position p adds the maximum satisfaction to the group in the present state (that is, has the maximum value in matrix S). According to this logic, for the given example, S_{4,7} has the highest value, so the 7th item is selected for the 4th position. Following the greedy approach, we do not visit item no. 7 again. In the following iterations, the 8th item is selected for the 2nd position, the 2nd item for the 1st position, and so on. Finally, this method generates the output:

2 8 3 7 4

with the total group satisfaction = 139.339. The total group satisfaction is obtained by adding up the individual user satisfactions generated by the procedure Compute_Satisfaction, presented as Algorithm 4. GRAM is presented in Algorithm 5. The time complexity of GRAM (including the cost of computing S) is O(k²m + kn). Clearly, the introduction of the greedy approach makes the method an intelligent one: once an item has been visited for a particular position, GRAM avoids visiting it again. This effectively reduces the computation time while at the same time aiming to increase the overall group satisfaction.

Algorithm 2: Generating user preference matrix.
    Data: k, v, n
    Result: User Preference Matrix U
    U ← 0
    for 1 ≤ j ≤ n do
        for 1 ≤ p ≤ k do
            U_{p, v_j[p]} ← U_{p, v_j[p]} + 1
        end
    end

Algorithm 3: Generating user satisfaction matrix.
    Data: k, m, a, n
    Result: User Satisfaction Matrix S
    S ← 0
    for 1 ≤ j ≤ k do
        for 1 ≤ p ≤ m do
            for 1 ≤ q ≤ k do
                S_{j,p} ← S_{j,p} + U_{q,p} × s_{|j−q|}
            end
        end
    end

Algorithm 4: Computing satisfaction of individual user (Compute_Satisfaction).
    Data: u, v_u, k, r
    Result: Satisfaction score as sum
    sum ← 0
    for 1 ≤ p ≤ k do
        item ← v_u[p]
        for 1 ≤ q ≤ k do
            if item = r_q then
                sum ← sum + s_{|p−q|}
            end
        end
    end

Algorithm 5: Greedy Aggregated Method.
    Data: k, m, S
    Result: Recommendation vector r
    j ← 1
    r ← 0  (bold 0 represents the zero vector)
    while j ≤ k do
        (p, q) ← argmax_{1≤p≤k, 1≤q≤m} S_{p,q}
        r_p ← q
        for 1 ≤ i ≤ m do S_{p,i} ← φ end
        for 1 ≤ i ≤ k do S_{i,q} ← φ end
        j ← j + 1
    end

4.2. Hungarian Aggregated Method (HAM)

This method employs the Hungarian algorithm (Kuhn, 1955) for maximizing the group satisfaction. As usual, the satisfaction value matrix S is supplied as input. The Hungarian algorithm works on the principle that each component of the sum in a two-dimensional matrix belongs to a unique row and column. For our case, it produces a matrix H of size m × m such that H_{i,j} = 1 implies that the jth item should be placed at the ith position to ensure maximum group satisfaction. Strictly speaking, the Hungarian algorithm offers a minimum-cost combinatorial assignment; however, it can be modified to compute the maximum cost by subtracting each element from the maximum value in the matrix. The Hungarian Aggregated Method (HAM), presented as Algorithm 6, takes a p × q matrix as input and generates an assignment matrix of size r × r (where r = max(p, q)) with Boolean entries such that assignment(i, j) = 1 if element j is selected for the ith row. This holds for the minimum-sum case, in which exactly one entry is assigned the value 1 in every row and every column. In our case, we modify the problem to obtain the maximum cost: we take the maximum value in the satisfaction matrix S and subtract each element from it, i.e. x̄ = max − x for each element x in S. The resultant matrix S̄ is given as input to the minimum-cost Hungarian procedure. For our situation, we have k ≤ m. The Hungarian procedure runs in O(m³) time to generate an m × m matrix; the items of our interest lie in the first k rows of this assignment matrix. For each entry H_{i,j} = 1, we choose the jth item for the ith position, and based on this matrix H the requisite items are selected in the appropriate order. The Hungarian procedure is presented in detail as Algorithm 11 in Appendix A. For the example under consideration, Algorithm 6 produces the recommendation:

5 8 3 7 2

with the total group satisfaction = 142.19, as computed using the Compute_Satisfaction procedure.

Algorithm 6: Hungarian Aggregated Method.
    Data: k, m, S
    Result: Recommendation vector r
    p ← 1; q ← 1
    r ← 0  (zero vector)
    n ← max(k, m)
    max_val ← max_{1≤i≤n, 1≤j≤n} S_{i,j}
    ∀i, j  S̄_{i,j} ← max_val − S_{i,j}
    H ← Hungarian(k, m, S̄)
    while p ≤ k do
        q ← 1
        while q ≤ m do
            if H_{p,q} = 1 then r_p ← q end
            q ← q + 1
        end
        p ← p + 1
    end

The time complexity of HAM (including the cost of computing S) is O(m³ + k²m + kn). This is again an application of intelligence, as we pick elements into the final list in a perceptive fashion (for details, refer Appendix A).

4.3. Least Misery Method (LMM)

As stated earlier, this algorithm tries to maximize the minimum satisfaction of the group. It, too, operates in k iterations, where at each step it finds the user with the current least satisfaction. Next, the method considers the set of items provided in her user preference vector and selects an item i for position j so as to provide maximum satisfaction to the group. For the example under consideration, user 1 is initially identified as the least-satisfied user. Her set of items is {2, 5, 6, 7, 8}. In this set, LMM searches for an item that can produce maximum satisfaction for the group (i.e. it tries to find the maximum satisfaction score in her satisfaction score vector); the corresponding position for that item is also retained. So, item 7 is selected for the 4th position. The method then looks for the next user with minimum satisfaction; this turns out to be user 5, and the above procedure is repeated until all the users are covered. The recommendation generated using this method is:

8 5 2 7 3

with the total group satisfaction = 137.537 and the minimum satisfaction of the group = 11.9223. LMM is presented in Algorithm 7. The time complexity of LMM is O(k³n).

Algorithm 7: Least Misery method.
    Data: k, m, v
    Result: Recommendation vector r
    r ← 0
    for 1 ≤ j ≤ k do
        for 1 ≤ q ≤ n do
            sat_q ← Compute_Satisfaction(q, v_q, k, r)
        end
        least_sat ← min_{1≤q≤n} sat_q;  u ← argmin_{1≤q≤n} sat_q
        max_s, row, col ← 0
        for 1 ≤ p ≤ k do
            item ← v_u[p]
            for 1 ≤ q ≤ k do
                if S_{q,item} > max_s then
                    max_s ← S_{q,item};  row ← q;  col ← item
                end
            end
        end
        r_row ← col
        for 1 ≤ i ≤ m do S_{row,i} ← φ end
        for 1 ≤ i ≤ k do S_{i,col} ← φ end
    end
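GRAM's selection loop (Section 4.1) can be checked against the running example. Below is a small Python sketch (hypothetical helper name; the paper's prototype is in C++) that runs the greedy selection on the Table 4 satisfaction matrix and reproduces the recommendation 2 8 3 7 4 with total satisfaction ≈ 139.33 (139.339 in the text, up to rounding of the tabulated values):

```python
# Table 4: rows = items 1..9, columns = positions 1..5.
TABLE4 = [
    [3.50, 3.88, 4.54, 6.00, 4.53],
    [26.89, 26.61, 26.08, 24.49, 25.86],
    [19.04, 22.46, 24.96, 23.49, 21.81],
    [14.51, 16.88, 16.84, 18.96, 20.03],
    [23.92, 23.49, 22.84, 21.42, 18.39],
    [11.66, 14.04, 14.42, 12.96, 13.39],
    [28.81, 31.27, 34.03, 34.46, 28.12],
    [31.69, 32.99, 31.27, 29.84, 27.89],
    [9.24, 8.04, 7.77, 8.03, 9.23],
]

def gram(S):
    """Greedy: repeatedly take the largest remaining S[p][q], then
    invalidate that position (row) and that item (column)."""
    k, m = len(S), len(S[0])
    S = [row[:] for row in S]            # work on a copy
    r = [0] * k
    for _ in range(k):
        p, q = max(((i, j) for i in range(k) for j in range(m)),
                   key=lambda ij: S[ij[0]][ij[1]])
        r[p] = q + 1                     # store 1-based item id
        for j in range(m): S[p][j] = float('-inf')
        for i in range(k): S[i][q] = float('-inf')
    return r

# Transpose Table 4 to S[position][item].
S = [[TABLE4[item][pos] for item in range(9)] for pos in range(5)]
r = gram(S)
total = sum(S[p][r[p] - 1] for p in range(5))
print(r, round(total, 2))  # [2, 8, 3, 7, 4] 139.33
```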

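The optimal position-to-item assignment that HAM (Section 4.2) obtains via the Hungarian algorithm can be cross-checked on the same Table 4 matrix. Since the instance is tiny (5 positions, 9 items), the Python sketch below simply brute-forces all ordered item choices instead of running the O(m³) Hungarian procedure; it is a verification sketch, not the paper's implementation, and it reproduces the recommendation 5 8 3 7 2 with total satisfaction 142.19:

```python
from itertools import permutations

# S[position][item]: positions 1..5 as rows, items 1..9 as columns (Table 4).
S = [
    [3.50, 26.89, 19.04, 14.51, 23.92, 11.66, 28.81, 31.69, 9.24],
    [3.88, 26.61, 22.46, 16.88, 23.49, 14.04, 31.27, 32.99, 8.04],
    [4.54, 26.08, 24.96, 16.84, 22.84, 14.42, 34.03, 31.27, 7.77],
    [6.00, 24.49, 23.49, 18.96, 21.42, 12.96, 34.46, 29.84, 8.03],
    [4.53, 25.86, 21.81, 20.03, 18.39, 13.39, 28.12, 27.89, 9.23],
]

# Exhaustively try every assignment of 5 distinct items to the 5 positions.
best = max(permutations(range(9), 5),
           key=lambda a: sum(S[p][a[p]] for p in range(5)))
r = [j + 1 for j in best]                      # 1-based item ids
total = sum(S[p][best[p]] for p in range(5))
print(r, round(total, 2))  # [5, 8, 3, 7, 2] 142.19
```

Note how the optimum sacrifices item 2's best slot (position 1) to place item 5 there, recovering more satisfaction by moving item 2 to position 5 — exactly the coupling a greedy pass cannot see.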


4.4. Least Misery Method with Priority (LMMP)

This algorithm is an extension of the Least Misery method and follows the principle described in Section 4.3. However, to resolve the ties that may occur — for instance, when initially all the users have the same satisfaction score of zero — we take the similarity between the users into consideration. The principal idea is that if a set of users has the same current satisfaction, we should satisfy the user who can add the maximum possible satisfaction to the group. To implement this, we assign a priority value to each user based on her similarity with all the other users in the group. The priority value is computed once per recommendation, where priority[i] denotes the similarity of user i with all other users in the group. Overlap similarity (i.e. the similarity between two users) is calculated from the similarity between the two users' preference vectors. The underlying philosophy of Compute_Satisfaction and Compute_Overlap is the same; the difference is the type of lists passed as arguments. For Compute_Satisfaction, one list is a preference vector and the other is the generated recommendation, while for Compute_Overlap both lists are user preference vectors. The higher the priority value, the greater the similarity of the user with the other members of the group; satisfying such a user will almost certainly add greater satisfaction to the group. For instance, let us compute the priority score for user 1 from Example 4. For this, we calculate the overlap of preference vector v_1 with the vectors v_2, v_3, ..., v_8. The overlap values calculated using the Compute_Overlap procedure (Algorithm 8) are presented in Table 5. Summing these values, we get priority[1] = 97.1623. Priority values for the other users are calculated similarly.

Table 5
Similarity values for user 1 with other users.

User   Overlap with user 1
2      12.5711
3      13.7716
4      12.4721
5      14.4223
6      21.3782
7      14.7735
8       7.7735

Algorithm 8: Computing similarity between two users (Compute_Overlap).
    Data: u, v_u, k, v_w
    Result: val
    val ← Compute_Satisfaction(u, v_u, k, v_w)
    return val

Algorithm 9: Procedure for computing priority (Compute_Priority).
    Data: n, v_u, k, s, u
    Result: Priority of user u as sum
    r ← v_u
    sum ← 0
    for 1 ≤ i ≤ n do
        if i ≠ u then
            sum ← sum + Compute_Satisfaction(i, v_i, k, r)
        end
    end

In this way, we ensure that the least-misery user is chosen optimally: we satisfy this user while also maximizing the overall group satisfaction. Breaking ties with the help of priority introduces the property of aggregation into the Least Misery method. For the case under consideration, the priority values generated using the Compute_Priority procedure (Algorithm 9) are shown in Table 6.

Table 6
Priority values for the users in Example 4.

User   Priority
1       97.1623
2      101.711
3       96.9522
4      103.855
5       61.5836
6       97.6902
7       99.3427
8       85.2987

Initially, all the users have a satisfaction score of 0, so user 4, who has the highest priority value, is selected as the user with the current least satisfaction. From his set of items, i.e. {8, 4, 3, 7, 2}, we look for the item that can generate maximum satisfaction for the group; the 7th item is selected for the 4th position. The satisfaction values of the users are then recomputed, and now user 5 has the least satisfaction. The same procedure is repeated for this user, the 2nd item is chosen for the 3rd position, and so on. For Example 4, LMMP generates the following output:

5 8 2 7 3

with the total group satisfaction = 139.265 and the minimum satisfaction of the group = 11.2735. The above scores show that including priority in the Least Misery approach increases the overall satisfaction of the group by 0.015% while ensuring that the least satisfaction is not reduced by more than 0.05%. This approach is particularly useful when the number of users and items is large compared to the group budget. LMMP is presented in Algorithm 10. The time complexity of LMMP is O(k³n + k²n²).

5. Experimental setup

We conducted a series of performance and quality experiments using a real-world dataset extracted from MovieLens (GroupLens, 2015). Our prototype system has been implemented in C++. All experiments were conducted on an Intel machine equipped with a quad-core 2.20 GHz CPU, 8 GB RAM and a 1 TB HDD, running Ubuntu 14.04. All the results presented in the next sections were obtained as the average of five runs.

5.1. Data preparation

The MovieLens dataset (GroupLens, 2015) contained 20,000,263 movie ratings provided by 7200 users over 27,278 movies. We adopted a simple Boolean procedure to convert these ratings to our required model. For each user, a movie rated greater than 2 (on a scale of 1–5) was selected and appended to the preference vector of that user in the order in which it appeared in the input rating file; a movie rated 2 or below was simply discarded. In effect, we had 7200 user preference vectors. We considered the order of movies as suggested by the input order of the users' ratings. Since the number of movies is quite large compared to the number of users, the chance of two users having exactly the same ordered set is minimal. Hence, we assumed that the order in which a user's movie ratings appear in the dataset is the order of recommendation the user prefers.
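The Boolean conversion described above can be sketched as follows; a minimal Python sketch assuming a hypothetical (user, movie, rating) triple layout for the input records:

```python
from collections import defaultdict

def build_preference_vectors(ratings, threshold=2):
    """Keep, per user, the movies rated above `threshold`,
    in the order the ratings appear in the input."""
    prefs = defaultdict(list)
    for user, movie, rating in ratings:
        if rating > threshold:
            prefs[user].append(movie)
    return dict(prefs)

# Toy input: user 1 rates movies 10, 11, 12; user 2 rates movie 10.
ratings = [(1, 10, 5), (1, 11, 2), (2, 10, 3), (1, 12, 4)]
print(build_preference_vectors(ratings))  # {1: [10, 12], 2: [10]}
```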


Algorithm 10: Least Misery Method with Priority.
    Data: k, m, a, s, S
    Result: Recommendation vector r
    r ← 0
    for 1 ≤ j ≤ n do
        priority_j ← Compute_Priority(n, k, s, v_j, j)
    end
    for 1 ≤ j ≤ k do
        for 1 ≤ p ≤ n do
            sat_p ← Compute_Satisfaction(p, v_p, k, r)
        end
        least_sat ← min_{1≤p≤n} sat_p
        if two or more users share least_sat then
            u ← argmax over those tied users q of priority_q
        end
        else
            u ← argmin_{1≤p≤n} sat_p
        end
        max_s, row, col ← 0
        for 1 ≤ p ≤ k do
            item ← v_u[p]
            for 1 ≤ q ≤ k do
                if S_{q,item} > max_s then
                    max_s ← S_{q,item};  row ← q;  col ← item
                end
            end
        end
        r_row ← col
        for 1 ≤ i ≤ m do S_{row,i} ← φ end
        for 1 ≤ i ≤ k do S_{i,col} ← φ end
    end
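The tie-breaking rule in Algorithm 10 — among the users sharing the current least satisfaction, pick the one with the highest precomputed priority — can be sketched as follows (a minimal Python sketch with hypothetical names):

```python
def pick_least_misery_user(sat, priority):
    """sat[u] = current satisfaction, priority[u] = precomputed priority.
    Return the least-satisfied user; break ties by highest priority."""
    least = min(sat.values())
    tied = [u for u, s in sat.items() if s == least]
    return max(tied, key=lambda u: priority[u])

# Initially every satisfaction is 0, so the highest-priority user (4) wins,
# matching the walk-through in Section 4.4.
sat = {u: 0.0 for u in range(1, 9)}
priority = {1: 97.1623, 2: 101.711, 3: 96.9522, 4: 103.855,
            5: 61.5836, 6: 97.6902, 7: 99.3427, 8: 85.2987}
print(pick_least_misery_user(sat, priority))  # 4
```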

Since no explicit groups were provided in the dataset, we formed groups of users based on their similarity, as explained in Section 4.4, using Algorithm 8.

6. Results and analysis

We report the efficiency results for the five proposed algorithms. Performance is measured mainly by varying three parameters: group size (n), total number of items (m) and group budget (k).

6.1. Results

6.1.1. Varying the group budget k
We study the running time of all the proposed algorithms by varying the group budget (k). The brute-force algorithm quickly becomes very expensive even for k < 5. We set n = 1200 and m = 2000. As shown in Fig. 4, our proposed algorithms prove to be very efficient; the variation with the group budget k is depicted there. The running time of the Least Misery with Priority method increases most rapidly with increasing k. This can be attributed to its time complexity of O(nk³ + n²k²): calculating the priorities a priori and then breaking ties with priority adds to the computation time.

Fig. 4. Comparison of time complexity of algorithms for varying group budget.

6.1.2. Varying the number of users n
In this experiment we study the effect of the number of users on the running time of the proposed algorithms. For this we set the group budget k = 100 and m = 2000, and vary the group size n. The resulting plot is depicted in Fig. 5. From this, we observe that GRAM gives the best performance, as it shows no noticeable change with respect to the number of users.

Fig. 5. Comparison of time complexity of algorithms for varying number of users.

6.1.3. Varying the total number of items m
In this experiment we set k = 100 and n = 1000 and vary the total number of items. The results are shown in Fig. 6.

Fig. 6. Comparison of time complexity of algorithms for varying number of items.

6.2. Analysis

The performance of the two aggregation methods was compared on the basis of the overall group satisfaction, while the two Least Misery algorithms were compared on both the overall group satisfaction and the least satisfaction obtained. The following results were obtained:


Fig. 7. Comparison of HAM and GRAM over satisfaction.

Fig. 8. Comparison of Least Misery vs. Least Misery with Priority over user satisfaction.

1. The Hungarian Aggregated Method (HAM) gave satisfaction at par with or better than the Greedy Aggregated Method (GRAM): on average, it gave about 2.5% more satisfaction than GRAM. However, this greater satisfaction comes at the cost of a much higher running time. Fig. 7 depicts the average user satisfaction for both algorithms for fixed values of m and k (m = 2000, k = 100) and n varying between 800 and 1200.

2. Both Least Misery methods gave almost equal overall satisfaction for the group when the group budget was comparable with the total number of items. However, as the group budget is reduced, the Least Misery with Priority method gives better satisfaction than the normal approach. The variation of average user satisfaction with the number of users for both methods is shown in Fig. 8. Although the overall satisfaction produced by the Least Misery Method with Priority is greater in some cases, the corresponding least satisfaction produced by LMMP is lower than that of Least Misery. This can be attributed to the fact that including the preferences of the users adds the property of aggregation to the normal Least Misery approach, and hence there is a slight compromise in the least misery in such cases. The variation of the least satisfaction with the group budget for both algorithms is shown in Fig. 9.

3. In cases where the group budget is quite small in comparison to the number of users and the total number of items, the Least Misery with Priority method produces much better overall group satisfaction (as much as 30% greater) than Least Misery. In such cases it becomes difficult to satisfy all the users, so the optimal recommendation is one that not only tries to maximize the least misery but also produces good satisfaction for the group; this is the advantage of the Least Misery with Priority method. The difference between the two algorithms for lower values of k is depicted in Fig. 10.

From the above set of experiments we observe that:

Fig. 9. Comparison of Least Misery vs. Least Misery with Priority over least satisfaction.

Fig. 10. Comparison of Least Misery vs. Least Misery with Priority over average user satisfaction.







• Least Misery with Priority and the Hungarian Aggregated Method (HAM) are quite costly compared to their counterparts. However, these algorithms sacrifice time to produce better satisfaction results.
• HAM achieves satisfaction equal to the brute-force algorithm in most cases. However, GRAM also produces approximate results (97% of HAM) in much less time.
• The Least Misery Method with Priority is useful when the group budget is small compared to the total number of items and users, and it is not possible to satisfy all the users. In other cases, the Least Misery method is preferred, as it produces a greater value of least satisfaction and an approximate overall group satisfaction in less time.

7. Conclusions and future work

Group Recommender Systems have been studied and applied widely. Conventional Group Recommender Systems employ either Aggregated Voting or Least Misery methods. While much work has been done to ensure maximum satisfaction of the group by processing user preferences, to the best of our knowledge the order of recommended items has never been considered. Through our proposed user satisfaction measure (USO) we show that order plays an integral part in achieving the best satisfaction for the group. We proposed two modified versions of the Aggregated Voting strategy, namely GRAM and HAM, which overcome the shortcomings of the original aggregated method. We also introduced a variation of the Least Misery method, Least Misery with Priority, to counter the pitfalls of the former. Most of our proposed variations use some form of intelligence. The resultant system takes into account both the preferences and the order suggested by individual users to ensure maximum group satisfaction. An exhaustive set of experiments was carried out on a real-life dataset to corroborate our claims.


As part of our future work, we would like to explore the efficiency and scalability of our algorithms on various other datasets, such as travel itineraries. Besides, we intend to apply other intelligent machine-learning-based algorithms to capture the effect of order on both individual and group recommendation.

Acknowledgment

This work was partially funded under the DST-SERB, India grant YSS/2015/000906.

Appendix A. Steps involved in computing assignment matrix H

The following Hungarian algorithm is a modified form of the original Munkres' assignment algorithm (Pilgrim, 2017). It describes the manipulation of a two-dimensional matrix by starring and priming zeros (explained below) and by covering and uncovering rows and columns.

1. For each row of the matrix, we find the smallest element and subtract it from every element in its row. A local variable minval holds the smallest value in a row and is subtracted from each element of that row. Next, we find a zero (Z) in the resulting matrix; if there is no starred zero in its row or column, we star Z. This is repeated for each element of the matrix. In this step we introduce the mask matrix H, which has the same dimensions as the cost matrix S and is used to star and prime zeros of the cost matrix: if H(i,j) = 1 then S(i,j) is a starred zero, and if H(i,j) = 2 then S(i,j) is a primed zero. We also define two vectors R_cov and C_cov that are used to "cover" the rows and columns of S. We check whether S(i,j) has a zero value and its column and row are not already covered; if so, we star this zero (set H(i,j) = 1) and cover its row and column (set R_cov(i) = 1 and C_cov(j) = 1). Next, we uncover all rows and columns so that the cover vectors can be used to count the number of starred zeros, and we cover each column containing a starred zero. If y columns are covered, the starred zeros describe a complete set of unique assignments; in that case the flag variable done is set to true, otherwise we go to the next step. Once the entire cost matrix has been searched, we count the number of independent zeros found: if y independent zeros have been found (and starred), we are done; if not, we proceed to the next step. This step is presented as Algorithm 11.

2. The goal of this step is to find a non-covered zero and prime it. If there is no starred zero in the row containing this primed zero, we go to Step 3. Otherwise, we cover this row and uncover the column containing the starred zero. We repeat this until there are no uncovered zeros left, save the smallest uncovered value and go to Step 4. In this step, statements such as "find a non-covered zero" are clearly distinct operations that deserve their own functional blocks, so we have decomposed the step into a main procedure (Algorithm 12) and two subprograms (Algorithms 15 and 16).

3. Now we construct a series of alternating primed and starred zeros as follows. Let path_row represent the uncovered primed zero found in Step 2, and let path_col denote the starred zero in the column of path_row (if any). We continue until the series terminates at a primed zero that has no starred zero in its column. Next, we unstar each starred zero of the series, star each primed zero of the series, erase all primes and uncover every line in the matrix, then return to Step 1. The operations of this step are segregated into a main procedure (Algorithm 13) and two relatively simple subprograms (Algorithms 17 and 18).

Algorithm 11: Hungarian procedure.
    Data: S, k, m
    Result: Assignment matrix H
 1  C_cov ← 0
 2  R_cov ← 0
 3  path_row, path_col ← 0
 4  for 1 ≤ i ≤ n do
 5      min_val ← min_{1≤j≤n} S_{i,j}
 6      ∀j  S_{i,j} ← S_{i,j} − min_val
 7  end
 8  for 1 ≤ i ≤ n do
 9      for 1 ≤ j ≤ n do
10          if S_{i,j} = 0 && C_cov_j = 0 && R_cov_i = 0 then
11              H_{i,j}, C_cov_j, R_cov_i ← 1
12          end
13      end
14  end
15  C_cov_i, R_cov_i ← 0  ∀i ∈ (1, n)
16  count ← 0
17  ∀j  if ∃i : H_{i,j} = 1 then
18      C_cov_j ← 1
19      count ← count + 1
20  end
21  if count = n then
22      return H
23  end
24  else
25      SubProcedureOne()
26  end
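Step 1 above (row reduction followed by starring independent zeros) can be sketched in isolation; a minimal Python sketch (hypothetical function name, not the paper's C++ code):

```python
def step_one(S):
    """Subtract each row's minimum, then star zeros that share
    no row or column with an already starred zero."""
    n = len(S)
    S = [[x - min(row) for x in row] for row in S]   # row reduction
    H = [[0] * n for _ in range(n)]                  # mask: 1 = starred zero
    r_cov, c_cov = [False] * n, [False] * n
    for i in range(n):
        for j in range(n):
            if S[i][j] == 0 and not r_cov[i] and not c_cov[j]:
                H[i][j] = 1                          # star this zero
                r_cov[i], c_cov[j] = True, True      # cover its row and column
    return S, H

S, H = step_one([[4, 1, 3], [2, 0, 5], [3, 2, 2]])
print(H)  # [[0, 1, 0], [0, 0, 0], [0, 0, 1]]
```

Here only two independent zeros can be starred, so the full procedure would continue with Step 2 rather than terminate.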

Algorithm 12: SubProcedureOne.
    row, col, done ← 0
    while done ≠ 1 do
        FindZero(row, col)
        if row = 0 then
            done ← 1
            SubProcedureThree()
        end
        else
            H_{row,col} ← 2
            index ← StarRow(row)
            if index ≠ −1 then
                col ← index
                R_cov_row ← 1
                C_cov_col ← 0
            end
            else
                done ← 1
                path_row ← row
                path_col ← col
                SubProcedureTwo()
            end
        end
    end

4. In this step, we find the smallest uncovered value, add it to every element of each covered row, and subtract it from every element of each uncovered column. We then go back without altering any stars, primes, or covered lines. Algorithm 14 concludes the main Hungarian procedure.


Algorithm 13: SubProcedureTwo.
    pc ← 1
    path_{pc,1} ← path_row;  path_{pc,2} ← path_col
    done ← 0
    while done ≠ 1 do
        r ← FindStarCol(path_{pc,2})
        if r > 0 then
            pc ← pc + 1
            path_{pc,1} ← r
            path_{pc,2} ← path_{pc−1,2}
        else
            done ← 1
        if done ≠ 1 then
            c ← FindPrimeRow(path_{pc,1})
            pc ← pc + 1
            path_{pc,1} ← path_{pc−1,1}
            path_{pc,2} ← c
        for 1 ≤ i ≤ pc do
            if H_{path_{i,1},path_{i,2}} = 1 then
                H_{path_{i,1},path_{i,2}} ← 0
            else
                H_{path_{i,1},path_{i,2}} ← 1
        ∀i ∈ (1, n)  R_cov_i, C_cov_i ← 0
        ∀i, j ∈ (1, n)  if H_{i,j} = 2 then H_{i,j} ← 0
    go to line 16 in Hungarian()

Algorithm 15: FindZero.
    Data: row, col
    Result: row, col
    row ← 0;  col ← 0
    i ← 1;  d ← 0
    while d ≠ 1 && i ≤ n do
        j ← 1
        while j ≤ n do
            if S_{i,j} = 0 && C_cov_j = 0 && R_cov_i = 0 then
                row ← i;  col ← j;  d ← 1
            end
            j ← j + 1
        end
        i ← i + 1
    end

Algorithm 16: StarRow.
    Data: row
    Result: j
    for 1 ≤ j ≤ n do
        if H_{row,j} = 1 then
            return j
        end
    end
    return −1

Algorithm 14: SubProcedureThree.
    mv ← ∞
    for 1 ≤ i ≤ n do
        for 1 ≤ j ≤ n do
            if R_cov_i = 0 && C_cov_j = 0 && mv > S_{i,j} then
                mv ← S_{i,j}
            end
        end
    end
    for 1 ≤ i ≤ n do
        for 1 ≤ j ≤ n do
            if R_cov_i = 1 then
                S_{i,j} ← S_{i,j} + mv
            end
            if C_cov_j = 0 then
                S_{i,j} ← S_{i,j} − mv
            end
        end
    end
    SubProcedureOne()

Algorithm 17: FindStarCol.
    Data: col
    Result: i
    for 1 ≤ i ≤ n do
        if H_{i,col} = 1 then
            return i
        end
    end
    return −1

Algorithm 18: FindPrimeRow.
    Data: row
    Result: j
    for 1 ≤ j ≤ n do
        if H_{row,j} = 2 then
            return j
        end
    end
    return −1

References

Adomavicius, G., & Tuzhilin, A. (2005). Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6), 734–749.
Adomavicius, G., & Tuzhilin, A. (2015). Context-aware recommender systems. In Recommender systems handbook (pp. 191–226). Springer.
Amer-Yahia, S., Roy, S. B., Chawlat, A., Das, G., & Yu, C. (2009). Group recommendation: Semantics and efficiency. Proceedings of the VLDB Endowment, 2(1), 754–765.
Baltrunas, L., Makcinskas, T., & Ricci, F. (2010). Group recommendations with rank aggregation and collaborative filtering. In Proceedings of the fourth ACM conference on recommender systems (pp. 119–126). ACM.
Black, P. E., & Pieterse, V. (2005). Greedy algorithm. http://www.nist.gov/dads/HTML/greedyalgo.html. Accessed: 2016-05-29.
Bobadilla, J., Ortega, F., Hernando, A., & Bernal, J. (2012). Generalization of recommender systems: Collaborative filtering extended to groups of users and restricted to groups of items. Expert Systems with Applications, 39(1), 172–186.
Cantador, I., & Castells, P. (2012). Group recommender systems: New perspectives in the social web. In Recommender systems for the social web (pp. 139–157). Springer.
Castro, J., Quesada, F. J., Palomares, I., & Martínez, L. (2015). A consensus-driven group recommender system. International Journal of Intelligent Systems, 30(8), 887–906.
Chakraborty, M., Agrawal, H., Shekhar, H., & Chowdary, C. R. (2015). Contextual suggestion using tag-description similarity. In Proceedings of the Twenty-Fourth Text REtrieval Conference, TREC 2015, Gaithersburg, Maryland, USA, November 17–20, 2015.
Chang, S., Harper, F. M., & Terveen, L. (2015). Using groups of items for preference elicitation in recommender systems. In Proceedings of the 18th ACM conference on computer supported cooperative work & social computing, CSCW '15 (pp. 1258–1269). New York, NY, USA: ACM.
Christensen, I. A., & Schiaffino, S. (2011). Entertainment recommender systems for group of users. Expert Systems with Applications, 38(11), 14127–14135.
Gartrell, M., Xing, X., Lv, Q., Beach, A., Han, R., Mishra, S., & Seada, K. (2010). Enhancing group recommendation by incorporating social relationship interactions. In Proceedings of the 16th ACM international conference on supporting group work (pp. 97–106). ACM.
Ghazarian, S., & Nematbakhsh, M. A. (2015). Enhancing memory-based collaborative filtering for group recommender systems. Expert Systems with Applications, 42(7), 3801–3812.
GroupLens (2015). MovieLens 20M dataset. http://grouplens.org/datasets/movielens/20m/.
Jameson, A., & Smyth, B. (2007). The adaptive web (pp. 596–627). Berlin, Heidelberg: Springer-Verlag.
Kagita, V. R., Pujari, A. K., & Padmanabhan, V. (2015). Virtual user approach for group recommender systems using precedence relations. Information Sciences, 294, 15–30.
Kaššák, O., Kompan, M., & Bieliková, M. (2016). Personalized hybrid recommendation for group of users: Top-N multimedia recommender. Information Processing & Management, 52(3), 459–477.
Kompan, M., & Bieliková, M. (2014). Voting based group recommendation: How users vote. In Proc. of the 1st international workshop on social personalization, in conjunction with the 25th ACM conference on hypertext and social media.
Kuhn, H. W. (1955). The Hungarian method for the assignment problem. Naval Research Logistics Quarterly, 2, 83–97.
Lops, P., De Gemmis, M., & Semeraro, G. (2011). Content-based recommender systems: State of the art and trends. In Recommender systems handbook (pp. 73–105). Springer.
Masthoff, J. (2011). Group recommender systems: Combining individual models. In Recommender systems handbook (pp. 677–702). Springer.
McCarthy, K., Salamó, M., Coyle, L., McGinty, L., Smyth, B., & Nixon, P. (2006). Group recommender systems: A critiquing based approach. In Proceedings of the 11th international conference on intelligent user interfaces, IUI '06 (pp. 267–269). New York, NY, USA: ACM.
MovieLens (1998). GroupLens at University of Minnesota. http://grouplens.org/datasets/movielens/100k/.
MovieLens (2009). GroupLens at University of Minnesota. http://grouplens.org/datasets/movielens/10m/.
O'Connor, M., Cosley, D., Konstan, J. A., & Riedl, J. (2001). PolyLens: A recommender system for groups of users. In ECSCW 2001 (pp. 199–218). Springer.
Ortega, F., Bobadilla, J., Hernando, A., & Gutiérrez, A. (2013). Incorporating group recommendations to recommender systems: Alternatives and performance. Information Processing & Management, 49(4), 895–901.
Ortega, F., Hernando, A., Bobadilla, J., & Kang, J. H. (2016). Recommending items to group of users using matrix factorization based collaborative filtering. Information Sciences, 345, 313–324.
Pazzani, M. (1999). A framework for collaborative, content-based and demographic filtering. Artificial Intelligence Review, 13(5–6), 393–408.
Pera, M. S., & Ng, Y.-K. (2013). A group recommender for movies based on content similarity and popularity. Information Processing & Management, 49(3), 673–687.
Pilgrim, R. A. (2017). Munkres' assignment algorithm, course notes. http://csclab.murraystate.edu/~bob.pilgrim/445/munkres.html. [Online; accessed 08-Mar-2017].
Resnick, P., & Varian, H. R. (1997). Recommender systems. Communications of the ACM, 40(3), 56–58.
Ricci, F., Rokach, L., & Shapira, B. (2011). Introduction to recommender systems handbook. Springer.
Roy, S. B., Thirumuruganathan, S., Amer-Yahia, S., Das, G., & Yu, C. (2014). Exploiting group recommendation functions for flexible preferences. In IEEE 30th international conference on data engineering, ICDE 2014, Chicago, IL, USA (pp. 412–423).
Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2000). Analysis of recommendation algorithms for e-commerce. In Proceedings of the 2nd ACM conference on electronic commerce (pp. 158–167). ACM.
Schafer, J. B., Konstan, J. A., & Riedl, J. (2001). E-commerce recommendation applications. In Applications of data mining to electronic commerce (pp. 115–153). Springer.
Trewin, S. (2000). Knowledge-based recommender systems. Encyclopedia of Library and Information Science, 69(32), 180.
Villavicencio, C., Schiaffino, S., Diaz-Pace, J. A., & Monteserin, A. (2016). Advances in practical applications of scalable multi-agent systems. The PAAMS collection: 14th international conference, PAAMS 2016, Sevilla, Spain, June 1–3, 2016, proceedings (pp. 294–298). Cham: Springer International Publishing.