Abstract
In this article we address the task of cross-lingual sentiment lexicon learning, which aims to automatically generate sentiment lexicons for target languages with the help of available English sentiment lexicons. We formalize the task as a learning problem on a bilingual word graph, in which the intra-language relations among the words in the same language and the inter-language relations between the words in different languages are properly represented. With the words in the English sentiment lexicon as seeds, we propose a bilingual word graph label propagation approach to induce the sentiment polarities of the unlabeled words in the target language. In particular, we show that both synonym and antonym word relations can be used to build the intra-language relations, and that the word alignment information derived from bilingual parallel sentences can be effectively leveraged to build the inter-language relations. The evaluation on Chinese sentiment lexicon learning shows that the proposed approach outperforms existing approaches in both precision and recall. Experiments conducted on the NTCIR data set further demonstrate the effectiveness of the learned sentiment lexicon in sentence-level sentiment classification.
1. Introduction
A sentiment lexicon is regarded as the most valuable resource for sentiment analysis (Pang and Lee 2008) and lays the groundwork for much sentiment analysis research, for example, sentiment classification (Yu and Hatzivassiloglou 2003; Kim and Hovy 2004) and opinion summarization (Hu and Liu 2004). To avoid manually annotating sentiment words, automatically learning sentiment lexicons has attracted considerable attention in the sentiment analysis community. Existing work determines word sentiment polarities either from statistical information (e.g., the co-occurrence of words with predefined sentiment seed words) derived from a large corpus (Riloff, Wiebe, and Wilson 2003; Hu and Liu 2006) or from word semantic information (e.g., synonym relations) found in existing human-created resources such as WordNet (Takamura, Inui, and Okumura 2005; Rao and Ravichandran 2009). However, current work mainly focuses on English sentiment lexicon generation or expansion, whereas sentiment lexicon learning for other languages has not been well studied.
In this article, we address the issue of cross-lingual sentiment lexicon learning, which aims to generate sentiment lexicons for a non-English language (hereafter referred to as “the target language”) with the help of the available English sentiment lexicons. The underlying motivation of this task is to leverage the existing English sentiment lexicons and substantial linguistic resources to label the sentiment polarities of the words in the target language. To this end, we need an approach to transferring the sentiment information from English words to the words in the target language. The few existing approaches first build word relations between English and the target language and then, based on these word relations and English sentiment seed words, determine the sentiment polarities of the words in the target language. In these two steps, relation building plays a fundamental role because it is responsible for the transfer of sentiment information between the two languages. Two approaches are often used in the literature to connect the words in different languages. One is based on translation entries in cross-lingual dictionaries (Hassan et al. 2011). The other relies on a machine translation (MT) engine as a black box to translate the English sentiment words into the target language (Steinberger et al. 2011). As observed in Duh, Fujino, and Nagata (2011) and Mihalcea, Banea, and Wiebe (2007), both approaches tend to cover only a small vocabulary of the natural language, which leads to low coverage of the generated sentiment lexicons for the target language.
To solve this problem, we propose a generic approach to the task of cross-lingual sentiment lexicon learning. Specifically, we model this task with a bilingual word graph, which is composed of two intra-language subgraphs and an inter-language subgraph. The intra-language subgraphs model the semantic relations among the words in the same language. When building them, we incorporate both synonym and antonym word relations in a novel manner, represented by positive and negative sign weights in the subgraphs, respectively. These two intra-language subgraphs are then connected by the inter-language subgraph. We propose Bilingual word graph Label Propagation (BLP), which simultaneously takes the inter-language relations and the intra-language relations into account in an iterative way. Moreover, we leverage the word alignment information derived from a parallel corpus to build the inter-language relations: we connect two words from different languages if they are aligned to each other in a parallel sentence pair. Taking advantage of a large parallel corpus, this approach significantly improves the coverage of the generated sentiment lexicon. The experimental results on Chinese sentiment lexicon learning show the effectiveness of the proposed approach in terms of both precision and recall. We further evaluate the impact of the learned sentiment lexicon on sentence-level sentiment classification: when the words in the learned lexicon are used as classification features, sentiment classification in the target language achieves high performance.
We make the following contributions in this article.
1. We present a generic approach to automatically learning sentiment lexicons for the target language with the available sentiment lexicon in English, and we formalize the cross-lingual sentiment lexicon learning task on a bilingual word graph.
2. We build a bilingual word graph by using synonym and antonym word relations and propose a bilingual word graph label propagation approach, which effectively leverages the inter-language relations and both types (synonym and antonym) of the intra-language relations in sentiment lexicon learning.
3. We leverage the word alignment information derived from a large number of parallel sentences in sentiment lexicon learning. We build the inter-language relation in the bilingual word graph upon word alignment, and achieve significant improvements in coverage.
2. Related Work
2.1 English Sentiment Lexicon Learning
In general, the work on sentiment lexicon learning focuses mainly on English and can be categorized as co-occurrence–based approaches (Hatzivassiloglou and McKeown 1997; Riloff, Wiebe, and Wilson 2003; Qiu et al. 2011) and semantic-based approaches (Mihalcea, Banea, and Wiebe 2007; Takamura, Inui, and Okumura 2005; Kim and Hovy 2004).
The co-occurrence-based approaches determine the sentiment polarity of a given word according to statistical information, such as the co-occurrence of the word with predefined sentiment seed words or with product features. The statistical information is mainly derived from certain corpora. One of the earliest works, by Hatzivassiloglou and McKeown (1997), assumes that conjunction words convey the polarity relation of the two words they connect. For example, the conjunction and tends to link two words with the same polarity, whereas the conjunction but is likely to link two words with opposite polarities. Their approach considers only adjectives, not nouns or verbs, and it is unable to extract adjectives that are not conjoined by conjunctions. Riloff et al. (2003) define several pattern templates and extract sentiment words with two bootstrapping approaches. Turney and Littman (2003) calculate the pointwise mutual information (PMI) of a given word with positive and negative sets of sentiment seed words; the sentiment polarity of the word is determined by comparing its average PMI values with the positive and the negative sets. To estimate PMI, they issue queries (consisting of the given word and a seed word) to a search engine; the number of hits and the proximity of the given word to the seed word are used to estimate the association between the two. Hu and Liu (2004) study sentiment word learning on customer reviews under the assumption that sentiment words tend to be correlated with product features. Frequent nouns and noun phrases are treated as product features, and adjectives are extracted as sentiment words from sentences that contain one or more product features. This approach may work on a product review corpus, where product features appear frequently, but for other corpora, such as news articles, it may not be effective. Qiu et al. (2011) combine sentiment lexicon learning and opinion target extraction: a double propagation approach learns sentiment words and extracts opinion targets simultaneously, based on eight manually defined rules.
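For concreteness, the semantic orientation score of Turney and Littman (2003) described above can be sketched as follows (our notation; their paper additionally uses a NEAR operator and smoothing of the hit counts):

```latex
\mathrm{SO}(w) \;=\; \sum_{p \in P} \mathrm{PMI}(w, p) \;-\; \sum_{n \in N} \mathrm{PMI}(w, n),
\qquad
\mathrm{PMI}(w, s) \;=\; \log_2 \frac{p(w, s)}{p(w)\,p(s)}
```

where P and N are the positive and negative seed sets and the probabilities are estimated from search-engine hit counts.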
The semantic-based approaches determine the sentiment polarity of a given word according to word semantic relations, such as the synonyms of sentiment seed words. The word semantic relations are usually obtained from dictionaries, for example, WordNet. Kim and Hovy (2004) assume that the synonyms of a positive (negative) word are positive (negative) and its antonyms are negative (positive). Initializing with a set of sentiment words, they expand sentiment lexicons based on these two kinds of word relations. Kamps et al. (2004) build a synonym graph according to the synonym relations (synsets) derived from WordNet. The sentiment polarity of a word is calculated from its shortest-path distances to the two sentiment words good and bad. However, the shortest path cannot precisely describe the sentiment orientation, considering there are only five steps between the word good and the word bad in WordNet (Hassan et al. 2011). Takamura et al. (2005) construct a word graph from the glosses of WordNet: two words are connected if one appears in the gloss of the other. The word sentiment polarity is determined by the weights of its connections on the word graph. Based on WordNet, Rao and Ravichandran (2009) exploit several graph-based semi-supervised learning methods, such as mincuts and label propagation. The word polarity orientations are induced by initializing some sentiment seed words in the WordNet graph. Esuli et al. (2006, 2007) and Baccianella et al. (2010) treat sentiment word learning as a machine learning problem, that is, classifying the polarity orientations of the words in WordNet. They select seven positive words and seven negative words and expand them through the see-also and antonym relations in WordNet. These expanded words are then used for training. They train a ternary classifier to predict the sentiment polarities of all the words in WordNet, using the glosses (textual definitions of the words in WordNet) as classification features. The generated sentiment lexicon is the well-known SentiWordNet.
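The shortest-path measure of Kamps et al. (2004) mentioned above can be written as follows (a sketch in our notation, with d(·,·) the geodesic distance in the synonym graph):

```latex
\mathrm{EVA}(w) \;=\; \frac{d(w, \mathit{bad}) - d(w, \mathit{good})}{d(\mathit{good}, \mathit{bad})}
```

so that EVA(w) lies in [−1, 1], with positive values indicating an orientation toward good.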
2.2 Cross-Lingual Sentiment Lexicon Learning
The work on cross-lingual sentiment lexicon learning is still at an early stage and can be categorized into two types, according to how they bridge the words in two languages.
Mihalcea et al. (2007) generate a sentiment lexicon for Romanian by directly translating the English sentiment words into Romanian through bilingual English–Romanian dictionaries. When a sentiment word translates to a multiword expression, they translate it word by word, and a translation is validated only if it occurs at least three times on the Web. The approach proposed by Hassan et al. (2011) learns sentiment words based on the English WordNet and the WordNets of the target languages (e.g., Hindi and Arabic). Cross-lingual dictionaries are used to connect the words in the two languages, and the polarity of a given word is determined by the average hitting time from the word to the English sentiment word set. These approaches connect words in two languages based on cross-lingual dictionaries. The main concern with these approaches is the effect of morphological inflection (i.e., a word may be mapped to multiple words in cross-lingual dictionaries). For example, one single English word typically has four Spanish or Italian word forms (two each for gender and number) and many Russian word forms (due to gender, number, and case distinctions) (Steinberger et al. 2011). Usually, these approaches require an additional process to disambiguate the sentiment polarities of all the morphological variants.
To improve sentiment classification for the target language, Banea, Mihalcea, and Wiebe (2010) translate the English sentiment lexicon into the target language using Google Translator. Similarly, Google Translator is used by Steinberger et al. (2011). They manually produce two high-level gold-standard sentiment lexicons for two languages (e.g., English and Spanish) and then translate them into a third language (e.g., Italian) via Google Translator. They believe that the words in the third language that appear in both translation lists are likely to be sentiment words. These approaches connect the words in two languages based on MT engines. The main concern with these approaches is the low overlap between the vocabulary of natural documents and the vocabulary of documents translated by MT engines (Duh, Fujino, and Nagata 2011; Meng et al. 2012a). This shortcoming inevitably leads to low coverage of the generated lexicons.
Our task resembles the task of cross-lingual sentiment classification, as in Wan (2009), Lu et al. (2011), and Meng et al. (2012a), which classifies the sentiment polarities of product reviews. Generally, these studies use semi-supervised learning approaches and regard translations of labeled English sentiment reviews as the training data. The terms in each review are leveraged as the features for training, which has proven to be effective in sentiment classification (Pang and Lee 2008). We can regard the task of sentiment lexicon learning as word-level sentiment classification. However, for word-level sentiment classification, it is not straightforward to extract features for a single word, and without sufficient features it is difficult for these approaches to perform well. Another line of cross-lingual sentiment classification uses Latent Dirichlet Allocation (LDA) (Blei, Ng, and Jordan 2003) or its variants, as in Boyd-Graber and Resnik (2010) and He, Alani, and Zhou (2010). These studies assume that each review is a mixture of sentiments and each sentiment is a probability distribution over words, and they apply an LDA-like approach to model the sentiment polarity of each review. Nonetheless, this assumption may not be applicable in sentiment lexicon learning because a single word can be regarded as the minimal semantic unit, and it is difficult, if not impossible, to infer latent topics from a single word. Note that, unlike the sentiment classification of product reviews, where the instances are normally independent, the words in sentiment lexicon learning are highly related to each other, for example, through synonym and antonym relations. Through these relations, the words naturally form a word graph. We therefore use a graph-based learning approach to leverage these word relations in sentiment lexicon learning. In the next section, we introduce our proposed graph-based cross-lingual sentiment lexicon learning.
3. Cross-Lingual Sentiment Lexicon Learning
In this work, we model the task of cross-lingual sentiment lexicon learning with a bilingual word graph, where (1) the words in the two languages are represented by the nodes in two intra-language subgraphs, respectively; (2) the synonym and antonym word relations within each language are represented by the positive and negative sign weights in the corresponding intra-language subgraphs; and (3) the two intra-language subgraphs are connected by an inter-language subgraph. Mathematically, we build a graph $G$ that consists of two intra-language subgraphs $G_E = (X_E, W_E, W_E^-)$ and $G_T = (X_T, W_T, W_T^-)$, as shown in Figure 1. These two subgraphs are connected by the inter-language graph $G_A = (X_E, X_T, W_A)$. The elements of $W_E$, $W_T$, and $W_A$ are positive real numbers, that is, $W_E, W_T, W_A \in \mathbb{R}^+$, and the elements of $W_E^-$ and $W_T^-$ are negative, that is, $W_E^-, W_T^- \in \mathbb{R}^-$. Because $G$ incorporates the words in two languages, we call it a Bilingual Word Graph. Specifically, the positive weights, $W_E$ and $W_T$, represent the synonym intra-language relations, and the negative weights, $W_E^-$ and $W_T^-$, represent the antonym intra-language relations. The inter-language relations, $W_A$, represent the connections between the words in the two languages. For cross-lingual sentiment lexicon learning, $X_E$ denotes the labeled and unlabeled words in English and $X_T$ denotes the unlabeled words in the target language. Given the labels of the seed words in $X_E$, we aim to predict the sentiment polarities of the words in $X_T$. In the remainder of this section, we will present the bilingual word graph construction and the algorithm of bilingual word graph label propagation.
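To make the construction concrete, the weight structure of the bilingual word graph can be pictured in block form (our notation, not necessarily the paper's exact layout; in the propagation the synonym and antonym parts are treated separately, as described in Section 3.2):

```latex
W \;=\;
\begin{pmatrix}
W_E + W_E^- & W_A \\
W_A^{\top} & W_T + W_T^-
\end{pmatrix}
```

where the diagonal blocks carry the signed intra-language relations and the off-diagonal blocks carry the inter-language relations.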
3.1 Bilingual Word Graph Building With Parallel Corpus and Word Alignment
We represent the words in English and in the target language as the nodes of the bilingual word graph. We use the synonym and antonym relations among the words in the same language to build the relations $W_E$, $W_T$ and $W_E^-$, $W_T^-$ in the intra-language subgraphs, respectively (a code sketch follows below). In the rest of this section, we will focus on how to build the inter-language relation.
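Before turning to the inter-language relation, the intra-language step can be sketched as follows. The sketch uses NLTK's WordNet interface, which is our assumption (the paper does not name a toolkit), and `vocab` stands for a hypothetical word list of one language:

```python
# A minimal sketch of building the signed intra-language relations from
# WordNet (using NLTK here, which is our assumption; the paper does not
# name a toolkit). `vocab` is a hypothetical list of words in one language.
from collections import defaultdict

from nltk.corpus import wordnet as wn  # requires nltk.download('wordnet')

def build_intra_language_relations(vocab):
    """Return synonym (positive) and antonym (negative) edge weights."""
    vocab = set(vocab)
    synonym_edges = defaultdict(float)   # entries of W
    antonym_edges = defaultdict(float)   # entries of W^-
    for word in vocab:
        for synset in wn.synsets(word):
            for lemma in synset.lemmas():
                other = lemma.name()
                if other != word and other in vocab:
                    synonym_edges[(word, other)] = 1.0   # same synset
                for ant in lemma.antonyms():
                    if ant.name() in vocab:
                        antonym_edges[(word, ant.name())] = -1.0  # signed
    return synonym_edges, antonym_edges
```

Running the same routine against a synset resource for the target language would yield $W_T$ and $W_T^-$.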
Intuitively, there are two ways to connect the words in two languages. One is to insert links between words if there exist entry mappings between them in bilingual dictionaries (e.g., an English–Chinese dictionary). This method is simple and straightforward, but it suffers from two limitations: (1) dictionaries are static during a certain period, whereas the sentiment lexicon evolves over time; and (2) the entries in dictionaries tend to be expressions of formal, written language, but people prefer colloquial language when expressing their sentiments or opinions online. These limitations lead to low coverage of the links from English to the target language. An alternative way is to use an MT engine as a black box to build the inter-language relation. One can send each English word to a publicly available MT engine and get its translations in the target language. Edges can then be inserted into the graph between the English words and their corresponding translations. This approach suffers from the problem of low coverage as well, because MT engines tend to use a small vocabulary (Duh, Fujino, and Nagata 2011).
In this article, we propose to leverage a large bilingual parallel corpus, which is readily available in the MT research community, to build the bilingual word graph. The parallel corpus consists of a large number of parallel sentence pairs from two different languages, and such corpora have served as the foundation of the state-of-the-art statistical MT engines. As in the example shown in Figure 2, the two sentences in English and Chinese are parallel sentences, which express the same meaning in different languages. We can automatically derive the word alignment from the sentence pairs using a state-of-the-art toolkit, such as GIZA++ or BerkeleyAligner. In this example, the Chinese word meaning happy is linked to the English word happy, and we say that these two words are aligned. Similarly, the English words best and wishes are both aligned to the Chinese word meaning wish.
The word alignment information encodes the rich association information between the words from the two languages. We are therefore motivated to leverage the parallel corpus and word alignment to build the bilingual word graph for cross-lingual sentiment lexicon learning. We take the words from both languages in the bilingual parallel corpus as the nodes in the bilingual word graph, and build the inter-language relations by connecting the two words that are aligned together in a sentence pair from a parallel corpus. There are several advantages of using a parallel corpus to build the inter-language subgraph. First, large parallel corpora are extensively used for training statistical MT engines and can be easily reused in our task. The parallel sentence pairs are usually automatically collected and mined from the Web; as a result, they contain the diverse, real-world variations of the words and phrases embedded in sentiment expressions. Second, the parallel corpus can be dynamically updated when necessary because it is relatively easy to collect from the Web. Consequently, the novel sentiment information inferred from the parallel corpus can easily update the existing sentiment lexicons. These advantages can greatly improve the coverage of the generated sentiment lexicon, as demonstrated later in our experiments.
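To make this concrete, here is a minimal sketch of building $W_A$ from word-aligned parallel data. It assumes Pharaoh-style `i-j` alignment lines (the output format of common aligners such as BerkeleyAligner) and normalizes alignment counts per English word, matching the normalized alignment frequencies used later in Section 4.1; the file layout and names are our assumptions:

```python
# A minimal sketch of building the inter-language relation W_A from
# word-aligned parallel sentences. Each line of the alignment file is
# assumed to hold Pharaoh-style "i-j" pairs; file names are illustrative.
from collections import defaultdict

def build_inter_language_relations(src_file, tgt_file, align_file):
    counts = defaultdict(float)
    with open(src_file, encoding='utf-8') as fs, \
         open(tgt_file, encoding='utf-8') as ft, \
         open(align_file, encoding='utf-8') as fa:
        for src_line, tgt_line, align_line in zip(fs, ft, fa):
            src, tgt = src_line.split(), tgt_line.split()
            for pair in align_line.split():
                i, j = map(int, pair.split('-'))
                counts[(src[i], tgt[j])] += 1.0   # alignment frequency
    # Normalize per source word so that its outgoing weights sum to one,
    # matching the normalized alignment frequency used in Section 4.1.
    totals = defaultdict(float)
    for (e, c), n in counts.items():
        totals[e] += n
    return {(e, c): n / totals[e] for (e, c), n in counts.items()}
```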
3.2 Bilingual Word Graph Label Propagation
As commonly used semi-supervised approaches, label propagation (Zhu and Ghahramani 2002) and its variants (Zhu, Ghahramani, and Lafferty 2003; Zhou et al. 2004) have been applied to many applications, such as part-of-speech tagging (Das and Petrov 2011; Li, Graca, and Taskar 2012), image annotation (Wang, Huang, and Ding 2011), and protein function prediction (Jiang 2011; Jiang and McQuay 2012). The underlying idea of label propagation is that the connected nodes in the graph tend to share the same labels. In bilingual word graph label propagation, words tend to share the same sentiment labels if they are connected by synonym relations or word alignment, and tend to take opposite sentiment labels if they are connected by antonym relations.
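The propagation step itself can be sketched as follows. This is a minimal sketch in the spirit of Zhou et al. (2004), not necessarily the paper's Equations (3) and (4): each word carries a (positive, negative) score pair, synonym and alignment neighbors pass their scores through unchanged, and antonym neighbors swap the two scores; the weighting of the terms is our assumption.

```python
# A minimal sketch of one BLP iteration (our formulation, not necessarily
# Equations (3)-(4) of the paper). F is an n x 2 matrix of (positive,
# negative) scores; S_syn, S_ant, S_align are row-normalized weight
# matrices, with S_ant holding the magnitudes of the antonym weights
# (their negative sign is realized by swapping the two score columns);
# Y clamps the English seed labels; mu follows Zhou et al. (2004).
import numpy as np

def blp_step(F, S_syn, S_ant, S_align, Y, mu=0.1, rho_syn=1.0, rho_ant=1.0):
    spread = (
        S_align @ F                        # aligned words share labels
        + rho_syn * (S_syn @ F)            # synonyms share labels
        + rho_ant * (S_ant @ F[:, ::-1])   # antonyms swap the two scores
    ) / (1.0 + rho_syn + rho_ant)
    alpha = 1.0 / (1.0 + mu)
    return alpha * spread + (1.0 - alpha) * Y  # pull back toward the seeds
```

Iterating `blp_step` until the scores converge and labeling each target-language word by its larger score mirrors the procedure used in the experiments that follow.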
4. Experiment
4.1 Data Sets
We conduct experiments on Chinese sentiment lexicon learning. As in previous work (Baccianella, Esuli, and Sebastiani 2010), the sentiment words in the General Inquirer (GI) lexicon (Stone 1997) are selected as the English seeds. From the GI lexicon we collect 2,005 positive words and 1,635 negative words. To build the bilingual word graph, we adopt a Chinese–English parallel corpus obtained from the news articles published by Xinhua News Agency in its Chinese and English collections, using the automatic parallel sentence identification approach of Munteanu and Marcu (2005). Altogether, we collect more than 25M parallel sentence pairs in English and Chinese. We remove all the stopwords in Chinese and English (e.g., the Chinese particle meaning of and the English am), together with the low-frequency words that occur fewer than 5 times. After preprocessing, we finally have more than 174,000 English words, among which 3,519 have sentiment labels, and more than 146,000 Chinese words for which we need to predict the sentiment labels. To transfer sentiment information to the unlabeled Chinese words more efficiently, we remove the unlabeled English words from the word graph. The unsupervised BerkeleyAligner (Liang, Taskar, and Klein 2006) is used to align the parallel sentences in this article. As an unsupervised method, it requires neither manually collected training data nor a complex training process, and its performance is competitive with supervised methods. With these two advantages, we can focus on our task of cross-lingual sentiment lexicon learning. Based on the word alignment derived by BerkeleyAligner, the inter-language relation $W_A$ is initialized with the normalized alignment frequencies. The English and Chinese versions of WordNet are used to build the intra-language relations $W_E$, $W_E^-$, $W_T$, and $W_T^-$, respectively. WordNet (Miller 1995) groups words into synonym sets, called synsets. We collect about 117,000 synsets from the English WordNet and about 80,000 synsets from the Chinese WordNet, and obtain 8,406 and 6,312 antonym synset pairs from them, respectively.
We first generate both positive and negative scores for each unlabeled word and then determine the word's sentiment polarity based on these scores. We rank the two sets of newly labeled sentiment words according to their polarity scores. The top-ranked Chinese words are shown in Table 1. We manually label the top-ranked 1K sentiment words. For P@10K, we sequentially divide the top 10K ranked list into ten equal parts, and one hundred sentiment words are randomly selected from each part for labeling. Similar to the evaluation of TREC Blog Distillation (Ounis, Macdonald, and Soboroff 2008), all the labeled words from each approach are used in the evaluation. We then evaluate the ranked lists with two metrics, Precision@K and Recall.
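For reference, the two metrics can be sketched as follows, with `judgments` mapping a manually labeled word to True or False and `n_relevant` denoting the pooled number of correct sentiment words across all compared approaches (both names are our own):

```python
# A minimal sketch of the two evaluation metrics. `ranked` is a list of
# predicted sentiment words ordered by polarity score; `judgments` maps a
# word to its manual True/False label; `n_relevant` is the pooled number
# of correct sentiment words across all compared approaches.
def precision_at_k(ranked, judgments, k):
    top = ranked[:k]
    return sum(judgments.get(w, False) for w in top) / k

def recall(ranked, judgments, n_relevant):
    found = sum(judgments.get(w, False) for w in ranked)
    return found / n_relevant
```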
| Meaning | Polarity | Meaning | Polarity |
|---|---|---|---|
| good | positive | disaster | negative |
| correct | positive | tragedy | negative |
| useful | positive | dangerous | negative |
| smart | positive | harm | negative |
| happy | positive | fault | negative |
| reliable | positive | rage | negative |
| accurate | positive | fail | negative |
| happy | positive | damage | negative |
| optimistic | positive | sore | negative |
| loyal | positive | clash | negative |
4.2 Evaluation of the Bilingual Word Graph
In this set of experiments, we examine the influence of graph topologies on sentiment lexicon learning.
Mono: This approach learns the Chinese sentiment lexicon based only on the Chinese monolingual word graph $G_T = (X_T, W_T, W_T^-)$. Because it needs labeled sentiment words, we incorporate the English labeled sentiment words $X_E$ and the inter-language relation $W_A$ in the first iteration, and then set $X_E$ and $W_A$ to zero in later iterations.
BLP-WOA (bilingual word graph without antonym): This approach is based on the bilingual word graph. It involves only the inter-language relation $W_A$ and the synonym intra-language relations $W_E$ and $W_T$; the antonym relations $W_E^-$ and $W_T^-$ are set to zero.
BLP: This approach is based on the bilingual word graph. It incorporates the inter-language relation $W_A$, the synonym intra-language relations $W_E$ and $W_T$, and the antonym intra-language relations $W_E^-$ and $W_T^-$.
In these approaches, μ is set to 0.1, as in Zhou et al. (2004). The precision of these approaches is shown in Figure 3. The figure shows that the approaches based on the bilingual word graph significantly outperform the one based on the monolingual word graph. The bilingual word graph brings in more word relations and accelerates the sentiment propagation. Besides, in the bilingual word graph, the English sentiment seed words can continually provide accurate sentiment information. Thus we observe an increase for the approaches based on the bilingual word graph in terms of both precision and recall (Table 2). Meanwhile, we find that adding the antonym relations to the bilingual word graph slightly enhances precision among the top-ranked words, and similar findings are observed in our later experiments. It appears that the antonym relations depict word relations in a more accurate way and can refine the word sentiment scores more precisely. However, the synonym relations and word alignment relations dominate the graph, whereas the antonym relations account for only a small percentage of it. The antonym relations can hardly introduce new relations into the graph and thus cannot help to further improve recall.
4.3 Comparison with Baseline and Existing Approaches
In this set of experiments, we compare our approach with the baseline and existing approaches.
Rule: For the intra-language relation, this approach assumes that the synonyms of a positive (negative) word are always positive (negative), and the antonyms of a positive (negative) word are always negative (positive). For the inter-language relation, we regard the Chinese word aligned to positive (negative) English words as positive (negative). If a word connects to both positive and negative English words, we regard it as objective. Based on this heuristic, we generate two sets of sentiment words.
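The inter-language part of this heuristic can be sketched as follows (the intra-language synonym/antonym part is omitted); `alignments`, mapping each Chinese word to the set of English words it aligns with, is our own notation:

```python
# A minimal sketch of the inter-language part of the Rule baseline: a
# Chinese word aligned only to positive (negative) English seeds is
# labeled positive (negative); mixed evidence yields "objective".
def rule_baseline(alignments, pos_seeds, neg_seeds):
    labels = {}
    for zh_word, en_words in alignments.items():
        hits_pos = bool(en_words & pos_seeds)
        hits_neg = bool(en_words & neg_seeds)
        if hits_pos and not hits_neg:
            labels[zh_word] = 'positive'
        elif hits_neg and not hits_pos:
            labels[zh_word] = 'negative'
        elif hits_pos and hits_neg:
            labels[zh_word] = 'objective'   # conflicting evidence
    return labels
```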
We show the recall of the learned Chinese sentiment words in Table 3. Compared with BLP and SOP (a shortest-path-based approach), the Rule approach learns fewer sentiment words. The coverage of the Rule approach is inevitably low because many words in the corpus are aligned to both positive and negative words. For example, in most cases the positive Chinese word meaning helpful is aligned to the positive English word helpful, but sometimes it is aligned (or misaligned) to negative English words, like freak. In this situation, the word tends to be predicted as objective. In SOP, the positive and negative scores are related to the distances of the word to the positive and negative seed words, and such distances are usually too coarse-grained to depict the sentiment polarity; for example, the shortest path between the word good and the word bad in WordNet is only 5 (Kamps et al. 2004). The Rule and SOP approaches thus find different sentiment words. We then evaluate the learned Chinese polarity word lists by precision at k. As illustrated in Figure 4, the significance test indicates that our approach significantly outperforms the Rule and SOP approaches. The major difference of our approach is that the polarity information can be transferred between English and Chinese and within each language at the same time, whereas in the other two approaches the polarity information mainly transfers from English to Chinese, and once a word gets a polarity score, it is difficult to change or refine. The idea of the MAD (Modified Adsorption) approach is similar to bilingual word graph label propagation, but MAD fails to leverage the antonym intra-language relation. We observe that the MAD approach can achieve a result comparable to the BLP approach. MAD can obtain smoother label scores by adding a dummy label, but the dummy label does not influence the sentiment labels much because it is not used in determining the word sentiment polarity. Besides, MAD cannot deal with the antonym relation. As a result, these experiments demonstrate the overall superiority of our approach in cross-lingual sentiment lexicon learning and indicate the effectiveness of the BLP approach in Chinese sentiment lexicon learning.
4.4 Evaluation of the Inter-Language Relation
This set of experiments examines different ways to build the inter-language relation.
BLP-dict: The inter-language relation is built upon the translation entries from the LDC and Universal Dictionary (UD) resources. From these dictionaries (both English–Chinese and Chinese–English), we collect 41,034 translation entries between English and Chinese words. If the English word $x_i$ can be translated to the Chinese word $x_j$ in the dictionaries, $w_A(i,j)$ and $w_A(j,i)$ are set to 1.
BLP-MT: All the Chinese (English) words are translated into English (Chinese) by Google Translator. If the Chinese word $x_i$ can be translated to the English word $x_j$, $w_A(i,j)$ and $w_A(j,i)$ are set to 1. If a Chinese word is translated to an English phrase, we assume that the Chinese word is projected to each word in the English phrase. To improve the coverage, we translate the English sentiment seed words with three other methods, namely word collocation, coordinated phrase, and punctuation, as mentioned in Meng et al. (2012b).
The learned Chinese sentiment word lists are also evaluated with precision at k. As shown in Figure 5, the alignment-based approach outperforms the dictionary-based and MT-based approaches. The reason is that we can build more inter-language relations based on word alignment than from the translation entries of the dictionaries or the translation pairs of Google Translator. For example, the English word move is often translated by dictionaries or MT engines to the Chinese words meaning shift and affect/touch. In the parallel sentences, besides these translation pairs, the word move can also be aligned to a Chinese expression meaning plain sailing / bon voyage, which is commonly used in Chinese greeting texts. This translation entry is hard to find in dictionaries or by MT engines, but the two words may be forced to align in parallel sentence pairs such as good luck and best wishes on your career move and its Chinese counterpart. Thus, when building the inter-language relations with word alignment, our approach is likely to learn more sentiment word candidates. This is also the reason why the dictionary-based and MT-based approaches learn fewer sentiment words than our approach, as indicated in Table 4. According to our statistics, a Chinese word is connected to 2.3 and 2.1 English words on average when we build the inter-language relations with the dictionaries and Google Translator, respectively. By building the inter-language relation with word alignment, our approach connects a Chinese word to 16.21 English words on average, which greatly increases the coverage of the learned sentiment lexicon.
4.5 Evaluation of the Intra-Language Relation
The following set of experiments reveals the influence of the intra-language relation.
BLP-A: As the baseline of this set of experiments, this approach does not build the intra-language relations with either the English or the Chinese WordNet synsets. Only the inter-language relation built from word alignment is used to construct the graph; that is, $W_E$, $W_E^-$, $W_T$, and $W_T^-$ are defined as zero matrices.
BLP-AE: Word alignment and the English WordNet synsets are used to build the intra-English relation $W_E$, but the intra-Chinese relations $W_T$ and $W_T^-$ are set to zero matrices.
BLP-AC: Word alignment and the Chinese WordNet synsets are used to build the intra-Chinese relation $W_T$, but the intra-English relations $W_E$ and $W_E^-$ are set to zero matrices.
As Figure 6 shows, when both the English and Chinese intra-language relations are combined, the precision curves of both positive and negative predictions increase. This indicates that adding the intra-language relations has a positive influence. The improvement can be explained by the ability of the intra-language relations to refine the polarity scores. For example, the English word sophisticated can be aligned to a positive Chinese word meaning delicate as well as to a negative Chinese word meaning wily or wicked. In the GI lexicon, the English word sophisticated is labeled as positive. In a bilingual word graph that contains only the inter-language relations, the negative Chinese word is likely to be mislabeled as positive. With the intra-language relations, however, the negative Chinese word may connect to other negative Chinese words, such as one meaning foxy, and the positive Chinese word may connect to other positive Chinese words, such as one meaning elaborate. Thus the polarity score of each word can be refined by the intra-language relations in each iteration of propagation. Another advantage of the intra-language relations is that they help to reduce the noise introduced by the inter-language relation. For example, a positive Chinese word meaning help is sometimes misaligned to the negative English word freak through the inter-language relation, but it is also connected to its positive synonyms (words meaning help and salutary) through the intra-language relations, which adjust its polarity score. Thus, although the inter-language relation brings in some noisy alignments, the intra-language relations help to correct the affected polarity scores.
4.6 Sensitivity of Parameter ρ1∼4
ρ1 and ρ2 in Equation (3) tune the English and Chinese synonym intra-language propagation, while ρ3 and ρ4 in Equation (4) adjust the English and Chinese antonym intra-language propagation. For simplicity, we let ρ1 equal ρ2 and ρ3 equal ρ4, and tune ρ1,2 and ρ3,4 together. When ρ1,2 and ρ3,4 range over {1e−2, 1e−1, 1, 10, 100, 1000}, Precision@1K ranges from 0.631 to 0.689 and Recall ranges from 0.651 to 0.729 on average. In general, we obtain better results when 1 ≤ ρ3,4 < ρ1,2 ≤ 10.
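The tuning procedure amounts to a small grid search, sketched below; `evaluate_blp` is a hypothetical helper that would run BLP with the given weights and return (Precision@1K, Recall):

```python
# A minimal sketch of the grid search over rho_{1,2} and rho_{3,4};
# the evaluate_blp callable is hypothetical and stands in for a full
# train-and-evaluate run returning (Precision@1K, Recall).
from itertools import product

def tune(evaluate_blp):
    grid = [1e-2, 1e-1, 1, 10, 100, 1000]
    scores = {(rs, ra): evaluate_blp(rs, ra)
              for rs, ra in product(grid, grid)}
    return max(scores, key=lambda pair: scores[pair][0])  # best by P@1K
```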
4.7 Evaluation on Sentiment Classification
Sentiment classification is one of the most extensively studied tasks in the community of sentiment analysis (Pang and Lee 2008). To see whether the performance improvement in lexicon learning also improves the results of sentiment classification, we apply the generated Chinese sentiment lexicons to sentence-level sentiment classification.
Data set: The NTCIR sentiment-labeled corpus is used for sentiment classification (Seki et al. 2008, 2009). We extract the Chinese sentences that have positive, negative, or neutral labels. The numbers of extracted sentences are shown in Table 5. The learned sentiment words in the Mono and BLP approaches are used as classification features. We implement the following baselines for comparison.
| | Positive | Negative | Neutral |
|---|---|---|---|
| sentence number | 1,218 | 944 | 528 |
BSL_DF: The Chinese word unigrams and bigrams are extracted from the NTCIR data set as features. We rank the features according to their frequencies and gradually increase the value of N for the Top-N classification features.
BSL_LF: The words in existing Chinese sentiment lexicons are used as features. A total of 836 positive words and 1,254 negative words are collected from HowNet.
We use LibSVM and perform 10-fold cross-validation on the NTCIR polarity sentences. The accuracies over the number of features N are plotted in Figure 7. Our approach achieves a very promising improvement, even though the features and the sentences to be classified come from different corpora. This suggests that the generated sentiment lexicon is adaptive and of sufficient quality for sentiment classification.
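For reference, here is a minimal sketch of this setup, assuming binary presence features over the lexicon words and scikit-learn's libsvm-backed SVC in place of the LibSVM command-line tools; the data variables are placeholders for the NTCIR sentences and labels:

```python
# A minimal sketch of the sentence-level classification setup: lexicon
# words become binary presence features, scored by 10-fold cross-
# validation with an SVM (scikit-learn's SVC wraps libsvm). `sentences`
# (token lists) and `labels` stand in for the NTCIR data.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

def lexicon_features(sentences, lexicon):
    vocab = sorted(lexicon)
    index = {w: i for i, w in enumerate(vocab)}
    X = np.zeros((len(sentences), len(vocab)))
    for row, tokens in enumerate(sentences):
        for tok in tokens:
            if tok in index:
                X[row, index[tok]] = 1.0   # binary presence feature
    return X

def evaluate(sentences, labels, lexicon):
    X = lexicon_features(sentences, lexicon)
    return cross_val_score(SVC(kernel='linear'), X, labels, cv=10).mean()
```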
5. Conclusions and Future Work
In this article, we studied the task of cross-lingual sentiment lexicon learning. We built a bilingual word graph with the words in two languages and connected them with the inter-language and intra-language relations. We proposed a bilingual word graph label propagation approach to transduce the sentiment information from English sentiment words to the words in the target language. The synonym and antonym relations among the words in the same language are leveraged to build the intra-language relations, and word alignment derived from a large parallel corpus is used to build the inter-language relations. Experiments on Chinese sentiment lexicon learning demonstrate the effectiveness of the proposed approach. There are three main conclusions from this work. First, the bilingual word graph is suitable for sentiment information transfer, and the proposed approach can iteratively improve the precision of the generated sentiment lexicon. Second, building the inter-language relations with a large parallel corpus can significantly improve the coverage. Third, by incorporating the antonym relations into the bilingual word graph, the BLP approach achieves a further improvement in precision. In the future, we will explore expanding or generating sentiment lexicons for multiple languages by bootstrapping.
Acknowledgments
The work described in this article was supported by a Hong Kong RGC project (PolyU no. 5202/12E) and the National Natural Science Foundation of China (NSFC no. 61272291).
Author notes
Contribution during internship at Microsoft Research (Beijing).
Furu Wei, Xiaohua Liu, and Ming Zhou are from Microsoft Research, Beijing, China. E-mail: [email protected], [email protected], [email protected].
Dehong Gao and Wenjie Li are from the Department of Computing, the Hong Kong Polytechnic University. E-mail: [email protected], [email protected].