Poszerzanie comment do bardziej szczegółowej odpowiedzi tutaj: trzeba owinąć tolower
wewnątrz content_transformer
aby nie zepsuć obiekt VCorpus
- coś takiego:
> library(tm)
> data('crude')
> crude[[1]]$content
[1] "Diamond Shamrock Corp said that\neffective today it had cut its contract prices for crude oil by\n1.50 dlrs a barrel.\n The reduction brings its posted price for West Texas\nIntermediate to 16.00 dlrs a barrel, the copany said.\n \"The price reduction today was made in the light of falling\noil product prices and a weak crude oil market,\" a company\nspokeswoman said.\n Diamond is the latest in a line of U.S. oil companies that\nhave cut its contract, or posted, prices over the last two days\nciting weak oil markets.\n Reuter"
> tm_map(crude, content_transformer(tolower))[[1]]$content
[1] "diamond shamrock corp said that\neffective today it had cut its contract prices for crude oil by\n1.50 dlrs a barrel.\n the reduction brings its posted price for west texas\nintermediate to 16.00 dlrs a barrel, the copany said.\n \"the price reduction today was made in the light of falling\noil product prices and a weak crude oil market,\" a company\nspokeswoman said.\n diamond is the latest in a line of u.s. oil companies that\nhave cut its contract, or posted, prices over the last two days\nciting weak oil markets.\n reuter"
Co pakunek 'tm_map' z? Wydaje się, że zależy to od jakiegoś pakietu niebędącego pakietem podstawowym. Proszę rozważyć dołączenie instrukcji 'library' dla kompletności. –
@DanielKrizian: 'tm_map()' pochodzi z pakietu 'tm', a' tolower() 'pochodzi z' bazy' – smci