Cloud Commit and german umlauts cut into pieces

Cloud Commit and german umlauts cut into pieces

DisasterArea
Hallo,

I successfully introduced Subversion in our development team one or two years ago and went looking for a statistics tool. StatSVN is simple and quick to install and configure, and we run it nightly on our server.

But the cloud commit is not working correctly. It has problems with German umlauts and splits words into pieces, cutting each word at the position of the umlaut.
We run svn log from a batch script. I fixed the umlauts in the XML output of svn log by adding

SET LC_MESSAGES=de_DE

before the call to svn log. The umlauts are now correct in the svn log output, but StatSVN still cuts the words into pieces. I think it's a bug. Or do you have any advice on configuring Java to make it work?
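One way to narrow this down is to check whether the umlaut survives XML parsing at all. The following is only a minimal sketch: the class name `SvnLogXmlCheck` and the sample log entry are made up for illustration, and it uses the JDK's built-in DOM parser, which is not necessarily the parser StatSVN uses.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

// Hypothetical helper for diagnosis; not part of StatSVN.
class SvnLogXmlCheck {

    // Parses the XML bytes and returns the text of the first <msg> element.
    // The parser picks the charset from the XML declaration, not from the JVM default.
    static String firstMessage(byte[] xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml));
            return doc.getElementsByTagName("msg").item(0).getTextContent();
        } catch (Exception e) {
            throw new IllegalStateException("could not parse log XML", e);
        }
    }

    public static void main(String[] args) {
        // Made-up sample in the shape of `svn log --xml` output; \u00f6 is "ö".
        String xml = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
                + "<log><logentry revision=\"1\"><author>dev</author>"
                + "<msg>pl\u00f6tzlich kaputt</msg></logentry></log>";
        String msg = firstMessage(xml.getBytes(StandardCharsets.UTF_8));
        // true means the umlaut came through the XML layer intact
        System.out.println(msg.equals("pl\u00f6tzlich kaputt"));
    }
}
```

If the same check passes on the real svn.log, the XML layer is probably fine and the cutting happens later, when the commit messages are split into words.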

By the way, this project does not seem very active. Are you still developing it and heading for version 1.0, or is the project dead?

Re: Cloud Commit and german umlauts cut into pieces

jkealey@lavablast.com
Sounds similar to an issue here:

I think the user resolved the issue by changing the charset on the generated HTML files. 

The project isn't dead but all the core maintainers are busy working in their own startups. New blood would be welcomed :)

Thanks,
---
Jason Kealey, Ing. - [hidden email]
LavaBlast Franchise Software - http://www.lavablast.com
Simplifying day-to-day franchise operations


On Wed, May 18, 2011 at 9:25 AM, DisasterArea <[hidden email]> wrote:
View this message in context: http://statsvn.1260.n2.nabble.com/Cloud-Commit-and-german-umlauts-cut-into-pieces-tp6377791p6377791.html
Sent from the StatSVN Users mailing list archive at Nabble.com.

_______________________________________________
Statsvn-users mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/statsvn-users



Re: Cloud Commit and german umlauts cut into pieces

DisasterArea
Thanks Jason for your advice,

but it doesn't work. It only changes the encoding of the generated web pages, not the way StatSVN internally parses the svn log file and counts the words.

Let me describe it in more detail. Say you have a word, e.g. "plötzlich", with an umlaut at the third position (pl<umlaut>tzlich). StatSVN cuts this word, and in the cloud commit you'll find the word "tzlich" counted, say, 34 times. So StatSVN first cuts, then counts, but it should not cut! It should count the word "plötzlich".
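A cut like this is what an ASCII-only word splitter produces. The sketch below is an assumption about the kind of code that could cause it, not StatSVN's actual implementation; the class `UmlautTokenizerDemo` and its methods are hypothetical. In Java, `\W` treats every non-ASCII letter as a separator unless `Pattern.UNICODE_CHARACTER_CLASS` is enabled, whereas `\p{L}` matches letters in any script.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo class; not part of StatSVN.
class UmlautTokenizerDemo {

    // ASCII-only splitting: \W matches anything outside [a-zA-Z_0-9],
    // so an umlaut acts as a word separator.
    static List<String> splitAscii(String text) {
        return nonEmpty(text.split("\\W+"));
    }

    // Unicode-aware splitting: only runs of characters that are neither
    // letters (\p{L}) nor decimal digits (\p{Nd}) separate words.
    static List<String> splitUnicode(String text) {
        return nonEmpty(text.split("[^\\p{L}\\p{Nd}]+"));
    }

    // Drops the empty strings that split() can return at the start of the input.
    private static List<String> nonEmpty(String[] parts) {
        List<String> words = new ArrayList<>();
        for (String part : parts) {
            if (!part.isEmpty()) {
                words.add(part);
            }
        }
        return words;
    }

    public static void main(String[] args) {
        String message = "pl\u00f6tzlich kaputt"; // \u00f6 is "ö"
        System.out.println(splitAscii(message));   // [pl, tzlich, kaputt]
        System.out.println(splitUnicode(message)); // keeps the first word whole
    }
}
```

If a word cloud additionally drops very short words, only "tzlich" would remain visible, which would match the symptom described above.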

Maybe this problem cannot be solved with batch scripts on Windows, which run in a DOS environment. Maybe one has to use VBScript on Windows.
I did further tests in DOS mode, setting the code page to 65001:
DOS>chcp 65001
DOS> svn log ...>svn.log
DOS>java ..statsvn.jar ...svn.log

but this doesn't work either.
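One possible explanation, stated here only as an assumption, is a charset mismatch: the log file is written as UTF-8 but read back with a single-byte charset. The sketch below shows the effect in isolation. The class `CharsetMismatchDemo` is hypothetical, and ISO-8859-1 stands in for a Windows code page because it is guaranteed to exist on every JVM.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of StatSVN.
class CharsetMismatchDemo {

    // Encodes the text as UTF-8 and decodes the bytes with another charset,
    // which is what happens when a UTF-8 file is read with the wrong setting.
    static String decodeUtf8BytesAs(String text, Charset readerCharset) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, readerCharset);
    }

    public static void main(String[] args) {
        String word = "pl\u00f6tzlich"; // \u00f6 is "ö"
        String garbled = decodeUtf8BytesAs(word, StandardCharsets.ISO_8859_1);

        // The two UTF-8 bytes of the umlaut turn into two separate characters.
        System.out.println(word.length());    // 9
        System.out.println(garbled.length()); // 10

        // The second of them (U+00B6, the pilcrow sign) is not a letter,
        // so even a Unicode-aware word splitter would cut the word there.
        System.out.println(Character.isLetter(garbled.charAt(3))); // false

        // The charset Java falls back to when none is given explicitly.
        System.out.println(Charset.defaultCharset());
    }
}
```

If StatSVN relied on the platform default when reading the file, starting Java with the `-Dfile.encoding=UTF-8` system property might change the result, but whether that applies here is untested.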

So to be clear: is this a problem with StatSVN and how it parses the input, or a problem with the DOS environment?

Re: Cloud Commit and german umlauts cut into pieces

jkealey@lavablast.com
I wouldn't be surprised if it is a bug in StatSVN or the XML parser we're using.


Feel free to tackle the issue and propose a patch for StatSVN. 

Thanks,
---
Jason Kealey, Ing. - [hidden email]
LavaBlast Franchise Software - http://www.lavablast.com
Simplifying day-to-day franchise operations


On Thu, May 19, 2011 at 2:36 AM, DisasterArea <[hidden email]> wrote:
View this message in context: http://statsvn.1260.n2.nabble.com/Cloud-Commit-and-german-umlauts-cut-into-pieces-tp6377791p6380817.html
Sent from the StatSVN Users mailing list archive at Nabble.com.


Re: Cloud Commit and german umlauts cut into pieces

DisasterArea
Hallo Jason,

it's been some time since I posted here, but our team has now been using Subversion for a couple of years and this bug is getting annoying.
So I will try to fix it. Do you have any advice, or a document describing how to set up a local development environment? I downloaded the source code and it seems I have to use Eclipse. But how? I have Eclipse Helios installed for PHP development.

So: how do I set up the environment, import the project and, later, commit my changes?