Detecting the sequence of letters in a string

Detecting the sequence of letters in a string - python

I've made a hasher. It works and it's really simple. I made one for fun and I thought that the code is way too long. It's over 1000 lines long and it's so simple. I just want to shorten it down.
Here's how I did the code:
wordorg = raw_input("Enter a word here: ")
## Checking if what you typed is correct
if len(wordorg) <= 10 and len(wordorg) > 1 and wordorg.isalpha():
## Comparison (JESUS THIS IS A LONG PIECE OF CODE)
print "Your original word was: " + wordorg
word = wordorg.lower()
if len(word) >= 1:
if word[0] == "a":
one = a
if word[0] == "b":
one = b
if word[0] == "c":
Bla bla bla, you get the idea, it goes like that. When it reaches Z
if word[0] == "z":
one = z
if len(word) >= 2:
if word[1] == "a":
And it goes on. My question is, how can I shorten my code?
EDIT:
The integers a, b, c are defined like this:
a = 2
b = 3
c = 5
and so on.

You could use a dict to split your line count by 26:
>>> import string
>>> translate = {l:i for i,l in enumerate(string.ascii_lowercase, 1)}
>>> translate
{'a': 1, 'b': 2, 'c': 3, 'd': 4, 'e': 5, 'f': 6, 'g': 7, 'h': 8, 'i': 9, 'j': 10, 'k': 11, 'l': 12, 'm': 13, 'n': 14, 'o': 15, 'p': 16, 'q': 17, 'r': 18, 's': 19, 't': 20, 'u': 21, 'v': 22, 'w': 23, 'x': 24, 'y': 25, 'z': 26}
Now, all you need is a dict lookup instead of 26 ifs:
>>> word='something'
>>> translate[word[0]]
19
>>> translate[word[1]]
15
You could replace the 1, 2, ... values with the ones defined as a, b, ... in your code.
If you want to do it for every letter, simply use a list comprehension:
>>> [translate[letter] for letter in word]
[19, 15, 13, 5, 20, 8, 9, 14, 7]
You now have a list of integers, ready for further processing!

I don't know where you go exactly from the code snippet you gave but I suggest to start from:
[1+ord(chr)-ord('a') for chr in wordorg]
ord is a function returning the ascii code of a character (a=97, b=98, etc.). So 1+ord(chr)-ord('a') will return 1 for 'a', 2 for 'b', etc.
It seems more interesting not to use a dictionary in your hash function, since a dictionary itself is a hash table.

I think this does what you're looking for. What I did is build a loop that runs over your word, so it can compare letter by letter. The second loop goes over the letters of the alphabet and if your word letter has a match, this is stored in the array results. This array counts the occurrences of each letter. If you want you can replace the print statement with writing to a file. There is also no need to restrict your code to run on short words anymore.
import string
alphabet = string.ascii_lowercase
results = [0] * len(alphabet) # array to count occurrences of letters
wordorg = raw_input("Input word here: ")
print alphabet
if wordorg.isalpha():
for i in range(len(wordorg)):
for j in range(len(alphabet)):
if (wordorg[i].find(alphabet[j])!=-1):
results[j] += 1
# print results
for i in range(len(alphabet)):
if (results[i]>0):
print "There are %d occurrences of the letter %s" %(results[i], alphabet[i])

Related

Trying to build a program in Python that sums the letters of a word based on the value of a letter

My project is looking at totaling the value of a word entered by a user based on assigned values for each letter. I'm struggling with getting the code to loop through each letter of the word and also sum the value for each letter. I'm still fairly new to Python so I'm unsure how much of it is correct or makes logical sense. Below is my code:
letter_values=[['a',-5],['b',2],['c',3],['d',4],['e',-10],['f',6],['g',7],['h',8],['i',-15],['j',10],['k',11],['l',12],['m',13],['n',14],['o',-20],['p',16],['q',17],['r',18],['s',19],['t',20],['u',-20],['v',22],['w',23],['x',24],['y',25],['z',26]]
word = input("Enter a word:")
for j in range(len(word)):
for i in range(len(letter_values)):
if letter_values[i][0] in(word):
word_value = letter_values[i][1]
break
word_total = sum(word_value)
print(word_total)
Any suggestions would be greatly appreciated!

You can use the python dictionaries, for example some like:
values = {"a": 1, "b":2, "c": 3, "d": 4}
You can access to the value of the letter 'a' using the key 'a':
print(values["a"])
An example:
values = {"a" : 1, "b" : 2, "c" : 3}
word = str(input("Enter a word:"))
total = 0
for letter in word:
total += values[letter]
print(total)

Your code is almost good. Your outer loop goes letter by letter in the word using j as an index. But j is never used in the inner loop. word_total should be outside the for loop. To fix, your code should look like.
for letter in word: # go letter by letter
for inner_list in letter_values:
# brute force check for against every entry in letter_values
if letter == inner_list[0] # compare to letter
word_value = inner_list[1]
break
word_total = sum(word_value)
I really like #BryanMontoyaOsorio's solution with dictionaries. It's elegant, easy to understand, and only a few lines of code.
The rest of this answer is a wonky way to generate a dictionary automatically. First Python has a built-in constant ascii_lowercase that contains all of the lowercase letters in the string module.
import string
print(string.ascii_lowercase)
'abcdefghijklmnopqrstuvwxyz'
And the random module can provide us with a list of values using the sample() method:
import random
value_list = random.sample(range(-26,26+1), 26)
print(value_list)
[-15, -26, 20, 9, -9, -23, -7, 1, -25, 18, 13, 0, 2, 15, -14, -20, 12, -2, -17, 21, -16, -6, 8, 14, -5, -13]
So to generate a dictionary:
import string
import random
valdict = {} # create empty dictionary
value_list = random.sample(range(-26,26+1), 26)
for i, letter in enumerate(string.ascii_lowercase):
valdict[letter] = value_list[i]
print(valdict)
{'a': 4, 'b': -17, 'c': 11, 'd': -11, 'e': 10, 'f': 1, 'g': -18, 'h': -4, 'i': -13, 'j': 9, 'k': -20, 'l': 25, 'm': 17, 'n': 14, 'o': -2, 'p': -1, 'q': 6, 'r': -5, 's': -21, 't': -26, 'u': -19, 'v': 24, 'w': -12, 'x': -25, 'y': -14, 'z': 20}

Character count in string

def charcount(stri):
for i in stri:
count = 0
for j in stri:
if stri[i] == stri[j]:
count += 1
I am new to python and currently learning string operations, can anyone tell me what is wrong in this program? The function tries to print a count of each character in given string.
For eg: string ="There is shadow behind you"
I want to count how many times each character have come in string

Counting characters in a string can be done with the Counter() class like:
Code:
from collections import Counter
def charcount(stri):
return Counter(stri)
print(charcount('The function try to print count of each character '
'in given string . Please help'))
Results:
Counter({' ': 14, 'e': 7, 'n': 7, 't': 7, 'c': 5, 'i': 5,
'r': 5, 'h': 4, 'o': 4, 'a': 4, 'f': 2, 'u': 2,
'p': 2, 'g': 2, 's': 2, 'l': 2, 'T': 1, 'y': 1,
'v': 1, '.': 1, 'P': 1})

Feedback on code:
In these lines:
for i in stri:
count = 0
for j in stri:
The outer loop is looping over each character in stri, and the inner loop is looping over every character in stri. This is like a Cartesian product of the elements in the list, and is not necessary here.
Secondly, in this line:
if stri[i] == stri[j]:
You are accessing stri by its indices, but i and j are not indices, they are the characters themselves. So treating them as indices does not work here, since characters are not valid indices for lists. If you wanted to access just the indices, you could access them with range(len()):
for i in range(len(stri)):
count = 0
for j in range(len(stri)):
if stri[i] == stri[j]:
Or if you want to access the elements and their indices, you can use enumerate().
Having said this, your approach is too complicated and needs to be redone. You need to group your characters and count them. Using nested loops is overkill here.
Alternative approaches:
There are lots of better ways to do this such as using collections.Counter() and dictionaries. These data structures are very good for counting.
Since it also looks like your struggling with loops, I suggest going back to the basics, and then attempt doing this problem with a dictionary.

This is what you need to do. Iterate through the input string and use a hash to keep track of the counts. In python, the basic hash is a dictionary.
def charCounter(string):
d = {} # initialize a new dictionary
for s in string:
if s not in d:
d[s] = 1
else:
d[s] += 1
return d
print charCounter("apple")
# returns {'a': 1, 'p': 2, 'e': 1, 'l': 1}

Just little modification in your solution
first you are looping wrong:-
Take a look:-
def charcount(stri):
d = {}
for i in stri:
if i in d:
d[i] = d[i] +1
else:
d[i] = 1
return d
print (charcount("hello")) #expected outpu

Counting each characters in a string
>>> from collections import Counter
>>> string ="There is shadow behind you"
>>> Counter(string)
Counter({' ': 4, 'h': 3, 'e': 3, 'i': 2, 's': 2, 'd': 2, 'o': 2, 'T': 1, 'r':
1, 'a': 1, 'w': 1, 'b': 1, 'n': 1, 'y': 1, 'u': 1})

If you don't want to use any import :
def charcount(string):
occurenceDict = dict()
for char in string:
if char not in occurenceDict:
occurenceDict[char] = 1
else :
occurenceDict[char] += 1
return(occurenceDict)

You can use the following code.
in_l = ','.join(str(input('Put a string: '))).split(',')
d1={}
for i in set(in_l):
d1[i] = in_l.count(i)
print(d1)

public class Z {
public static void main(String[] args) {
int count=0;
String str="aabaaaababa";
for(int i=0;i<str.length();i++) {
if(str.charAt(i)=='a') {
count++;
}
}
System.out.println(count);
}
}

Sum function is not iterable

score = {
'a': 1, 'b': 3, 'c': 3, 'd': 2, 'e': 1, 'f': 4,
'g': 2, 'h': 4, 'i': 1, 'j': 8, 'k': 10, 'l': 1,
'm': 2, 'n': 1, 'o': 1, 'p': 3, 'q': 8, 'r': 1,
's': 1, 't': 1, 'u': 1, 'v': 4, 'w': 10, 'x': 10,
'y': 10, 'z': 10,
}
x = input('Enter a Word: ')
y = list(x)
a = []
for i in x:
z = int(score[i])
a = sum(z)
print(a)
a= sum(z) is keep on saying it is "int' object is not iterable".
What did I do wrong?

I am guessing you want a to store the sum of the letters in the word. If so, what you are doing is wrong. sum() function expects an iterable, and then it takes the sum by adding each element of that iterable and returns the sum. Example -
>>> sum([1,2,3,4])
10
It does not work like you are expecting it to.
In your case , you do not even need to use sum, simply do -
a = 0 #before the loop
and
a += z #inside the loop
And also you should indent the print a outisde the for loop, so that you only print the final sum of letters.

As said #Amadan in the comments of #AnandSKumar answer, you could do the following :
a = sum(score[c] for c in x)
Supposing you are still learning Python, here is a explication of what does the above code.
First as explained #AnandSKumar, the sum built-in function takes as parameter an iterable. What is an iterable ? It's an object that implement the __iter__ method which allows to iterate over the data of the object. The best example is a list object. Considere the following code :
my_list = ['a', 'b', 'c']
try:
while True:
print next(my_list)
except StopIteration:
print "The end."
# Result :
a
b
c
The end.
We iterate over the list my_list. The sum function could/would looks like this in Python implementation :
def sum(iterable):
iterator = iter(iterable)
result = next(iterator, 0)
try:
while True:
result += next(iterator)
except StopIteration:
pass
return result
A good approche of what you are doing could be to create a list as following, and the sum it :
a = [score[c] for c in x]
print sum(a)
# Equivalent code (one liner) :
print sum([score[c] for c in x])
# Equivalent code :
a = []
for c in x:
a.append(c)
print sum(a)
The thing is that you generate entirely your list a and it takes memory. The approche of what #Amadan said is to generate dynamically the list a using a generator expression. The idea is to create an iterable, which has no representation in memory.
print sum(score[c] for c in x)
# Equivalent code :
result = 0
for c in x:
result += score[c]
print result
This is more memory efficient and looks like you're a god of Python :)
The one-liner of what you are trying to do :
# Python 3
print sum(score[c] for c in input("Enter a word: "))
# Python 2
print sum(score[c] for c in raw_input("Enter a word: "))

Python count/dictionary count

dct = {}
with open("grades_single.txt","r") as g:
content = g.readlines()[1].strip('\n')
for item in content:
dct[item] = content.count(item)
LetterA = max(dct.values())
print(dct)
I'm very new to python so please excuse me. This is my code so far and it works but not as it's intended to. I'm trying to count the frequency off certain letters on new lines so I can do a mathematical function with each letter. The program counts all the letters and prints them but I'd like to be able to count each letter one by one I.E 7As, new fuction 4Bs etc.
At the moment the program is printing them off in one function but yeah I'd like to split them up so I can work with each letter one by one. {'A': 9, 'C': 12, 'B': 19, 'E': 4, 'D': 5, 'F': 1}
Does anyone know how to count the frequency of each letter by letter?
ADCBCBBBADEBCCBADBBBCDCCBEDCBACCFEABBCBBBCCEAABCBB
Example of what I'd like to count.

>>> from collections import Counter
>>> s = "ADCBCBBBADEBCCBADBBBCDCCBEDCBACCFEABBCBBBCCEAABCBB"
>>> Counter(s)
Counter({'B': 19, 'C': 14, 'A': 7, 'D': 5, 'E': 4, 'F': 1})

collections.Counter is clean, but if you were in a hurry, you could iterate over all of the elements and place them into a dictionary yousrelf.
s = 'ADCBCBBBADEBCCBADBBBCDCCBEDCBACCFEABBCBBBCCEAABCBB'
grades = {}
for letter in s:
grades[letter] = grades.get(letter, 0) + 1

How to return the number of characters whose frequency is above a threshold

How do I print the number of upper case characters whose frequency is above a threshold (in the tutorial)?
The homework question is:
Your task is to write a function which takes as input a single non-negative number and returns (not print) the number of characters in the tally whose count is strictly greater than the argument of the function. Your function should be called freq_threshold.
My answer is:
mobyDick = "Blah blah A　B C A RE."
def freq_threshold(threshold):
tally = {}
for char in mobyDick:
if char in tally:
tally[char] += 1
else:
tally[char] = 1
for key in tally.keys():
if key.isupper():
print tally[key],tally.keys
if threshold>tally[key]:return threshold
else:return tally[key]
It doesn't work, but I don't know where it is wrong.

Your task is to return number of characters that satisfy the condition. You're trying to return count of occurrences of some character. Try this:
result = 0
for key in tally.keys():
if key.isupper() and tally[key] > threshold:
result += 1
return result
You can make this code more pythonic. I wrote it this way to make it more clear.

The part where you tally up the number of each character is fine:
>>> pprint.pprint ( tally )
{' ': 5,
'.': 1,
'A': 2,
'B': 2,
'C': 1,
'E': 1,
'R': 1,
'a': 2,
'b': 1,
'h': 2,
'l': 2,
'\x80': 2,
'\xe3': 1}
The error is in how you are summarising the tally.
Your assignment asked you to print the number of characters occurring more than n times in the string.
What you are returning is either n or the number of times one particular character occurred.
You instead need to step through your tally of characters and character counts, and count how many characters have frequencies exceeding n.

Do not reinvent the wheel, but use a counter object, e.g.:
>>> from collections import Counter
>>> mobyDick = "Blah blah A B C A RE."
>>> c = Counter(mobyDick)
>>> c
Counter({' ': 6, 'a': 2, 'B': 2, 'h': 2, 'l': 2, 'A': 2, 'C': 1, 'E': 1, '.': 1, 'b': 1, 'R': 1})

from collections import Counter
def freq_threshold(s, n):
cnt = Counter(s)
return [i for i in cnt if cnt[i]>n and i.isupper()]
To reinvent the wheel:
def freq_threshold(s, n):
d = {}
for i in s:
d[i] = d.get(i, 0)+1
return [i for i in d if d[i]>n and i.isupper()]

We Keep Coding

Python is a programming language that lets you work quickly and integrate systems more effectively.

Detecting the sequence of letters in a string - python

Related

Trying to build a program in Python that sums the letters of a word based on the value of a letter

Character count in string

Sum function is not iterable

Python count/dictionary count

How to return the number of characters whose frequency is above a threshold

Categories

Resources